UnicodeDecodeError: 'gbk' codec can't decode byte 0x9d in position 1793: illegal multibyte sequence
Source: Internet  Published by: 趣玩网络  Editor: 程序博客网  Time: 2024/05/21 09:01
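How to fix UnicodeDecodeError: 'gbk' codec can't decode byte 0x9d in position 1793: illegal multibyte sequence
I have recently been writing an RNN for text generation, and this is a summary of the file-reading problems I ran into along the way. Without further ado, the code is as follows: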
```python
#!/usr/bin/python
# -*- coding: utf-8 -*-
import numpy
from keras.models import Sequential
from keras.layers import Dense
from keras.layers import Dropout
from keras.layers import LSTM
from keras.callbacks import ModelCheckpoint
from keras.utils import np_utils

# Problem line: no encoding is passed to open(), so the file is decoded
# with the system default codec instead of UTF-8.
raw_text = open(u'F:/深度学习资料/自然语言处理班/自然语言处理-8课时/6/DLinNLP/DLinNLP/input/Winston_Churchil.txt').read().decode('utf-8')
raw_text = raw_text.lower()
print(raw_text)
```
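### Error

UnicodeDecodeError: 'gbk' codec can't decode byte 0x9d in position 1793: illegal multibyte sequence

On a Chinese-locale Windows system, Python 3's open() decodes text files with the locale's default encoding (GBK) unless told otherwise, while this file is UTF-8. The fix is to pass the encoding explicitly when opening the file and to drop the .decode() call, for example:

```python
raw_text = open(u'F:/深度学习资料/自然语言处理班/自然语言处理-8课时/6/DLinNLP/DLinNLP/input/Winston_Churchil.txt', encoding='UTF-8').read()
```

2. OSError: Initializing from file failed

Problem code: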
```python
# encoding: UTF-8
import os
import pandas as pd
import numpy as np
from sklearn.metrics import roc_auc_score
from datetime import date

data = pd.read_csv("F:/深度学习资料/自然语言处理班/自然语言处理-8课时/6/DLinNLP/DLinNLP/input/Combined_News_DJIA.csv")
print(data.head())
```
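Error: OSError: Initializing from file failed

Solution: change into the file's directory and read the CSV by its bare file name, so that pandas is not handed the path containing Chinese characters: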
```python
import os
import pandas as pd

csv_path = "F:/深度学习资料/自然语言处理班/自然语言处理-8课时/6/DLinNLP/DLinNLP/input/Combined_News_DJIA.csv"
# Switch to the containing directory, then open the file by name only.
os.chdir(os.path.dirname(csv_path))
data = pd.read_csv(os.path.basename(csv_path))
print(data.head())
```
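
Returning to the first error: the mismatch between a UTF-8 file and a GBK decoder can be reproduced on any platform, which may help when checking a fix. The sketch below is only an illustration under stated assumptions: `read_text` and `make_sample_file` are hypothetical helper names, the sample sentence stands in for the real Winston_Churchil.txt, and GBK is requested explicitly to imitate the default of a Chinese-locale Windows system.

```python
import os
import tempfile


def read_text(path, encoding="utf-8"):
    """Read a whole text file, decoding it with an explicit encoding."""
    with open(path, encoding=encoding) as f:
        return f.read()


def make_sample_file():
    """Write a small UTF-8 file containing a curly quote (bytes E2 80 9D)."""
    fd, path = tempfile.mkstemp(suffix=".txt")
    with os.fdopen(fd, "w", encoding="utf-8") as f:
        f.write("We shall never surrender\u201d \n")
    return path


sample_path = make_sample_file()
try:
    # Imitate the failing setup: decode the UTF-8 bytes as GBK.
    try:
        read_text(sample_path, encoding="gbk")
        gbk_failed = False
    except UnicodeDecodeError as exc:
        gbk_failed = True
        print("GBK read failed:", exc)

    # The fix from this post: name the encoding explicitly.
    raw_text = read_text(sample_path).lower()
    print(raw_text)
finally:
    os.remove(sample_path)
```

Naming the encoding at every `open()` call keeps the script independent of the machine's locale. For the CSV case, `pd.read_csv` also accepts an already opened file object, so opening the file yourself with an explicit encoding and passing the handle may be another way to avoid the path problem; that variant is not tested here.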