【NLP】UnicodeDecodeError: 'ascii' codec can't decode byte 0xd1 in position 74752: ordinal not in rang
来源:互联网 发布:三巨网络面试如何 编辑:程序博客网 时间:2024/06/01 16:12
这几天在用python进行自然语言处理,遇到了一个编码的问题:UnicodeDecodeError: 'ascii' codec can't decode byte 0xd1 in position 74752: ordinal not in range(128)。谷歌和百度了相关的关键词都没有找到很好的解决方案,最后自己调试出解决方法。解决方法如下:
1.看编辑器给出的错误提示:'ascii' codec can't decode byte 0xd1,意思是要被decode(解码)的字符为0xd1,应该是utf8编码的,不能用ascii来解码,应该用utf8来解码;
2.看你发生错误的需要解码或者编码的字符串,使用 chardet 对字符串进行编码检测,然后针对字符串的编码进行相应的解码和编码。
>>> import chardet>>> chardet.detect(rawdata){'confidence': 0.98999999999999999, 'encoding': 'GB2312'}>>>
chardet的安装方法如下:
从http://pypi.python.org/pypi/chardet下载chardet后,解压chardet压缩包,直接将里面的chardet文件夹放在应用程序目录下,就可以使用import chardet开始使用chardet了。
- 【NLP】UnicodeDecodeError: 'ascii' codec can't decode byte 0xd1 in position 74752: ordinal not in rang
- python: UnicodeDecodeError: 'ascii' codec can't decode byte 0xe2 in position 20: ordinal not in rang
- 【转】UnicodeDecodeError: 'ascii' codec can't decode byte 0xe5 in position 4: ordinal not in rang
- python 解决UnicodeDecodeError: ‘ascii’ codec can’t decode byte 0xe9 in position 9: ordinal not in rang
- UnicodeDecodeError: 'ascii' codec can't decode byte 0xe8 in position 11: ordinal not in range
- UnicodeDecodeError: 'ascii' codec can't decode byte 0xe4 in position 0: ordinal not in range(128)
- UnicodeDecodeError: 'ascii' codec can't decode byte 0xb2 in position 0: ordinal not in range的解决办法
- UnicodeDecodeError: 'ascii' codec can't decode byte 0xe4 in position 0: ordinal not in range(128)
- Python报错UnicodeDecodeError: ‘ascii’ codec can’t decode byte 0xe0 in position 0: ordinal not in range
- 解决UnicodeDecodeError: 'ascii' codec can't decode byte 0xe9 in position 0: ordinal not in range(128)
- UnicodeDecodeError: 'ascii' codec can't decode byte 0xe5 in position 0: ordinal not in range(128)
- UnicodeDecodeError: 'ascii' codec can't decode byte 0xe4 in position 0: ordinal not in range(128)
- UnicodeDecodeError : 'ascii' codec can't decode byte 0xe0 in position 0: ordinal not in range(128)
- UnicodeDecodeError: 'ascii' codec can't decode byte 0xe5 in position 0: ordinal not in range(128)
- UnicodeDecodeError: 'ascii' codec can't decode byte 0xe2 in position 0: ordinal not in range(128)
- xlwt:UnicodeDecodeError: 'ascii' codec can't decode byte 0xb2 in position 0: ordinal not in range(12
- UnicodeDecodeError: 'ascii' codec can't decode byte 0xe5 in position 0: ordinal not in range(128)
- xlutils写入中文报错:UnicodeDecodeError: 'ascii' codec can't decode byte 0xb8 in position 0: ordinal not in
- HighCharts
- 详解SPI中的极性CPOL和相位CPHA
- 用IO流实现文件复制功能
- HASH JOIN ,MERGE JOIN ,NESTED LOOP 比较
- 画图
- 【NLP】UnicodeDecodeError: 'ascii' codec can't decode byte 0xd1 in position 74752: ordinal not in rang
- 十六周上机任务(在字符串后增加符号)
- TestBatch(26)
- 下一个倒下的将是苹果?
- linux下生成C程序函数列表
- MFC之MessageBox用法
- 第28节 命名空间的流程
- 项目构建工具 Maven
- 网站压力测试工具