Python2.7:UnicodeDecodeError :'gb2312' codec can't decode bytes:illegal multibyte sequence
来源:互联网 发布:2015伊戈达拉数据 编辑:程序博客网 时间:2024/05/23 02:00
Python版本:2.7
IDE:Pycharm2017
报错原因:爬虫一些古老的页面时,解码编码为UTF-8时发生乱码情况,使用GB2312解码进行UTF-8编码时爆发异常,无法完成编码。查询页面原始编码还恰好为GB2312。一头雾水之下开始百度,发现页面中如果少量包含GB2312之外的字符也是可以的,需要使用GB18030去解码,然后编码成UTF-8。具体代码如下:
string.decode('GB18030').encode('utf-8')
本文参照:Junkichan的博客
阅读全文
0 0
- Python2.7:UnicodeDecodeError :'gb2312' codec can't decode bytes:illegal multibyte sequence
- 【python问题解决】UnicodeDecodeError :'gb2312' codec can't decode bytes:illegal multibyte sequence
- UnicodeDecodeError: 'gb2312' codec can't decode bytes in position 2-3: illegal multibyte sequence、
- UnicodeDecodeError: 'gbk' codec can't decode bytes in position 12-13: illegal multibyte sequence
- 【UnicodeDecodeError: '' codec can't decode bytes in position : illegal multibyte sequence】
- UnicodeDecodeError: ‘XXX’ codec can’t decode bytes in position 2-5: illegal multibyte sequence
- Python中遇到"UnicodeDecodeError: ‘gbk’ codec can’t decode bytes in position 0: illegal multibyte
- UnicodeDecodeError: 'gb2312' codec can't decode byte 0x88 in position 164111: illegal multibyte sequ
- UnicodeDecodeError: 'gbk' codec can't decode byte 0xfd in position 3952: illegal multibyte sequence
- UnicodeDecodeError: 'gbk' codec can't decode byte 0xaf in position 683: illegal multibyte sequence
- UnicodeDecodeError: 'gbk' codec can't decode byte 0xab in position 11126: illegal multibyte sequence
- UnicodeDecodeError: 'gbk' codec can't decode byte 0xfe in position 45: illegal multibyte sequence
- UnicodeDecodeError: 'gbk' codec can't decode byte 0xab in position 11126: illegal multibyte sequence
- UnicodeDecodeError: 'gbk' codec can't decode byte 0xae in position 199: illegal multibyte sequence
- UnicodeDecodeError: 'gbk' codec can't decode byte 0x80 in position 205: illegal multibyte sequence
- UnicodeDecodeError: 'gbk' codec can't decode byte 0xa4 in position 18: illegal multibyte sequence
- python编译报错:UnicodeDecodeError: ‘gbk’ codec can’t decode: illegal multibyte sequence
- UnicodeDecodeError: 'gbk' codec can't decode byte 0x9d in position 1793: illegal multibyte sequence
- Android:TabLayout向上滑动停留页面顶部
- Wireshark
- 前端开发中最常用的8个npm技巧
- Android平台Camera实时滤镜实现方法探讨(八)--滤镜基本制作方法(二)简单美颜滤镜
- go实现命令行的工具cli
- Python2.7:UnicodeDecodeError :'gb2312' codec can't decode bytes:illegal multibyte sequence
- 结构体联合体的字节对齐问题详解:
- 拷贝构造函数的相关
- linux设备驱动模型架构分析(一)——概述
- 菱形继承(虚函数)->菱形虚拟继承(虚函数)->多态系列问题
- Linux下查看编辑二进制文件
- 读写properties文件
- JAVA微信公众号开发之公众号内H5调微信支付
- #2 定义模型