python判断网页编码的三种方式

来源：互联网发布：淘宝买东西寄到国外编辑：程序博客网时间：2024/06/05 08:07

python判断网页编码的三种方式

一，使用urllib模块的getparam方法 #有时不准确

>import urllib>fopen1 = urllib.urlopen('http://www.baidu.com').info()>print fopen1.getparam('charset')# baidu1
2
3
1
2
3

二，使用chardet模块

>import chardet >import urllib>#先获取网页内容>data1 = urllib.urlopen('http://www.baidu.com').read()>#用chardet进行内容分析>chardit1 = chardet.detect(data1)>print chardit1['encoding'] # baidu1
2
3
4
5
6
7
1
2
3
4
5
6
7

三，利用BeautifulSoup模块方法

>from bs4 import BeautifulSoup>import urllib2>content=urllib2.urlopen(url)#这里url是你需要获取的网页>soup=BeautifulSoup(content)>print soup.original_encoding #这里的输出就是网页的编码方式

0 0

python判断网页编码的三种方式
判断网页的编码方式 python
Python 查看网页编码方式
python中判断文件是否存在的三种方式
python自动化获取网页编码方式
三种编码方式
网页嵌入flash 的三种方式
python_urllib2下载网页的三种方式
网页下载器的三种方式
response设置编码的三种方式
response设置编码的三种方式
response设置编码的三种方式
三种视频编码的方式
java base64编码的三种方式
Python爬虫系列之----Scrapy(五)网页提取的三种方式(正则,Beautiful Soup,Lxml)
文本文件的编码方式判断
判断文件的编码方式
python登录网页的两种方式
关于httpservletRequest碰到的一个问题
深入理解计算机系统之整型与浮点型
C++深拷贝和浅拷贝
vs编译后不拷贝
浅谈ListView、RecycleView、GridView的使用方法步骤和效果区别.TXT
python判断网页编码的三种方式
Android简介
洛谷 P1001 A+B Problem（学会改变——向C++进发！）
Servlet中用Cookie实现浏览商品的过程
android smali 之HelloWorld
查找算法中的概念（排序树和散列表）
pyhton批量修改指定路径下面的文件夹名字
微信支付快速集成
DB2删除完数据之后，如何释放LOB字段占用的空间