python网页提取

来源：互联网发布：国民党真实抗战知乎编辑：程序博客网时间：2024/06/06 04:42

#!/usr/bin/python
# -*- coding: utf-8 -*-
#encoding=utf-8
#Filename:urllib2-header.py

import urllib2
import sys

url = 'http://notepad.cc/share/W7Cgs95rxW'

req = urllib2.Request(url)
#req.add_header('Referer','http://notepad.cc/lianghui')
req.add_header('User-Agent','Mozilla/5.0 (Windows NT 6.2; rv:16.0) Gecko/20100101 Firefox/16.0')
r = urllib2.urlopen(req)

html = r.read()
receive_header = r.info()

html = html.decode('utf-8').encode(sys.getfilesystemencoding())

#print receive_header
#print '#####################################'
print html

0 0

python网页提取
Python:提取网页数据
Python:提取网页中的电子邮箱
【Python编程】网页URL提取实例
【Python编程】网页中文提取正则
Python提取网页中的超链接地址
python网页自动摘要和关键词提取
Python使用xslt提取网页数据
Python使用xslt提取网页数据
【Python爬虫2】网页数据提取
python 提取网页 charset 的方法
【Python】怎样从网页中提取特定的字符串/行？
python提取网页的特定内容（正则表达式实现）
Python网页正文及内容图片提取算法
python-获取提取网页url爬虫学习（1）
【Python爬虫5】提取JS动态网页数据
Python自动化（八）使用Scrapy shell提取网页信息
【Python】提取网页正文内容的相关模块与技术
高通8X16电池BMS算法（一）
LLDB调试技巧待续
第七次作业
利用onekey软件制作win10.gho系统文件的小方法
伟哥大数据3：MapReduce
python网页提取
linux设备模型之Class
Ubuntu15.10下Solr 6.0的搭建与IKAnalyzer中文分词结合使用
第几天？
Java基础之String类型的使用
c语言判断文件是否存在
Highly Available Queues
c++第七次上机实验
设计模式之策略模式