python 爬取网页的最基础三种方法

来源：互联网发布：mac word繁体变简体编辑：程序博客网时间：2024/06/06 14:03

# coding:utf8import cookielibimport urllib2url = "http://www.baidu.com"print "第一种方法"response1 = urllib2.urlopen(url)print response1.getcode()print len(response1.read())print "第二种方法"request = urllib2.Request(url)request.add_header("user-agent","Mozilla/5.0")response2 = urllib2.urlopen(request)print response2.getcode()print len(response2.read())print "第三种方法"cj = cookielib.CookieJar()opener = urllib2.build_opener(urllib2.HTTPCookieProcessor(cj))urllib2.install_opener(opener)response3 = urllib2.urlopen(url)print response3.getcode()print cjprint len(response3.read())

待续。。。。

阅读全文

0 0

python 爬取网页的最基础三种方法
Python爬虫实战(十一)：两种简单的方法爬取动态网页
python爬取网页
Python 网页爬取
Python爬虫实战(三):简单爬取网页图片
Python爬取一个网页的图片
Python爬取网页的编码处理
Python爬取一个基本的网页
Python爬取一个网页的图片
Python基础学习-小代码1【爬取网页图片】
python最简单的爬取邮箱地址
几种网页爬取的方法与实现(Java)
爬取网页的两种方法（python3）
python三种网页抓取方法
Python 三种网页抓取方法
Python爬取网页信息时，经常使用的正则表达式及方法
python 爬取网页正文
python 多线程网页爬取
ORACLE数据库数据操作语言DML
第一次发博客，不知写点啥，就来个hello world吧！
【BZOJ】1030 [JSOI2007]文本生成器 AC自动机+DP
设计模式之代理模式
【回文串】835D Palindromic characteristics
python 爬取网页的最基础三种方法
php基础 unset()、isset()、defined()、empty()
走出第一步
牛顿迭代法在求解特征值问题中的应用
Java Web中的Servlet及Filter
Intent的显式与隐式
习题 2.4（7）求两个数m和n的最大公约数。
牛腩新闻发布系统
ES6中fetch的post的前后端node传参的问题