python urllib*: fetching web page information
Source: Internet  Published by: json转字符串  Editor: 程序博客网  Time: 2024/05/22 03:27
Straight to the code, going from simple to complex and covering cookie handling along the way.
import urllib
import urllib2
import cookielib
import json

######################## simple urllib2 ########################
# No data argument, so urllib2 sends a GET request.
url = "http://www.baidu.com/s"
request = urllib2.Request(url)
print urllib2.urlopen(request).geturl()

######################## with params ########################
# Passing url-encoded data as the second argument turns the request into a POST.
url = 'http://www.baidu.com/s'
data = {"wd": "hello"}
print urllib.urlencode(data)
request = urllib2.Request(url, urllib.urlencode(data))
print urllib2.urlopen(request).geturl()

######################## with headers ########################
headers = {"User-agent": "Mozilla/4.0 (compatible; MSIE 6.0; Windows NT 5.1)"}
url = 'http://www.baidu.com/s'
request = urllib2.Request(url, urllib.urlencode(data), headers)
print urllib2.urlopen(request).geturl()

######################## method one: urllib2 + cookielib ########################
url = "https://openapi.hellocdn.com/api/rest/login"
headers = {"User-agent": "Mozilla/4.0 (compatible; MSIE 6.0; Windows NT 5.1)"}
login_info = {"user": "someone@some.com", "pass": "someonespassword", "output": "json"}
login_info = urllib.urlencode(login_info)

# The opener keeps cookies in cj and sends them back on later requests.
cj = cookielib.CookieJar()
opener = urllib2.build_opener(urllib2.HTTPCookieProcessor(cj))
request = urllib2.Request(url, login_info, headers)
response = opener.open(request)
token = json.loads(response.read())["loginResponse"]["session"]["sessionToken"]

# Reuse the same opener (and its cookies) for the statistics call.
url = "https://openapi.hellocdn.com/stat/rest/traffic/responseCode/edge"
login_info = {"sessionToken": token, "apiKey": "test", "fromDate": "20140701",
              "toDate": "20140702", "timeInterval": "0", "output": "json"}
headers = {"User-agent": "Mozilla/4.0 (compatible; MSIE 6.0; Windows NT 5.1)"}
login_info = urllib.urlencode(login_info)
request = urllib2.Request(url, login_info, headers)
response = opener.open(request)
print "response", response
print "url", response.geturl()
headers = response.info()
print "headers", headers
for (k, v) in headers.items():
    print k, "#" * 5, v

######################## method two: requests + LWPCookieJar ########################
from cookielib import LWPCookieJar
import requests

url = "https://openapi.hellocdn.com/api/rest/login"
headers = {"User-agent": "Mozilla/4.0 (compatible; MSIE 6.0; Windows NT 5.1)"}
login_info = {"user": "someone@some.com", "pass": "someonespassword", "output": "json"}

# Note: requests keeps response cookies on r.cookies (or on a Session); a jar
# passed through cookies= is only sent, so jar.save() may write an empty file.
jar = LWPCookieJar('cookies.txt')
r = requests.get(url, cookies=jar, params=login_info, headers=headers)
token = r.json()["loginResponse"]["session"]["sessionToken"]
jar.save()

# Load the saved cookies from disk and send them with the statistics call.
url = "https://openapi.hellocdn.com/stat/rest/traffic/responseCode/edge"
login_info = {"sessionToken": token, "apiKey": "test", "fromDate": "20140701",
              "toDate": "20140702", "timeInterval": "0", "output": "json"}
jar = LWPCookieJar('cookies.txt')
jar.load()
r = requests.get(url, cookies=jar, params=login_info, headers=headers)
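The listing above is Python 2. For readers on Python 3, where urllib2 became urllib.request and cookielib became http.cookiejar, the following is a minimal offline sketch of two ideas it relies on: a Request without data is a GET while one with data is a POST, and an LWPCookieJar can be saved to a file and loaded back. The helpers build_get, build_post, make_cookie and roundtrip_cookie, the example.com domain and the demo-token value are all assumptions made for illustration. They are not part of the original code, and nothing here touches the network.

```python
import os
import tempfile
import time
from http.cookiejar import Cookie, LWPCookieJar
from urllib.parse import urlencode
from urllib.request import Request

URL = "http://www.baidu.com/s"
HEADERS = {"User-Agent": "Mozilla/4.0 (compatible; MSIE 6.0; Windows NT 5.1)"}


def build_get(url, params):
    """GET: the parameters travel in the query string, no body is sent."""
    return Request(url + "?" + urlencode(params), headers=HEADERS)


def build_post(url, params):
    """POST: supplying a bytes body via `data` switches the method to POST."""
    return Request(url, data=urlencode(params).encode("ascii"), headers=HEADERS)


def make_cookie(name, value, domain):
    """Hypothetical helper: a persistent cookie that expires in one hour."""
    return Cookie(
        version=0, name=name, value=value,
        port=None, port_specified=False,
        domain=domain, domain_specified=False, domain_initial_dot=False,
        path="/", path_specified=True,
        secure=False, expires=int(time.time()) + 3600, discard=False,
        comment=None, comment_url=None, rest={},
    )


def roundtrip_cookie(path):
    """Save one cookie to an LWP-format file, then load it into a fresh jar."""
    jar = LWPCookieJar(path)
    jar.set_cookie(make_cookie("sessionToken", "demo-token", "example.com"))
    jar.save()  # session-only cookies would additionally need ignore_discard=True

    loaded = LWPCookieJar(path)
    loaded.load()
    return {cookie.name: cookie.value for cookie in loaded}


get_req = build_get(URL, {"wd": "hello"})
post_req = build_post(URL, {"wd": "hello"})
print(get_req.get_method(), get_req.full_url)
print(post_req.get_method(), post_req.data)

cookie_file = os.path.join(tempfile.mkdtemp(), "cookies.txt")
print(roundtrip_cookie(cookie_file))
```

To actually send one of these requests with cookie handling, the Python 3 counterpart of method one would pass the jar to urllib.request.HTTPCookieProcessor and open the request through urllib.request.build_opener, mirroring the urllib2 calls above.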