Scraping Jokes from a Website

Source: Internet | Editor: 程序博客网 | Time: 2024/05/21 14:03

Use the requests library and regular expressions to scrape jokes from a page and save them to a .txt file.

Link: https://github.com/Spacewe/python

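import re
import requests

url = "http://hahahahhaahah.com/"
# url = ""
header = {'User-agent': 'Mozilla/5.0 (Windows NT 10.0; WOW64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/55.0.2883.87 Safari/537.36'}

# Download the page and decode the body as UTF-8
haha = requests.get(url, headers=header)
haha.encoding = 'utf-8'
# print(haha.text)

# Capture the text inside every <p>...</p>; re.S lets "." match newlines too
heihei = re.findall('<p>(.*?)</p>', haha.text, re.S)

# Python 3 text mode with an explicit encoding replaces the old
# reload(sys) / sys.setdefaultencoding("utf-8") workaround
with open('neihan.txt', 'w', encoding='utf-8') as fp:
    for each in heihei:
        print(each)
        print('-' * 100)
        fp.write(each)
        fp.write("\n\n")  # blank line so entries do not run into each other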
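The regular-expression step can be tried without any network access. The sketch below is only an illustration, not part of the original script: SAMPLE_HTML, extract_paragraphs and save_paragraphs are hypothetical names, and the sample markup is made up. It applies the same '<p>(.*?)</p>' pattern with re.S, then strips leftover inline tags and HTML entities, which the original script would write to the file unchanged.

```python
import html
import re

# Hypothetical sample markup standing in for a downloaded page (assumption).
SAMPLE_HTML = """
<div class="content">
<p>First joke,<br/>second line.</p>
<p>  Second joke &amp; a <b>bold</b>
word.  </p>
<p></p>
</div>
"""


def extract_paragraphs(page_text):
    """Return the cleaned text of every <p>...</p> block in page_text."""
    # Same pattern as the script above; re.S lets "." span line breaks.
    raw_items = re.findall(r'<p>(.*?)</p>', page_text, re.S)
    cleaned = []
    for item in raw_items:
        item = re.sub(r'<br\s*/?>', '\n', item)  # turn <br> into a newline
        item = re.sub(r'<[^>]+>', '', item)      # drop any remaining tags
        item = html.unescape(item).strip()       # decode entities such as &amp;
        if item:                                 # skip empty paragraphs
            cleaned.append(item)
    return cleaned


def save_paragraphs(items, path):
    """Write each item to path as UTF-8, separated by a blank line."""
    with open(path, 'w', encoding='utf-8') as fp:
        for item in items:
            fp.write(item)
            fp.write('\n\n')


jokes = extract_paragraphs(SAMPLE_HTML)
```

Calling save_paragraphs(jokes, 'neihan.txt') would then write the two cleaned entries to the file. A regular expression is enough for flat markup like this; for pages with nested or irregular tags an HTML parser is usually the safer choice.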

1 0

爬取网站段子
neihan8段子爬取
爬取糗事百科段子
python爬虫爬取段子
糗事百科段子爬取
【网络爬虫】爬取糗事百科段子
利用Scrapy爬取糗事百科段子
python 爬虫爬取糗事百科段子
[Scrapy]爬取糗事百科段子
爬取糗事百科，朗读段子
Python爬虫爬取糗事百科段子
【爬虫】爬取煎蛋上的段子
Python爬虫爬取糗事百科段子
爬取糗事百科的段子Demo
python爬取糗事百科段子
Python爬虫爬取糗事百科段子
Python爬虫实战一之爬取糗事百科段子
pythpn学习の爬虫爬取糗事百科热门段子
拷贝构造函数4.匿名对象
看代码写结果——C++类的静态成员
LeetCode Weekly Contest 25
Android与JS调用
什么是支付账户、备付金、网络支付、银行卡清算、贷记卡、代扣、代付....
爬取网站段子
08链表
Linggle常用命令
LeetCode 94. Binary Tree Inorder Traversal
再次理解dfs，poj1014
中缀表达式树及其结果计算
【VS2013】错误处理error C4996: 'fopen': This function or variable may be unsafe
第二章：Oracle数据库的用户和表空间
计161_Problem : 字符串替换（串）