python 抽取信息

来源:互联网 发布:东方网络什么时候复牌 编辑:程序博客网 时间:2024/05/01 20:41

获取网页中的信息,用到了BeautifulSoup和tornado

#!/usr/bin/env python3from bs4 import BeautifulSoup#import tornado.httpclientimport tornadofrom tornado import httpclientcli=tornado.httpclient.HTTPClient()link='http://www.iciba.com/'search=raw_input('search: ')link+=searchdata=cli.fetch(link)body=data.body.decode('utf8')soup=BeautifulSoup(body)group=soup.find_all(class_='group_pos')group2=group[0].find_all('p')for ele in group2:print(ele.find(class_='fl').get_text())result=ele.find_all('label')for r in result:print(r.get_text())





原创粉丝点击