selenium模拟登陆知乎
来源:互联网 发布:淘宝鹊桥是什么意思 编辑:程序博客网 时间:2024/05/18 00:28
from selenium import webdriverfrom bs4 import BeautifulSoupimport time
#selenium模拟登陆知乎browser=webdriver.Chrome()browser.get('http://www.zhihu.com#signin')browser.find_element_by_class_name('qrcode-signin-cut-button').click()browser.find_element_by_name('account').send_keys('18251552002')browser.find_element_by_name('password').send_keys('000189')time.sleep(5) #手动输入验证码browser.find_element_by_class_name('sign-button').click()time.sleep(2)browser.get('http:www.zhihu.com')browser.execute_script("window.scrollTo(0,document.body.scrollHeight)")time.sleep(2) #模拟下拉获取ajax加载的内容# browser.execute_script("window.scrollTo(0,document.body.scrollHeight)")# time.sleep(2)soup=BeautifulSoup(browser.page_source,'lxml')titles=soup.find_all('h2',class_='ContentItem-title')news=[]for title in titles: info={} info['title']=title.find('a').get_text() print(info) news.append(info)print(len(news))# LOL Seleium
def seleium_crawl(start_page,need_page): browser = webdriver.Chrome() browser.get("http://lol.qq.com/guide/list.shtml") already_page=0 time.sleep(1) browser.find_element_by_css_selector('#list_page input').clear() time.sleep(1) browser.find_element_by_css_selector('#list_page input').send_keys('%d'%start_page) time.sleep(1) browser.find_element_by_css_selector('.pagejump').click() time.sleep(2) for i in range(start_page,start_page+need_page): html=browser.page_source soup=BeautifulSoup(html,'lxml') all_news=soup.find('ul',id='list_content').find_all('li') for news in all_news: new_info={} new_info['title']=news.find('p',class_='btn-a').get_text() new_info['read_num']=news.find('p',class_='bfl-playing').get_text()[4:] new_info['time']=news.find('span',class_='recommend-div-div-raiders-date fr').get_text() print(new_info) print('第%d页'%(start_page+already_page)) already_page += 1 try: browser.find_element_by_class_name('pagenext').click() time.sleep(1) except: break browser.close()
阅读全文
0 0
- selenium模拟登陆知乎
- 模拟登陆知乎
- 【scrapy】模拟登陆知乎
- python---模拟登陆知乎
- python 模拟知乎登陆
- python之selenium模拟登录知乎
- 使用python完成知乎模拟登陆
- Python爬虫模拟登陆知乎
- python模拟登陆知乎,得到cookie
- Python 爬虫模拟登陆知乎
- Python爬虫之模拟登陆知乎
- 第4节---模拟登陆知乎
- Java 爬虫模拟知乎登陆
- Scrapy模拟登陆知乎流程图
- Python爬虫模拟登陆知乎
- Python 模拟知乎登陆,保存登陆cookie
- selenium模拟登陆
- python模拟登陆知乎和CSDN【urllib,re,requests】
- BBR与CoDel
- Xcode9学习笔记19
- 对抗攻击最新研究:仅修改「一个像素」即可骗过神经网络!
- 梅宏院士:软件定义的未来——万物皆可互联,一切均可编程 | CNCC 2017
- HK 2016(Special Tour-构造)
- selenium模拟登陆知乎
- 即时通讯文章推荐分享-不定时更新
- OPENCV乌班图环境编译
- 560万Facebook人际关系数据,揭秘家庭职业传承“真相”
- java常用类库---比较器(Comparable,Comparator)
- leetcode桶装水问题
- 第三小组作业完成情况
- 究竟啥才是互联网架构“高并发”
- 华为2016研发工程师编程题