Scraping Images from 妹子图 (mzitu.com)
Source: Internet  Published by: 投资组合经理 知乎  Editor: 程序博客网  Time: 2024/04/29 22:39
#!/usr/bin/python
# encoding:utf-8
import requests
from bs4 import BeautifulSoup

# Index page that links to every album on the site.
url = 'http://www.mzitu.com/all/'
# Send a browser-like User-Agent with every request.
headers = {
    'User-Agent': 'Mozilla/5.0 (Windows NT 6.1; WOW64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/57.0.2987.133 Safari/537.36'
}

req = requests.get(url, headers=headers)
soup = BeautifulSoup(req.text, 'lxml')
# print(soup)
# Each <a> inside <div class="all"> is treated as a link to one album.
all_a = soup.find('div', class_='all').find_all('a')
for a in all_a:
    title = a.get_text()
    href = a['href']
    # print(href)
    html = requests.get(href, headers=headers)
    html_soup = BeautifulSoup(html.text, 'lxml')
    # The second-to-last <span> in the pager is read as the last page number.
    max_span = html_soup.find('div', class_='pagenavi').find_all('span')[-2].get_text()
    for page in range(1, int(max_span) + 1):
        page_url = href + '/' + str(page)
        # print(page_url)
        img_html = requests.get(page_url, headers=headers)
        img_soup = BeautifulSoup(img_html.text, 'lxml')
        img_url = img_soup.find('div', class_='main-image').find('img')['src']
        # print(img_url)
        # Use the five characters before the extension as the file name.
        name = img_url[-9:-4]
        img = requests.get(img_url, headers=headers)
        # 'wb' overwrites the file; 'ab' would append to an old copy on re-runs.
        with open(name + '.jpg', 'wb') as f:
            f.write(img.content)
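
The script names each file by slicing img_url[-9:-4], which only works when the image URL ends in exactly five name characters followed by a four-character extension such as ".jpg". As a minimal sketch of a more tolerant alternative, assuming Python 3, the helper below takes the last path segment of the URL instead. The function name filename_from_url, the fallback name image.jpg, and the example URL are all hypothetical and not part of the original script.

```python
import posixpath
from urllib.parse import urlparse


def filename_from_url(img_url, default='image.jpg'):
    """Return the last path segment of img_url, or default if it is empty."""
    # urlparse drops the query string and fragment from the path component.
    path = urlparse(img_url).path
    # URL paths always use '/', so posixpath is used on every platform.
    name = posixpath.basename(path)
    return name or default


# Hypothetical URL, used only to illustrate the behaviour.
print(filename_from_url('http://example.com/2017/04/29a01.jpg?v=1'))
```

In the download loop this would replace the name + '.jpg' expression, for example open(filename_from_url(img_url), 'wb'), so the original extension is kept rather than assumed.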