[Python/爬虫]斗图网
来源:互联网 发布:佳能wifi软件下载 编辑:程序博客网 时间:2024/04/29 11:40
#!/usr/bin/env python# *-* coding:utf-8 -*-from bs4 import BeautifulSoupimport reimport requestsurl = 'https://www.doutula.com/article/list/?page={}'class DownLoadImg(object): def __init__(self,url): self.url = url self.img_url = 'lazy image_dtb img-responsive' self.head = {'User-Agent':'Mozilla/5.0 (Windows NT 6.1; WOW64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/58.0.3029.81 Safari/537.36'} self.content = requests.get(self.url,headers=self.head) self.content.encoding = 'utf-8' self.GetTitle(self.content.content) def GetTitle(self,uri): url = uri soup = BeautifulSoup(url,'html.parser') info = soup.find_all('img',class_=self.img_url) n = 0 for i in info: url = requests.get('http:' + i['data-original']).content if i.get('data-original').endswith('.jpg'): f = open(r'C:\Users\Administrator\Desktop\Python\爬虫\斗图网图片\\%s%s.%s' % (i['alt'],n,'jpg'),'wb') f.write(url) f.close() elif i.get('data-original').endswith('.gif'): f = open(r'C:\Users\Administrator\Desktop\Python\爬虫\斗图网图片\\%s%s.%s' % (i['alt'],n,'gif'),'wb') f.write(url) f.close() n += 1if __name__ == '__main__': for i in range(0,10): print('开始下载第{}页图片'.format(i)) picture = DownLoadImg(url.format(i))
阅读全文
0 0
- [Python/爬虫]斗图网
- python爬虫-->爬虫基础
- [爬虫] Python爬虫技巧
- Python爬虫
- python 爬虫
- python 爬虫
- python 爬虫
- python爬虫
- Python爬虫
- Python爬虫
- python 爬虫
- Python爬虫
- python爬虫
- python 爬虫
- python 爬虫
- python爬虫
- python爬虫
- python爬虫
- 跟viewpagerIndicator一样的效果,但是不用引用library包
- 动态内存的传递
- 身份证号和银行卡号校验码的生成规则和算法
- Muduo网络库学习知识点总结
- 函数指针的笔记
- [Python/爬虫]斗图网
- 匿名内部类
- Scala学习
- Nginx伪静态配置和常用Rewrite伪静态规则集锦
- 座机+手机号一个正则表达式搞定
- 行业选择
- Linux 常用命令锦集
- yum安装出错 --skip-broke
- Jetty基本功能介绍