Downloading Images with Scrapy's ImagesPipeline
Source: Internet  Editor: 程序博客网  Time: 2024/06/02 00:09
Project source download: http://download.csdn.net/download/adam_zs/10166641
1. Project structure and a screenshot of the downloaded images
2. Project overview
settings.py
ITEM_PIPELINES = {
    # 'scrapy.pipelines.images.ImagesPipeline': 1
    "ImagesPipelineTest.pipelines.MyImagesPipeline": 1
}
IMAGES_STORE = 'E:\\shetuwang2017'
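The settings above hard-code a Windows drive path. As a minimal sketch, assuming you would rather keep the images under the current user's home directory, the same configuration could be written portably as below. The directory location and the numeric values are illustrative assumptions, not part of the original project; `IMAGES_EXPIRES`, `IMAGES_MIN_WIDTH` and `IMAGES_MIN_HEIGHT` are optional built-in Scrapy settings. Note that ImagesPipeline needs the Pillow package installed.

```python
import os

# Enable the custom pipeline from pipelines.py (same entry as in the article).
ITEM_PIPELINES = {
    "ImagesPipelineTest.pipelines.MyImagesPipeline": 1,
}

# Assumption: store images under the user's home directory instead of a
# hard-coded Windows drive, so the same settings file works on any OS.
IMAGES_STORE = os.path.join(os.path.expanduser("~"), "shetuwang2017")

# Optional built-in settings (illustrative values, not from the article):
IMAGES_EXPIRES = 90      # do not re-download images fetched within the last 90 days
IMAGES_MIN_WIDTH = 100   # skip images narrower than 100 px
IMAGES_MIN_HEIGHT = 100  # skip images shorter than 100 px
```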
items.py
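import scrapy


class ImageItem(scrapy.Item):
    # image_urls and images are the default field names ImagesPipeline expects
    image_urls = scrapy.Field()
    images = scrapy.Field()
    # Set by MyImagesPipeline.item_completed in pipelines.py; a scrapy.Item
    # raises KeyError when an undeclared field is assigned
    image_paths = scrapy.Field()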
she_tu_wang.py
# -*- coding: utf-8 -*-
import scrapy
from ImagesPipelineTest.items import ImageItem


class XiaohuaSpider(scrapy.Spider):
    name = "shetuwang"
    allowed_domains = ["699pic.com"]
    start_urls = ['http://699pic.com/people.html']
    download_delay = 2  # wait 2 seconds between requests

    def parse(self, response):
        item = ImageItem()
        # Collect the image URLs stored in each thumbnail's data-original attribute
        srcs = response.xpath('//div[@class="swipeboxEx"]/div[@class="list"]/a/img/@data-original').extract()
        item['image_urls'] = srcs
        yield item
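To see what the XPath in `parse` selects without hitting the network, here is a minimal sketch that applies an equivalent query to a small, made-up page. It uses the standard library's `xml.etree.ElementTree` rather than Scrapy's selectors, so the attribute is read with `.get()` instead of `/@data-original`. The sample markup, the example.com URLs and the helper name `extract_image_urls` are all hypothetical; the real 699pic.com page may be structured differently.

```python
import xml.etree.ElementTree as ET

# Hypothetical, simplified markup for illustration only.
SAMPLE_HTML = """
<html><body>
  <div class="swipeboxEx">
    <div class="list">
      <a href="/photo-1.html"><img data-original="http://example.com/1.jpg" src="placeholder.gif"/></a>
    </div>
    <div class="list">
      <a href="/photo-2.html"><img data-original="http://example.com/2.jpg" src="placeholder.gif"/></a>
    </div>
  </div>
  <div class="other"><a href="#"><img data-original="http://example.com/skip.jpg"/></a></div>
</body></html>
"""


def extract_image_urls(markup):
    """Return data-original values of images inside div.swipeboxEx > div.list > a."""
    root = ET.fromstring(markup.strip())
    imgs = root.findall(".//div[@class='swipeboxEx']/div[@class='list']/a/img")
    return [img.get("data-original") for img in imgs if img.get("data-original")]


print(extract_image_urls(SAMPLE_HTML))
```

Only the two images nested under `swipeboxEx`/`list` are returned; the image in the unrelated `div` is ignored, which mirrors how the spider limits itself to the gallery thumbnails.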
pipelines.py
from scrapy.pipelines.images import ImagesPipeline
from scrapy.exceptions import DropItem
from scrapy.http import Request


class MyImagesPipeline(ImagesPipeline):
    def get_media_requests(self, item, info):
        # Schedule one download request per image URL
        for image_url in item['image_urls']:
            yield Request(image_url)

    def item_completed(self, results, item, info):
        # results is a list of (success, info) tuples; keep the paths of successful downloads
        image_path = [x['path'] for ok, x in results if ok]
        if not image_path:
            raise DropItem('Item contains no images')
        # The item class must declare an image_paths field for this assignment to work
        item['image_paths'] = image_path
        return item
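The list comprehension in `item_completed` is easier to follow with a concrete `results` value. The sketch below runs the same filtering on a hand-written sample, so it needs neither Scrapy nor a network connection. The sample values and the helper name `successful_paths` are made up for illustration, and a plain `Exception` stands in for the Twisted `Failure` object that Scrapy passes for a failed download.

```python
# Hypothetical sample of the `results` argument (values are made up).
sample_results = [
    (True, {"url": "http://example.com/a.jpg", "path": "full/aaa.jpg", "checksum": "0123"}),
    (False, Exception("download failed")),  # stand-in for a Twisted Failure
    (True, {"url": "http://example.com/b.jpg", "path": "full/bbb.jpg", "checksum": "4567"}),
]


def successful_paths(results):
    """Return the stored path of every image that downloaded successfully."""
    return [info["path"] for ok, info in results if ok]


print(successful_paths(sample_results))  # ['full/aaa.jpg', 'full/bbb.jpg']
```

The paths are relative to `IMAGES_STORE`. If every download fails the list is empty, which is the case where the pipeline raises `DropItem`.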
3. Running the project
Run begin.py in PyCharm
from scrapy import cmdline

# cmdline.execute("scrapy crawl dmoz".split())
cmdline.execute("scrapy crawl shetuwang".split())
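`cmdline.execute` takes an argv-style list, which is why the command string is split on whitespace first. As a small sketch, the same list can be built directly from the spider name; the helper `build_crawl_argv` is hypothetical and not part of Scrapy. The script presumably needs to sit in the project root, next to scrapy.cfg, so that Scrapy can locate the project settings.

```python
def build_crawl_argv(spider_name):
    """Build the argv list for `scrapy crawl <spider_name>`."""
    return ["scrapy", "crawl", spider_name]


argv = build_crawl_argv("shetuwang")
print(argv)  # ['scrapy', 'crawl', 'shetuwang']

# With Scrapy installed, this list would be passed on as:
#     from scrapy import cmdline
#     cmdline.execute(argv)
```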