Python IO
Source: Internet · Editor: 程序博客网 · Time: 2024/06/05 15:58
    filename = path + res.group()
    #f = file(filename, 'r+')
    #f = open(filename, 'r+')
    f = file(filename, 'wb')
    #f = open(filename, 'wb')
    f.write(content)
    f.flush()
    f.close()
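The file mode (called "file permission" here) must be right: image data has to be written in binary mode ('wb'). Otherwise the scraped images come out as a full-body mosaic and cannot be viewed.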
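Why the mode matters: in Python 2 on Windows, a file opened in text mode translates every '\n' byte into '\r\n' on write, which corrupts image data, and in Python 3 writing bytes to a text-mode file raises a TypeError. The following is a minimal Python 3 sketch of the binary write, not the author's code. The helper name `save_bytes` and the sample payload are assumptions made up for illustration.

```python
import os
import tempfile


def save_bytes(filename, content):
    """Write raw bytes to filename in binary mode (hypothetical helper)."""
    # 'wb' stores the bytes unchanged; leaving the with-block flushes
    # and closes the file, so no explicit flush()/close() is needed.
    with open(filename, "wb") as f:
        f.write(content)


# Made-up payload: JPEG-like leading bytes plus bytes that a text-mode
# write could alter or reject (newlines, 0x1a, NUL).
payload = b"\xff\xd8\xff\xe0" + b"line1\nline2\r\n" + b"\x1a\x00"

target = os.path.join(tempfile.mkdtemp(), "sample.jpg")
save_bytes(target, payload)

# Read the file back in binary mode and compare byte for byte.
with open(target, "rb") as f:
    restored = f.read()

print(restored == payload)
```

Reading the file back with 'rb' and comparing it to the original bytes is a quick way to confirm that nothing was altered on the way to disk.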
File modes:
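As a rough illustration of how the common binary modes differ, here is a small Python 3 sketch. The file name `modes.bin` and the byte strings are arbitrary choices for the demonstration, not something taken from the original post.

```python
import os
import tempfile

path = os.path.join(tempfile.mkdtemp(), "modes.bin")

# 'wb': create the file, or truncate it if it already exists.
with open(path, "wb") as f:
    f.write(b"abcdef")

# 'ab': append to the end without touching the existing bytes.
with open(path, "ab") as f:
    f.write(b"gh")

# 'r+b': read and write an existing file in place, without truncating.
with open(path, "r+b") as f:
    f.write(b"XY")  # overwrites only the first two bytes

with open(path, "rb") as f:
    data = f.read()

# 'r+b' does not create a missing file; it raises an error instead.
try:
    open(path + ".missing", "r+b")
    missing_ok = True
except IOError:
    missing_ok = False

print(data, missing_ok)
```

This is one reason the commented-out 'r+' variants in the snippet at the top are a poor fit for a downloader: they require the target file to exist already, while 'wb' creates it.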
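I noticed that CSDN now supports Markdown, so I am trying it out along the way.
Color table: http://blog.csdn.net/thither_shore/article/details/52328313
Complete source code for the image downloader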
    # -*- coding: utf-8 -*-
    '''
    @Copyright © 2017 sanbo Inc. All rights reserved.
    @Description: Download beauty photos, full-size images rather than thumbnails {code structure adjusted}
    @Version: 1.0
    @Create: 2 January 2017, 3:17:31 AM
    @Author: sanbo
    '''
    # Note: this script targets Python 2 (urllib2, print statement).
    from bs4 import BeautifulSoup
    import urllib2
    import os.path


    def process(url):
        '''Parse the full-size image URLs on a detail page and download them.'''
        response = urllib2.urlopen(url)
        html = response.read()
        soup1 = BeautifulSoup(html, "lxml")
        someData = soup1.select("div.pic-image img")
        allUrls = []
        for some in someData:
            tempUrl = some['src']
            if tempUrl not in allUrls:
                # Remember the URL so the duplicate check actually works.
                allUrls.append(tempUrl)
                download(tempUrl)


    def getUrls(n):
        '''Collect de-duplicated detail-page URLs from list page n.'''
        url = 'http://www.4j4j.cn/beauty/tag_3_' + str(n) + '.html'
        response = urllib2.urlopen(url)
        html = response.read()
        soup1 = BeautifulSoup(html, "lxml")
        someData = soup1.select("div.i-list li a")
        allUrls = []
        for some in someData:
            tempUrl = some['href']
            # Skip javascript: pseudo-links and URLs already collected.
            if "javascript" not in tempUrl:
                if tempUrl not in allUrls:
                    allUrls.append(tempUrl)
        return allUrls


    def download(url):
        global number
        number += 1
        path = 'D:/girls/'
        # Create the target folder if it does not exist.
        # os.mkdir works on Linux as well; there is no os.path.mkdir.
        if not os.path.exists(path):
            os.mkdir(path)
        #content = urllib2.urlopen(url).read()
        header = {
            'User-Agent': 'Mozilla/5.0 (Windows NT 6.1; WOW64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/35.0.1916.114 Safari/537.36',
            'Cookie': 'AspxAutoDetectCookieSupport=1',
        }
        request = urllib2.Request(url, None, header)
        content = urllib2.urlopen(request).read()
        filename = path + str(number) + '.jpg'
        print 'download [' + filename + ']===>' + url
        f = open(filename, 'wb')  # binary mode: images download correctly
        f.write(content)
        f.flush()
        f.close()


    if __name__ == '__main__':
        number = 1
        for i in range(75):
            urls = getUrls(i)
            for u in urls:
                process(u)
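The script above is written for Python 2. As a hedged sketch of how its de-duplication and download steps might look in Python 3 (not the author's code), the block below uses `urllib.request` in place of `urllib2`. The helper names `unique_links` and `target_filename`, and the extra `directory` and `number` parameters of `download`, are assumptions introduced for illustration. The header values are copied from the script.

```python
import os
import urllib.request

# Header values copied from the script above.
HEADERS = {
    "User-Agent": "Mozilla/5.0 (Windows NT 6.1; WOW64) AppleWebKit/537.36 "
                  "(KHTML, like Gecko) Chrome/35.0.1916.114 Safari/537.36",
    "Cookie": "AspxAutoDetectCookieSupport=1",
}


def unique_links(hrefs):
    """Drop javascript: pseudo-links and duplicates, keeping first-seen order."""
    seen = set()
    result = []
    for href in hrefs:
        if "javascript" in href or href in seen:
            continue
        seen.add(href)
        result.append(href)
    return result


def target_filename(directory, number):
    """Build the numbered .jpg path used for a downloaded image."""
    return os.path.join(directory, str(number) + ".jpg")


def download(url, directory, number):
    """Fetch url and save the raw bytes in binary mode; returns the file path."""
    os.makedirs(directory, exist_ok=True)  # create the folder if needed
    request = urllib.request.Request(url, None, HEADERS)
    with urllib.request.urlopen(request) as response:
        content = response.read()
    filename = target_filename(directory, number)
    with open(filename, "wb") as f:  # binary mode keeps the image intact
        f.write(content)
    return filename


print(unique_links(["a.html", "javascript:;", "a.html", "b.html"]))
```

Passing the counter and directory as arguments instead of using a global keeps the helpers easy to test without touching the network. `download` is defined here but not called.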