A First Look at Luigi
Source: Internet  Editor: 程序博客网  Date: 2024/06/05 16:45
import luigi


class InputText(luigi.ExternalTask):
    '''
    This class represents something that was created elsewhere by an external
    process, so all we want to do is to implement the output method.
    '''
    date = luigi.DateParameter()

    def output(self):
        return luigi.LocalTarget(self.date.strftime('/var/tmp/text/%Y-%m-%d.txt'))


class WordCount(luigi.Task):
    date_interval = luigi.DateIntervalParameter()

    def requires(self):
        # One InputText dependency per day in the interval.
        return [InputText(date) for date in self.date_interval.dates()]

    def output(self):
        return luigi.LocalTarget('/var/tmp/text-count/%s' % self.date_interval)

    def run(self):
        count = {}
        # The input() method is a wrapper around requires() that returns Target objects.
        for file in self.input():
            # Target objects are a file system/format abstraction;
            # open() returns a file stream object.
            for line in file.open('r'):
                for word in line.strip().split():
                    count[word] = count.get(word, 0) + 1

        # Output data. items() works on both Python 2 and 3 (iteritems() is Python 2 only),
        # and the loop variable is named n so it does not shadow the count dict.
        f = self.output().open('w')
        for word, n in count.items():
            f.write("%s\t%d\n" % (word, n))
        f.close()  # Note that this is essential because file system operations are atomic


if __name__ == '__main__':
    luigi.run(main_task_cls=WordCount)
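The closing comment in run() says that calling close() is essential because file system operations are atomic. As far as I understand it, a local target writes to a temporary file and only moves it to the final path on close, so a half-written output is never mistaken for a finished task. The sketch below illustrates that idea with the standard library only. It is not Luigi's implementation, and atomic_write is a hypothetical helper name introduced for this example.

```python
import os
import tempfile


def atomic_write(path, text):
    """Write text to path so readers never observe a half-written file.

    Hypothetical helper for illustration; not part of Luigi.
    """
    directory = os.path.dirname(path) or "."
    os.makedirs(directory, exist_ok=True)
    # Create the temporary file in the destination directory so that the
    # final rename stays on a single file system.
    fd, tmp_path = tempfile.mkstemp(dir=directory, suffix=".tmp")
    try:
        with os.fdopen(fd, "w") as tmp:
            tmp.write(text)
        # The output becomes visible at its final path only at this point.
        os.replace(tmp_path, path)
    except BaseException:
        # On any failure, remove the partial temporary file and re-raise.
        if os.path.exists(tmp_path):
            os.remove(tmp_path)
        raise


if __name__ == "__main__":
    # Assumed demo location: a fresh temporary directory, not /var/tmp.
    target = os.path.join(tempfile.mkdtemp(), "text-count", "demo.tsv")
    atomic_write(target, "hello\t2\nworld\t1\n")
    with open(target) as f:
        print(f.read())
```

If the process dies before os.replace runs, the final path simply does not exist, which is the condition a scheduler that checks for output existence would treat as "not done yet".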