A Comprehensive List of Experimental Datasets

Source: Internet · Editor: 程序博客网 · Time: 2024/04/26 04:52

1、DBLP Datasets 

README.txt      05-Dec-2011 18:36   3.6K
dblp.dtd        11-Dec-2011 18:42   8.9K
dblp.xml        04-Jan-2014 19:29   1.2G
dblp.xml.gz     04-Jan-2014 19:30   235M
dblp_bht.dtd    17-May-2005 15:53   7.0K
dblp_bht.xml    01-Sep-2011 21:44   130M
docu/           18-Dec-2012 12:05   -
hdblp.xml.gz    24-Jun-2013 16:06   205M
tags.xml        29-Jan-2013 15:21   33M
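Since dblp.xml is listed at 1.2G, loading it into memory in one go is usually impractical. A minimal sketch of streaming through such a file one record at a time follows. It assumes records are elements such as `article` and `inproceedings` with a `key` attribute and `author`, `title`, and `year` children; the sample records and the `iter_records` helper are made up for illustration and are not taken from the dataset.

```python
import io
import xml.etree.ElementTree as ET

# Tiny stand-in for dblp.xml; these records are invented for illustration.
SAMPLE = b"""<dblp>
<article key="journals/x/A1"><author>A. Author</author><title>First title.</title><year>2011</year></article>
<inproceedings key="conf/y/B2"><author>B. Author</author><author>C. Author</author><title>Second title.</title><year>2013</year></inproceedings>
</dblp>"""

# Assumed subset of record types; the real file may contain more.
RECORD_TAGS = {"article", "inproceedings"}


def iter_records(source):
    """Yield (key, title, year, authors) for each record, one at a time."""
    for _event, elem in ET.iterparse(source, events=("end",)):
        if elem.tag in RECORD_TAGS:
            authors = [a.text for a in elem.findall("author")]
            yield elem.get("key"), elem.findtext("title"), elem.findtext("year"), authors
            elem.clear()  # drop the children of the record we just finished


records = list(iter_records(io.BytesIO(SAMPLE)))
for record in records:
    print(record)
```

For the compressed file, a file object from `gzip.open("dblp.xml.gz", "rb")` could be passed in place of the in-memory sample. The real data may also use character entities declared in dblp.dtd, which the standard-library parser may not resolve on its own, so a DTD-aware parser might be needed there.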

2、Flickr

This is the social network of Flickr users and their friendship connections.

  • TSV file: flickr-growth.tar.bz2 (124.80 MiB)
  • Extraction code: wosn.tar.bz2 (2.53 KiB)
  • RDF: flickr-growth.n3.bz2 (128.86 MiB)
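A friendship network distributed as a TSV file is typically read as an edge list. The sketch below counts outgoing links per user. The column layout (source user id, target user id, then optional extra columns) is an assumption and should be checked against the extracted file; the sample rows and the helper names `read_edges` and `out_degrees` are hypothetical.

```python
from collections import defaultdict
import io

# Invented sample rows; the real column layout is an assumption to verify.
SAMPLE = "1\t2\t2006-11-02\n1\t3\t2006-11-02\n2\t3\t2006-12-01\n"


def read_edges(lines):
    """Yield (source, target) pairs from tab-separated lines, skipping short or blank lines."""
    for line in lines:
        fields = line.rstrip("\n").split("\t")
        if len(fields) < 2 or not fields[0]:
            continue
        yield fields[0], fields[1]


def out_degrees(edges):
    """Count how many outgoing links each source user has."""
    counts = defaultdict(int)
    for source, _target in edges:
        counts[source] += 1
    return dict(counts)


degrees = out_degrees(read_edges(io.StringIO(SAMPLE)))
print(degrees)
```

The archive itself could be unpacked with the standard `tarfile` module, for example `tarfile.open("flickr-growth.tar.bz2", "r:bz2")`. Because the generator reads line by line, the same functions should work on an open file handle without holding the whole edge list in memory.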

3、UCINET IV Datasets

The following pages describe the standard UCINET IV datasets provided with the program. Multirelational data are stored, when possible, in a single multirelational data file. Each relation within a multirelational set is labelled and information about the form of the data is described for each individual matrix.

  • BERNARD & KILLWORTH FRATERNITY
  • BERNARD & KILLWORTH HAM RADIO
  • BERNARD & KILLWORTH OFFICE
  • BERNARD & KILLWORTH TECHNICAL
  • DAVIS SOUTHERN CLUB WOMEN
  • GAGNON & MACRAE PRISON
  • KAPFERER MINE
  • KAPFERER TAILOR SHOP
  • KNOKE BUREAUCRACIES
  • KRACKHARDT OFFICE CSS
  • NEWCOMB FRATERNITY
  • PADGETT FLORENTINE FAMILIES
  • READ HIGHLAND TRIBES
  • ROETHLISBERGER & DICKSON BANK WIRING ROOM
  • SAMPSON MONASTERY
  • SCHWIMMER TARO EXCHANGE
  • STOKMAN-ZIEGLER CORPORATE INTERLOCKS
  • THURMAN OFFICE
  • WOLFE PRIMATES
  • ZACHARY KARATE CLUB

4、Wuhan University Web Search and Mining Lab

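The Wuhan University Web Search and Mining Lab is a research lab specializing in Internet search and data mining. Its main research directions include Web data extraction, topic-based Web data fusion, social networks, microblog data, the Deep Web, vertical Web search, e-commerce, and other areas related to Web information search and mining.
It mainly provides social network research data, Sina Weibo data, Twitter data, e-commerce website research data, and vertical Web search data: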
  • Stanford Dataset collection: Networks with ground-truth communities
  • Stanford Dataset collection: Wikipedia networks and metadata
  • Stanford Dataset collection: Autonomous systems graphs
  • Stanford Dataset collection: Web graphs
  • Stanford Dataset collection: Social networks
  • Stanford Dataset collection: Signed networks
  • Stanford Dataset collection: Product co-purchasing networks
  • Stanford Dataset collection: Online Communities
  • Stanford Dataset collection: Location-based online social networks
  • Stanford Dataset collection: Internet peer-to-peer networks
  • Stanford Dataset collection: Communication networks
  • Stanford Dataset collection: Collaboration networks
  • Stanford Dataset collection: Online Reviews
  • Stanford Dataset collection: Citation networks
  • Stanford Dataset collection: Road networks
  • Twitter data stream 2 (Followered User To Following User)
  • Twitter data stream 1 (Followered User To Following User)
  • Twitter data stream 4 (Followered User To Following User)
  • Twitter data stream 3 (Followered User To Following User)

5、NLPIR: Natural Language Processing and Information Retrieval Sharing Platform
