A Survey and Experimental Comparison of Distributed SPARQL Engines for Very Large RDF Data
Source: Internet  Editor: 程序博客网  Time: 2024/05/19 22:48
This paper, published at VLDB 2017, runs an extensive set of experiments to compare the mainstream distributed SPARQL query processing engines.
Original paper: http://www.vldb.org/pvldb/vol10/p2049-abdelaziz.pdf
The systems covered by the experiments are: AdPart [46], AdPart-NA [46], CliqueSquare [25], DREAM [38], EAGRE [56], gStoreD [45], H-RDF-3X [29], H2RDF+ [41], HadoopRDF [30], Partout [36], PigSparql [14], S2RDF [15], S2X [51], Sedge [57], Sempala [50], SHAPE [32], SHARD [47], TriAD [48], TriAD-SG [48], Trinity.RDF [33], and WARP [28].
The datasets used in the experiments include LUBM, WatDiv, and Bio2RDF. The main evaluation metrics are startup cost and query performance.
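First, some brief background. RDF is a model for representing knowledge and is one way of expressing a knowledge graph. Its basic data unit is the triple, written as <subject, predicate, object>, for example <apple, type, fruit>. An RDF graph is the graph form of a set of RDF triples: subjects and objects become vertices and predicates become edges, which together form one large graph. SPARQL is a structured query language for retrieving RDF data; a query consists of a set of triple patterns and constraints. Because the RDF structure is flexible, more and more knowledge is being represented in RDF, for example DBpedia, YAGO, and Freebase. A simple RDF graph and SPARQL query graph are shown below: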
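As a stand-in illustration (not the figure from the paper), the following minimal Python sketch shows how a small set of triples can be matched against a SPARQL-style set of triple patterns. Only the triple <apple, type, fruit> comes from the text above; every other triple, and the names `TRIPLES`, `match_pattern`, `evaluate`, and `QUERY`, are hypothetical and chosen purely for illustration. The naive nested-loop join used here is an assumption for clarity and is not how any of the surveyed engines evaluates queries.

```python
from typing import Dict, List, Optional, Tuple

Triple = Tuple[str, str, str]   # (subject, predicate, object)
Binding = Dict[str, str]        # variable name -> bound value

# Toy RDF data. Only ("apple", "type", "fruit") appears in the text;
# the remaining triples are made up for this example.
TRIPLES: List[Triple] = [
    ("apple", "type", "fruit"),
    ("banana", "type", "fruit"),
    ("carrot", "type", "vegetable"),
    ("apple", "color", "red"),
    ("banana", "color", "yellow"),
]


def is_var(term: str) -> bool:
    """Terms starting with '?' are variables, as in SPARQL."""
    return term.startswith("?")


def match_pattern(pattern: Triple, triple: Triple,
                  binding: Binding) -> Optional[Binding]:
    """Extend `binding` so that `pattern` matches `triple`, or return None."""
    extended = dict(binding)
    for p_term, t_term in zip(pattern, triple):
        if is_var(p_term):
            # Bind the variable, or check it agrees with an earlier binding.
            if extended.setdefault(p_term, t_term) != t_term:
                return None
        elif p_term != t_term:
            return None
    return extended


def evaluate(patterns: List[Triple], triples: List[Triple]) -> List[Binding]:
    """Evaluate a conjunction of triple patterns with nested-loop joins."""
    bindings: List[Binding] = [{}]
    for pattern in patterns:
        next_bindings: List[Binding] = []
        for binding in bindings:
            for triple in triples:
                extended = match_pattern(pattern, triple, binding)
                if extended is not None:
                    next_bindings.append(extended)
        bindings = next_bindings
    return bindings


# Roughly: SELECT ?x ?c WHERE { ?x type fruit . ?x color ?c }
QUERY: List[Triple] = [("?x", "type", "fruit"), ("?x", "color", "?c")]
RESULTS = evaluate(QUERY, TRIPLES)
print(RESULTS)
```

The two patterns share the variable `?x`, so the second pattern acts as a join on the subject vertex. Distributed engines differ mainly in how they partition the triples and where such joins are executed.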