GraphLab Create

来源:互联网 发布:数据监控体系 编辑:程序博客网 时间:2024/06/03 03:42

Detailed infomation please click: Dato's user guide

GraphLab Create is a Python package that allows programmers to perform end-to-end large-scale data analysis and data product development.

  • Data ingestion and cleaning with SFrames. SFrame is an efficient disk-based tabular data structure that is not limited by RAM. This lets you scale your analysis and data processing to handle terabytes of data, even on your laptop.

  • Data exploration and visualization with GraphLab Canvas. GraphLab Canvas is a browser-based interactive GUI that allows you to explore tabular data, summary plots and statistics.

  • Network analysis with SGraph. SGraph is a disk-based graph data structure that stores vertices and edges in SFrames.

  • Predictive model development with machine learning toolkits. GraphLab Create includes several toolkits for quick prototyping with fast, scalable algorithms.

  • Production automation with data pipelines. Data pipelines allow you to assemble reusable code tasks into jobs and automatically run them on common execution environments (e.g. Amazon Web Services, Hadoop).


Installing using Dato Launcher may fail for network connection issue. I solved it by:
  1. Download Anaconda from continuum
  2. Download GraphLab Create from python's website
  3. Open Dato Launcher, it will install them automatically.
0 0