Connecting to Sources with the Graph Data Interface (Preview)

The AnzoGraph Graph Data Interface (GDI) service, sometimes called the Data Toolkit, is a flexible, configurable service that lets you write SPARQL queries against a variety of remote data sources. It provides a single point of access to your structured and semi-structured data and supports three popular methods of data integration:

  • Extract, Load, Transform (ELT): With the ELT method, you use the GDI service to ingest remote data as is and then transform it into a graph structure in AnzoGraph using SPARQL queries.
  • Virtual Graphs or Virtualization: This method of data integration enables you to access the remote data exactly when it is needed, without requiring you to ingest all of the data into AnzoGraph up front.
  • Extract, Transform, Load (ETL): With the ETL method, you use the GDI service to extract data from a remote source, transform it using SPARQL queries, and then store the result on disk in graph form as an RDF Turtle (TTL) data set that can be loaded later.

The topics in this section introduce you to the GDI and provide instructions for exploring, analyzing, and ingesting data from remote data sources.

The Graph Data Interface is a Preview release: the implementation was completed recently but has not yet been thoroughly tested, so it could be unstable, and future releases may include significant changes. The feature is available for trial use, but Cambridge Semantics recommends that you do not rely on Preview features in production environments.