Loading Non-RDF Data with the GDI

The AnzoGraph DB Graph Data Interface (GDI) service, sometimes called the Data Toolkit, is a flexible, configurable service component that lets you write SPARQL queries that access a variety of data sources. The GDI service provides a single point of access to all of your structured and semi-structured data, and it supports three popular methods of data integration:

  • Extract, Load, Transform (ELT): With the ELT method, you use the GDI service to ingest remote data and then use SPARQL queries to transform it into a graph structure in AnzoGraph DB.
  • Virtual Graphs or Virtualization: With this method, you access the remote data at the time it is needed, without first ingesting all of the data into AnzoGraph DB.
  • Extract, Transform, Load (ETL): With the ETL method, you use the GDI service to extract data from a remote source, transform it using SPARQL queries, and then store it on disk in graph form as an RDF Turtle (TTL) data set that can be loaded later.

The topics in this section introduce you to the GDI and provide instructions for exploring, analyzing, and ingesting data from non-RDF data sources.