Tutorials

There are two ways to onboard structured and semi-structured data to Anzo:

Automated Direct Data Load Workflow

The Direct Data Load workflow follows an extract, load, and transform (ELT) process. In the ELT workflow, data sources are onboarded directly to graphmarts. Data layers with SPARQL queries are automatically generated to transform and blend the data to an analytics-ready knowledge graph. The AnzoGraph Graph Data Interface (GDI) Java plugin (sometimes called the Data Toolkit) is used to connect to the sources, create a model, and generate the data layer queries.

This section includes a Building a Knowledge Graph from a CSV File tutorial to help you get started with the direct data load ingestion method.

Manual Ingestion with SPARQL Queries

For advanced users who are familiar with SPARQL, the GDI can also be invoked by writing queries from scratch. The GDI is extremely flexible, allowing you to connect directly to sources via queries and control all aspects of the extract, load, and transform process. In addition to the data sources that the automated workflow supports, you can also ingest raw data and data from HTTP/REST endpoints with manually written SPARQL queries.

For information about the advanced method of writing GDI SPARQL queries for ingestion, see Onboarding or Virtualizing Data with SPARQL Queries.