Onboarding Data

Anzo onboards structured and unstructured data into the Dataset catalog. Structured data sources, such as relational databases or flat files, are onboarded using Anzo's built-in pipelines. These pipelines natively support CSV, JSON, and XML files, as well as connections to all common databases, including SQL, Oracle, MySQL, Hive, and others.

Anzo also onboards data from unstructured sources, such as PDFs, documents, or knowledge bases. It uses natural language processing (NLP) to find and extract data from these sources and add it to the graph model.

The topics in this section provide instructions for configuring file store locations and onboarding structured and unstructured data.