AnzoGraph System Settings Reference

This section provides reference information for the AnzoGraph system configuration settings. For instructions on changing settings, see Changing AnzoGraph Configuration Settings.

The table below describes the basic-level settings. Additional advanced-level settings are available for use by system administrators or users with an advanced level of knowledge about AnzoGraph or databases in general. See the configuration file, <install_path>/config/settings.conf, for descriptions of the advanced settings.

Setting Description Default Value (Type)
enable_persistence Controls AnzoGraph's save data to disk option. For more information, see Using AnzoGraph Persistence (Preview). false (boolean)
enable_sparql_protocol Whether to enable the HTTP SPARQL protocol service. The sparql_protocol_port setting controls the port to use.

Enabling the SPARQL HTTP protocol opens the standard SPARQL-compliant HTTP endpoint. Unlike the Anzo protocol endpoint, the SPARQL HTTP endpoint is not secured.

false (boolean)
enable_ssl_protocol Whether to enable the HTTPS SPARQL protocol service. The ssl_protocol_port setting controls the port to use.

Enabling the SPARQL HTTPS protocol opens the standard SPARQL-compliant HTTPS endpoint. Unlike the Anzo protocol endpoint, the SPARQL HTTPS endpoint is encrypted but not authenticated.

false (boolean)
internal_directory The directory where AnzoGraph should save internal database-related files such as generated code, logs, and query plans. For more information, see Relocating AnzoGraph Directories. Not set (char)

The default directory is <install_path>/internal.

max_memory Specifies the amount of memory (in MB) that is available for AnzoGraph. The default is system-based; at startup, AnzoGraph determines the amount of RAM that is available and sets max_memory. In test environments where AnzoGraph may be co-located with other programs, you can set the max_memory value to put a limit on the amount of memory AnzoGraph can use. However, Cambridge Semantics recommends that you do not set max_memory unless instructed by Support. System-based (int)
output_format Specifies the default output format for AnzoGraph responses. Valid values are xml, json, or csv. xml (char)
persistence_directory The directory where AnzoGraph should save data when it is persisted to disk. For more information, see Relocating AnzoGraph Directories. Not set (char)

The default directory for persisted data is <install_path>/persistence.

sparql_protocol_port SPARQL service HTTP port to use if enable_sparql_protocol is true. 7070 (int)
sparql_spec_default_graph Controls the default scope of SPARQL queries when FROM clauses are excluded from a query. When false, queries without FROM clauses target the default graph (DEFAULTSET) only. Triples in named graphs will not be included in the scope of the query. When true, AnzoGraph conforms to the SPARQL specification and includes the default graph and all named graphs in the scope of a query that omits the FROM clause. For more information, see Changing the Default FROM Clause Behavior. false (boolean)
spill_directory The directory where AnzoGraph should save temporary query files that spill to disk. For more information, see Relocating AnzoGraph Directories.

AnzoGraph uses O_DIRECT to read the spill files into the database. If you relocate the spill directory, make sure to place it on an ext4 file system that supports O_DIRECT.

Not set (char)

The default directory for spill files is <install_path>/spill.

ssl_protocol_port SPARQL service HTTPS port to use if enable_ssl_protocol is true. 8256 (int)
startup_info Specifies how verbose the startup message is: - 0-quiet, 1-ready, 2-ports, 3-more. 1 (int)
stop_timeout The number of seconds to wait for queries to finish before stopping the database. 30 (int)
truncate_clob Specifies whether to truncate large strings to the maximum string size (1 MB). false (boolean)
use_custom_ssl_files Specifies whether to use custom SSL files containing fully qualified domain names. false (boolean)
user_queues Sets the limit on the number of queries that can run concurrently. 40 (int)
anzo_protocol_port The Anzo protocol (gRPC) port for secure communication between AnzoGraph and Anzo. 5700 (int)
enable_root_user Whether to allow a user running with root privileges to start AnzoGraph. false (boolean)
auto_restart_directory Specifies the base location of the auto_restart directory, which contains the denied_list, warned_list, and unanalyzed_list directories. For more information about the auto-restart feature, see Managing the Automatic Restart Feature. Not set (char)

The default location for the auto_restart directory is <install_path>/internal.

auto_restart_max_attempts Specifies the number of times the system manager should attempt to start the database after a crash. The default value is 5, which means the system manager will attempt to restart the database a maximum of 5 times. Changing auto_restart_max_attempts to 0 disables the auto-restart feature. For more information about the auto-restart feature, see Managing the Automatic Restart Feature. 5 (int)
auto_restart_time Specifies the number of seconds to spend attempting to restart the database. If all attempts fail and this time limit is reached, the system manager stops trying to restart the database. The default value is 600, which means that the system manager will attempt to restart the database for a maximum of 600 seconds (10 minutes). For more information about the auto-restart feature, see Managing the Automatic Restart Feature. 600 (int)
ignore_deniedlist_queries Controls whether denied list queries are blocked from running or are allowed to be run when the database is returned to normal operation. The default value is true, which means denied list queries are ignored. Incoming queries are not compared with the denied list and are permitted to run. If ignore_deniedlist_queries is false, denied list queries are not ignored and are therefore blocked from running until they are removed from the denied list. For more information about the auto-restart feature, see Managing the Automatic Restart Feature. true (boolean)
enable_unbound_variables Controls whether AnzoGraph returns an empty result or an error if a query references a missing graph or includes unbound variables. This value is set to false by default, which means AnzoGraph returns an error. For more information, see Ignoring Missing Graphs. false (boolean)
jvm_max_memory Specifies the maximum size of the heap that can be used by the embedded Java virtual machine (JVM). Use k, m, or g (case insensitive) for KiB, MiB, or GiB. You can also specify % to indicate a percentage of the total memory that is available to AnzoGraph. By default, this value is not set, which means jvm_max_memory defaults to either 5% of the total memory or 4g, whichever value is smaller. Not set (char)

When not set, the default is 5% or 4g, depending on which value is smaller.

jvm_options Lists any optional parameters to use for configuring the embedded JVM. Use a semicolon-delimited (;) list to specify multiple parameters. For information about JVM options, see Options in the Java Documentation. Not set (char)
aws_log_level AnzoGraph uses an AWS C++ SDK for loading data from S3. This setting controls the logging level for the AWS SDK. The default value is 2, which is Error level logging. 2 (int)

Valid values:

  • 0 (Off)
  • 1 (Fatal)
  • 2 (Error)
  • 3 (Warn)
  • 4 (info)
  • 5 (Debug)
  • 6 (Trace)
aws_search_regions Lists the regions to search for AWS S3 buckets. Not set
log_directory Specifies where to write system management daemon (azgmgrd) log files. These types of logs (azgmgrd.log, azgctl-<user>.log, azgpidmgr.log, and azgpids.log) are created before the system is initialized and may be written before the <install_path>/internal/log directory exists. Therefore, they are located outside of the AnzoGraph file system, /tmp by default. If you change the log_directory value, Cambridge Semantics recommends that you choose another location that is outside the internalAnzoGraph directories. Not set

When not set, the default location is /tmp.

paged_data Enables or disables AnzoGraph's paged data feature, which controls whether data is stored in memory or on disk. When this option is false (the default value), data is stored in memory. Setting this option to true changes data storage from in-memory to on-disk (in the persistence_directory).

Enabling this option changes underlying database operations. Before enabling paged data, make sure that the performance and storage impacts are well-understood and that your environment meets the requirements. See Sizing Guidelines for Disk-Based Storage (Preview) for details.

false (boolean)
paged_cache_memory_percent When paged_data is enabled, this setting controls the amount of memory (as a percentage of total memory) to use for caching the most recently requested data. The default value is 20, which means AnzoGraph is configured to use 20% of the total available memory for caching data for analytics. For example, if you have 1 TB of data on disk and 300 GB of available RAM, AnzoGraph caches in memory 60 GB of the most recently accessed data. If a query requests data that is not currently cached, AnzoGraph releases the least accessed data from memory and loads the relevant data into memory. Note that a portion of the paged cache memory percent is used for the overhead of tracking the pages that are accessed. For more information, see Enabling Paged Data Mode (Preview).

Cambridge Semantics recommends that you do not set this value higher than 30.

20 (int)
Related Topics