Setting up a Cluster

Follow the instructions below to install AnzoGraph on a cluster.

  1. Complete the Pre-Installation Configuration
  2. Install AnzoGraph DB

Complete the Pre-Installation Configuration

Before installing AnzoGraph, make sure that all host servers in the cluster meet the requirements in Server and Cluster Requirements. This section provides instructions for installing the required software and tuning user resource limits.

Install GNU Compiler Collection (GCC)

AnzoGraph requires the latest version of the GCC tools for your operating system. Run the following command to install GCC on each server in the cluster:

sudo yum install gcc
Note Specifically, AnzoGraph requires the glibc, glibc-devel, and gcc-c++ libraries. Typically, when you install GCC by running yum install gcc, those libraries are included as part of the package. In rare cases, depending on the host server configuration, installing GCC excludes certain libraries. If AnzoGraph fails to start and you receive a "Compilation failed" message, it may indicate that some of the required libraries are missing. To install the missing libraries, run the following command:
sudo yum install glibc glibc-devel gcc-c++

Tune User Resource Limitations (ulimits)

Cambridge Semantics recommends that you tune the user resource limits (ulimits) for your Linux distribution to increase the limits for the following resources. Tune ulimits on all servers in the cluster:

  • Increase the open files limit to at least 4096.
  • Increase the limit for the following resources to unlimited:
    • cpu time
    • file locks
    • file size
    • max memory size
    • max user processes
    • virtual memory

To view the current ulimits, run ulimit -a. To permanently change ulimits, modify the /etc/security/limits.conf file. For more information, see How to set ulimit values in the RHEL support documentation.

Note Also make sure that the Linux environment variables http_proxy and https_proxy are not set. The Anzo gRPC protocol cannot make connections to the database when proxies are enabled.

Install AnzoGraph DB

Important Cambridge Semantics recommends that you do not install or run AnzoGraph as the root user. Because AnzoGraph offers features such as user-defined extensions, it is not secure software certified.
  1. Note the IP addresses of each server, and pick one server to act as the leader node. Typically users choose the lowest IP address to become the leader. After the installation, you will configure the cluster by listing the IP addresses in a configuration file.
  2. Go to the AnzoGraph Download page, which lists the AnzoGraph releases that are available to download. The page offers direct download for downloading the tarball to your computer or a cURL command that you can run from the AnzoGraph host server to download the tarball onto the server. Place the azg.tgz tarball on each of the host servers in the cluster, and then proceed to the next step.
  3. Unpack the tarball and install AnzoGraph on each server. Install AnzoGraph in the same file system location on each server. To unpack the tarball, change directories to the location where you want to install AnzoGraph, such as your home directory. Do not choose a root-owned location. Then run the following command to unpack the tarball and install AnzoGraph in that location:
    tar -zxf /path_to_tarball/azg.tgz

    The tarball unpacks the AnzoGraph files into a directory named azg.

  4. On each server in the cluster, open the install_path/azg/config/ip_addrs.conf file in a text editor. In ip_addrs.conf, type a list of the IP addresses for each server in the cluster. Include each address on a new line. Type the leader server IP address on the first line, followed by each compute IP address. Then save and close the file. For example, on a cluster with 4 servers where 192.168.2.1 is the leader server:
    192.168.2.1
    192.168.2.2
    192.168.2.3
    192.168.2.4
    ImportantMake sure that ip_addrs.conf is exactly the same, with IP addresses in the same order, on each server.
  5. On each server in the cluster, run the following command to start the system manager daemon. The system manager daemon is a lightweight program that runs at all times and manages the communication between AnzoGraph servers:
    ./install_path/bin/azgmgrd
    NoteIf you run the command as the root user, the system manager displays the following error message:
    Error - Cannot execute as user 'root'.
    To override this security protection, set 'enable_root_user=true':
    Invalid user id

    To resolve the issue, follow the instructions in Changing System Settings to add enable_root_user=true to the AnzoGraph settings file and enable root user access. Then repeat this step.

  6. On the leader node, run the following command to start the database on the cluster:
    ./install_path/bin/azgctl -start
  7. The software prompts you to review and accept the license agreement. Review the AnzoGraph license and type y and press Enter to agree to the license and start the database, or type n and press Enter to disagree and discontinue the startup.

For next steps, see Quickstart Using the Command Line Interface, a brief tutorial designed to introduce you to the AnzoGraph command line interface and provide instructions for loading and querying a sample data set as well as your own data.

Related Topics