Installation
Requirements

Requirements

MOSTLY AI is a multi-service application that can run on single-node or multi-node Kubernetes clusters. For a successful deployment and operation, the Kubernetes cluster on which you deploy MOSTLY AI must meet the compute and storage requirements.

If you need to run multiple synthetic datasets in parallel, you can deploy MOSTLY AI in a multi-node environment with multiple worker nodes.

Single-node deployments

You can deploy MOSTLY AI in a single-server environment that runs as a single-node Kubernetes cluster. The single node will run all components of the MOSTLY AI application architecture, including the application and worker nodes.

The minimum requirements for a successful single-node deployment of MOSTLY AI are as follows:

ResourcesSize
CPU16 cores
RAM32 GB
Storage256 GB

The majority of scenarios can be covered with the minimum setup.

Depending on the datasets you intend to synthesize, you might need a node with higher compute resources. Our highest recommended single-node configuration is listed below (more than that is typically not required):

ResourcesSize
CPU64 cores
RAM128 GB
Storage1 TB

Note
To learn how to size your infrastructure based on the size of the datasets you want to synthesize, see the best practices for virtual machine sizes.

Multi-node deployments

For a multi-node deployment, you need a Kubernetes cluster with at least two nodes. One of the nodes functions as the application node and the remaining function as worker nodes.

MOSTLY AI requires nodes with available resources that meet the computation and memory requirements. To ensure a smooth operation in shared cluster environments, it is best to dedicate the nodes solely to MOSTLY AI tasks. This prevents resource conflicts and ensures that all tasks required to complete AI training and to generate synthetic datasets will run unobstructed. To achieve this, you can isolate the dedicated nodes by applying taints and tolerations to node and pod configurations. For more information, see Taints and tolerations (opens in a new tab) in the Kubernetes documentation.

Application node requirements

The application node runs the web-based user interface and distributes the synthetic data generation jobs across the worker nodes.

ResourcesSize
CPU4 cores
RAM8 GB
Storage20 GB

Worker nodes requirements

The worker nodes are responsible for running and processing each synthetic dataset. Depending on the size of the datasets you want to synthesize, the minimum requirements for a single worker node are as follows:

ResourcesSize
CPU8 cores
RAM16 GB
Storage128 GB

The majority of scenarios can be covered with the minimum setup.

Depending on the datasets you intend to synthesize, you might need worker nodes with higher compute resources. Our highest recommended worker node configuration is listed below (more than that is typically not required):

ResourcesSize
CPU64 cores
RAM128 GB
Storage256 GB

Note
To learn how to size your infrastructure based on the datasets you want to synthesize, check the best practices to identify your virtual machine size.