Installation
Requirements

Requirements

MOSTLY AI is a multi-service application that can run on single-node or multi-node Kubernetes clusters. For a successful deployment and operation, the Kubernetes cluster on which deploy MOSTLY AI needs to meets a set of compute and storage requirements.

Single-node deployments

You can deploy MOSTLY AI in a single-server environment that runs as a single-node Kubernetes cluster. The single node will run all components of the MOSTLY AI application architecture, including the application and worker nodes.

The resource requirements for a single-node cluster are categorized into tiers. MOSTLY AI defines each tier depending on the size of the datasets that you intend to synthesize.

Tier 1Tier 2Tier 3
Data sizeDatasets with up to 1 million subjects and 100 columnsDatasets with up to 10 million subjects and 250 columnsDatasets with more than 10 million subjects and 250 columns
CPU32 cores64 cores64 cores
Memory128 GB256 GB512 GB
Disk storage (RWX Storage Class)500 GB1 TB1 TB

Multi-node deployments

For a multi-node deployment, you need a Kubernetes cluster with at least two nodes. One of the nodes functions as the application node and the remaining function as worker nodes.

Application node requirements

The application node runs the web-based user interface and distributes the synthetic data generation jobs across the worker nodes.

ResourcesSize
CPU4 cores
RAM8 GB
Storage20 GB

Worker nodes requirements

The worker nodes are responsible for running and processing the synthetic data generation jobs. Depending on the size of the datasets you want to synthesize, the worker node requirements are categorized into three tiers.

Tier 1Tier 2Tier 3
Data sizeDatasets with up to 1 million subjects and 100 columnsDatasets with up to 10 million subjects and 250 columnsDatasets with more than 10 million subjects and 250 columns
CPU32 cores64 cores64 cores
Memory128 GB256 GB512 GB
Disk storage (RWX Storage Class)500 GB1 TB1 TB