Requirements

Whether you're setting up LLMOS on a virtual machine or a bare-metal server, each node running LLMOS must meet the minimum requirements below. These requirements cover LLMOS and its components but not the resources needed by the workloads you run.

Prerequisites

  • If the node has NVIDIA GPUs, install the NVIDIA driver first.
  • No two nodes should have the same hostname.
  • The node must not have any existing Kubernetes clusters running on it.
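
The hostname rule can be checked before installation. The sketch below assumes you have already collected each node's hostname (for example by running `hostname` on every machine) into a newline-separated list. The function name `find_duplicate_hostnames` is hypothetical and not part of LLMOS.

```shell
# Print every hostname that appears more than once on stdin (one name per line).
# Names are lowercased first because hostnames are case-insensitive.
find_duplicate_hostnames() {
  tr '[:upper:]' '[:lower:]' | sort | uniq -d
}

# Example: node-a is listed twice (once in upper case), so it is reported.
printf '%s\n' node-a node-b NODE-A | find_duplicate_hostnames
```

An empty result means every hostname in the list is unique.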

Note: To check if the NVIDIA driver is installed correctly, run the following command:

nvidia-smi

The output should look something like this:

+---------------------------------------------------------------------------------------+
| NVIDIA-SMI 535.183.06             Driver Version: 535.183.06   CUDA Version: 12.2     |
|-----------------------------------------+----------------------+----------------------+
| GPU  Name                 Persistence-M | Bus-Id        Disp.A | Volatile Uncorr. ECC |
| Fan  Temp   Perf          Pwr:Usage/Cap |         Memory-Usage | GPU-Util  Compute M. |
|                                         |                      |               MIG M. |
|=========================================+======================+======================|
|   0  NVIDIA GeForce RTX 4090        Off | 00000000:01:00.0 Off |                  Off |
| 30%   41C    P0              54W / 450W |      0MiB / 24564MiB |      0%      Default |
|                                         |                      |                  N/A |
+-----------------------------------------+----------------------+----------------------+

+---------------------------------------------------------------------------------------+
| Processes:                                                                            |
|  GPU   GI   CI        PID   Type   Process name                            GPU Memory |
|        ID   ID                                                             Usage      |
|=======================================================================================|
|  No running processes found                                                           |
+---------------------------------------------------------------------------------------+
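
For scripted checks, the full table is harder to parse than a single line per GPU. The following is a minimal sketch that uses the `--query-gpu` and `--format` options of `nvidia-smi`. The wrapper name `check_nvidia_driver` is an assumption made for illustration.

```shell
# Report the driver version and GPU model, or fail if the driver is unusable.
check_nvidia_driver() {
  if ! command -v nvidia-smi >/dev/null 2>&1; then
    echo "nvidia-smi not found: the NVIDIA driver does not appear to be installed" >&2
    return 1
  fi
  # Prints one CSV line per GPU instead of the full table; a non-zero exit
  # status from nvidia-smi is passed through to the caller.
  nvidia-smi --query-gpu=driver_version,name --format=csv,noheader
}
```

Calling `check_nvidia_driver` on a GPU node prints one line per GPU with the driver version and model name; a non-zero exit status suggests the driver needs attention before you install LLMOS.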

Supported Architectures

LLMOS works on the following system architectures:

  • x86_64
  • arm64/aarch64
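
A node's architecture can be compared against this list with `uname -m`. This is a small sketch; the function name `is_supported_arch` is hypothetical.

```shell
# Succeed if the given machine string is one LLMOS lists as supported.
# With no argument, the local value from `uname -m` is used.
is_supported_arch() {
  arch="${1:-$(uname -m)}"
  case "$arch" in
    x86_64|arm64|aarch64) return 0 ;;
    *) return 1 ;;
  esac
}

if is_supported_arch; then
  echo "architecture $(uname -m): supported"
else
  echo "architecture $(uname -m): not in the supported list"
fi
```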

Supported Operating Systems

While LLMOS is expected to work on most modern Linux distributions, the team has officially tested and verified the following:

| Distribution | Version |
| --- | --- |
| Ubuntu | 20.04 LTS or newer |
| RHEL | 8.5 or newer |
| SLES/openSUSE | 15.3 (Leap) or newer |
| Rocky Linux | 8.5 or newer |
| Debian | 11.0 or newer |
| Fedora | 37 or newer |

Info: We recommend turning off firewalld. If you keep it enabled, you may need to open additional ports; see the Inbound Rules for LLMOS Nodes section.
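
The tested-version list can be turned into a quick pre-flight check. The sketch below assumes the node provides `/etc/os-release` and that `sort -V` (GNU version sort) is available. The function names and the mapping from `ID` values to minimum versions are illustrative, not an official LLMOS tool.

```shell
# Succeed if version $1 is greater than or equal to version $2.
version_ge() {
  [ "$(printf '%s\n%s\n' "$2" "$1" | sort -V | head -n 1)" = "$2" ]
}

# Map an os-release ID to the minimum tested version.
# Debian reports VERSION_ID as "11", so the minimum is written without ".0".
min_version_for() {
  case "$1" in
    ubuntu) echo 20.04 ;;
    rhel|rocky) echo 8.5 ;;
    sles|opensuse-leap) echo 15.3 ;;
    debian) echo 11 ;;
    fedora) echo 37 ;;
    *) return 1 ;;
  esac
}

# Succeed if distribution ID $1 at version $2 is in the tested list.
is_tested_os() {
  min="$(min_version_for "$1")" || return 1
  version_ge "$2" "$min"
}

# Check the local node; the subshell keeps os-release variables out of scope.
if [ -r /etc/os-release ]; then
  (
    . /etc/os-release
    if is_tested_os "${ID:-}" "${VERSION_ID:-}"; then
      echo "${ID:-unknown} ${VERSION_ID:-}: in the tested list"
    else
      echo "${ID:-unknown} ${VERSION_ID:-}: not in the tested list"
    fi
  )
fi
```

A distribution outside the list may still work, since LLMOS is expected to run on most modern Linux distributions; the check only reports whether the combination was officially tested.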

Hardware Requirements

Hardware needs vary with deployment size. The table below lists the minimum and recommended specifications for each node:

| Spec | Minimum Requirement | Recommended Spec |
| --- | --- | --- |
| CPU | 4 cores | 8 cores |
| Memory | 16 GB | 32 GB |
| Root Disk Space | 100 GB | 200 GB |
| Disk Performance | 5,000+ random IOPS for root disk (SSD/NVMe) | Same as minimum requirements |
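
The CPU, memory, and disk minimums can be checked from a shell. This is a rough sketch for Linux nodes: `meets_minimum` is a hypothetical helper, and the gathered sizes are rounded to the nearest GB, so treat a borderline result as a prompt to look closer rather than a verdict. It does not measure IOPS.

```shell
# Compare measured values against the minimums in the hardware table.
# Arguments: CPU cores, memory in GB, root disk size in GB (whole numbers).
meets_minimum() {
  rc=0
  [ "$1" -ge 4 ]   || { echo "CPU: $1 cores, need at least 4"; rc=1; }
  [ "$2" -ge 16 ]  || { echo "Memory: $2 GB, need at least 16"; rc=1; }
  [ "$3" -ge 100 ] || { echo "Root disk: $3 GB, need at least 100"; rc=1; }
  return "$rc"
}

# Gather local values on Linux. The kernel and filesystem report slightly
# less than the nominal capacity, hence the rounding.
if [ -r /proc/meminfo ]; then
  cores=$(getconf _NPROCESSORS_ONLN)
  mem_gb=$(awk '/^MemTotal:/ {printf "%d", $2 / 1048576 + 0.5}' /proc/meminfo)
  disk_gb=$(df -Pk / | awk 'NR == 2 {printf "%d", $2 / 1048576 + 0.5}')
  if meets_minimum "$cores" "$mem_gb" "$disk_gb"; then
    echo "minimum hardware requirements met"
  fi
fi
```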

Disk Type

For better performance, always use SSDs for your LLMOS cluster. If you’re using a cloud provider, choose a disk type and size that meets or exceeds the 5,000 random IOPS minimum listed above.
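
On Linux, the kernel exposes a per-device `rotational` flag under `/sys/block`, where `0` indicates a non-rotational device such as an SSD or NVMe drive. The sketch below reads that flag; the function name is hypothetical. The flag reflects what the kernel reports and can be misleading for virtualized or cloud disks, so it complements an IOPS measurement rather than replacing one.

```shell
# Succeed if the block device named $1 (e.g. sda, nvme0n1) is non-rotational.
# $2 optionally overrides the sysfs root, which is only useful for testing.
is_non_rotational() {
  flag_file="${2:-/sys/block}/$1/queue/rotational"
  [ -r "$flag_file" ] && [ "$(cat "$flag_file")" = "0" ]
}

# Example (device name is a placeholder):
#   is_non_rotational nvme0n1 && echo "nvme0n1 looks like an SSD"
```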

Networking

Every LLMOS node must be able to reach ports 6443 and 8443 on all other nodes. Beyond the ports listed in this section, a node should not need to expose any others.

LLMOS uses reverse tunneling: nodes establish outbound connections to the server, and all kubelet traffic runs through this tunnel.

To use the metrics server, ensure all nodes can communicate with each other on port 10250.
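
TCP reachability between nodes can be probed before installation. The following sketch assumes `bash` and the coreutils `timeout` command are present; it uses bash's `/dev/tcp` pseudo-device, so it covers only TCP ports (6443, 8443, and 10250 here). The function names are hypothetical.

```shell
# Succeed if a TCP connection to host $1 on port $2 opens within 3 seconds.
can_reach() {
  timeout 3 bash -c 'exec 3<>"/dev/tcp/$1/$2"' _ "$1" "$2" 2>/dev/null
}

# Probe the TCP ports named in this section on the host given as $1.
check_llmos_ports() {
  for port in 6443 8443 10250; do
    if can_reach "$1" "$port"; then
      echo "$1:$port reachable"
    else
      echo "$1:$port unreachable"
    fi
  done
}
```

Run `check_llmos_ports <node-address>` from one node against another. A port shows as unreachable both when a firewall blocks it and when nothing is listening yet, so results are most meaningful once LLMOS is running on the target.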

Inbound Rules for LLMOS Nodes

| Protocol | Port | Source | Destination | Description |
| --- | --- | --- | --- | --- |
| TCP | 2379-2380 | Servers | Servers | Required for HA setups with embedded etcd |
| TCP | 6443 | Workers | Servers | Kubernetes API Server |
| TCP | 8443 | All nodes | All nodes | LLMOS dashboard and API Server |
| UDP | 8472 | All nodes | All nodes | For system VXLAN network |
| TCP | 10250 | All nodes | All nodes | Metrics server |
| TCP | 5001, 6443 | All nodes | All nodes | Embedded distributed registry (Spegel) |
| TCP | 30000-32767 | All nodes | All nodes | NodePort port range |

Usually, all outbound traffic is allowed. Depending on your OS, you may need to adjust the firewall settings.
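
If firewalld stays enabled, the inbound rules translate into `firewall-cmd` invocations. The sketch below only prints the commands so you can review them first; the function name is hypothetical. It opens each port in the default zone for any source, which is broader than the Source column of the inbound rules, so tighten it to match your environment.

```shell
# Print firewall-cmd commands for the inbound rules; nothing is executed.
# Drop rules your topology does not need (for example 2379-2380 outside
# HA setups), then run the remaining commands as root.
print_firewalld_commands() {
  for rule in 2379-2380/tcp 6443/tcp 8443/tcp 8472/udp 10250/tcp 5001/tcp 30000-32767/tcp; do
    echo "firewall-cmd --permanent --add-port=$rule"
  done
  # Permanent rules take effect only after a reload.
  echo "firewall-cmd --reload"
}

print_firewalld_commands
```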