Deploy, manage, and scale AI models across your own hardware - on-premise, private cloud, or hybrid. Host models on your own, with full control over compute and every layer of your AI infrastructure.
Free and open source.
| Setup | Specs | Best for |
|---|---|---|
| Server Solution | 60GB VRAM, 256GB RAM, 8×RTX 4000 ADA | Large models, high performance |
| PC workstation | 32–128GB RAM, 2×Nvidia GPU | Small and medium models at speed |
| Mac Studio | 35–512GB RAM, up to 80-core GPU | Large models at moderate speed |
| MacBook Pro | 36–128GB RAM, up to 40-core GPU | Medium models, individual use |
