GPUs

Memory Requirements 117.72 GB

Requires 2 GPUs (based on memory capacity)

117 GB

All model weights

0.28 GB

Conversation history cache

0.34 GB

Expert model optimization

0.1 GB

Temporary computation cache

Scenario Examples (GPU + Model + Concurrency):

Click these examples to quickly configure popular model deployment scenarios!

📋 Calculation Formula FAQ