GPUs

Memory Requirements 117.64 GB

Requires 2 GPUs (based on memory capacity)

117 GB

All model weights

0.25 GB

Conversation history cache

0.29 GB

Expert model optimization

0.1 GB

Temporary computation cache

Scenario Examples (GPU + Model + Concurrency):

Click these examples to quickly configure popular model deployment scenarios!

📋 Calculation Formula FAQ