GPUs

Memory Requirements 110.61 GB

Requires 2 GPUs (based on memory capacity)

109 GB

All model weights

0.25 GB

Conversation history cache

1.13 GB

Expert model optimization

0.23 GB

Temporary computation cache

Scenario Examples (GPU + Model + Concurrency):

Click these examples to quickly configure popular model deployment scenarios!

📋 Calculation Formula FAQ