NVIDIA® H200 GPU 算力租賃 | 彈性定價免綁約

GMI Cloud 為 AI 創新者提供最先進的 NVIDIA GB200 NVL72 平台，在大型語言模型（LLM）推理、向量資料庫搜尋和資料處理方面帶來突破性的效能。GB200 NVL72 搭載雙 Blackwell GPU 和 NVIDIA NVLink® 互連技術，專為處理大規模 AI 工作負載而設計，並透過 NVIDIA 的可擴展 MGX™ 架構，輕鬆整合至現有基礎設施。結合 GMI Cloud 和 NVIDIA GB200 NVL72，讓您能更智慧地擴展規模，加速創新，充分發揮運算潛力。initial takeaways here.

搶先預訂

Technical details:

Model Provider:

DeepSeek

Type:

Chat

Parameters:

685B

Deployment:

Serverless (MaaS) or Dedicated Endpoint

Quantization:

FP16

Context Length:

Up to 128,000 tokens

Distilled models offering:

DeepSeek-R1-Distill-Llama-70B
DeepSeek-R1-Distill-Qwen-32B
DeepSeek-R1-Distill-Qwen-14B
DeepSeek-R1-Distill-Llama-8B
DeepSeek-R1-Distill-Qwen-7B
DeepSeek-R1-Distill-Qwen-1.5B

Try our token-free service with unlimited usage!

Reach out for access to our dedicated endpoint Here.

DeepSeek R1

Technical details:

Distilled models offering:

Try our token-free service with unlimited usage!

訂閱 GMI Cloud 電子報