NVIDIA® H200 GPU云服务器现货发售

GMI Cloud 为 AI 创新者提供强大的 NVIDIA GB200 NVL72 平台访问权限，为大语言模型（LLM）推理、向量数据库搜索和数据处理提供了突破性的性能。GB200 NVL72 由双 Blackwell GPU 和 NVIDIA的NVLink® 互连技术驱动，专为处理大规模 AI 工作负载而设计，通过 NVIDIA 可扩展的 MGX™ 架构，能够无缝集成到现有基础设施中。借助 GMI Cloud 和 NVIDIA GB200 NVL72，您可以更敏捷地扩展规模，更快地创新，充分释放加速计算的潜力。initial takeaways here.

立即预订

Technical details:

Model Provider:

DeepSeek

Type:

Chat

Parameters:

685B

Deployment:

Serverless (MaaS) or Dedicated Endpoint

Quantization:

FP16

Context Length:

Up to 128,000 tokens

Distilled models offering:

DeepSeek-R1-Distill-Llama-70B
DeepSeek-R1-Distill-Qwen-32B
DeepSeek-R1-Distill-Qwen-14B
DeepSeek-R1-Distill-Llama-8B
DeepSeek-R1-Distill-Qwen-7B
DeepSeek-R1-Distill-Qwen-1.5B

Try our token-free service with unlimited usage!

Reach out for access to our dedicated endpoint Here.

DeepSeek R1

Technical details:

Distilled models offering:

Try our token-free service with unlimited usage!

订阅 GMI Cloud 最新资讯