Model
/
LLM
/
DeepSeek

DeepSeek R1

欢迎进入 AI 新时代

GMI Cloud 为 AI 创新者提供强大的 NVIDIA GB200 NVL72 平台访问权限,为大语言模型(LLM)推理、向量数据库搜索和数据处理提供了突破性的性能。GB200 NVL72 由双 Blackwell GPU 和 NVIDIA的NVLink® 互连技术驱动,专为处理大规模 AI 工作负载而设计,通过 NVIDIA 可扩展的 MGX™ 架构,能够无缝集成到现有基础设施中。借助 GMI Cloud 和 NVIDIA GB200 NVL72,您可以更敏捷地扩展规模,更快地创新,充分释放加速计算的潜力。initial takeaways here.

Technical details:

Model Provider:
DeepSeek
Type:
Chat
Parameters:
685B
Deployment:
Serverless (MaaS) or Dedicated Endpoint
Quantization:
FP16
Context Length:
Up to 128,000 tokens

Distilled models offering:

  • DeepSeek-R1-Distill-Llama-70B
  • DeepSeek-R1-Distill-Qwen-32B
  • DeepSeek-R1-Distill-Qwen-14B
  • DeepSeek-R1-Distill-Llama-8B
  • DeepSeek-R1-Distill-Qwen-7B
  • DeepSeek-R1-Distill-Qwen-1.5B

Try our token-free service with unlimited usage!

Reach out for access to our dedicated endpoint Here.