

DeepSeek is an AI company founded in 2023, focusing on developing leading general artificial intelligence foundation models and technologies. They have released and open-sourced several large-scale models with billions of parameters, including DeepSeek-LLM, DeepSeek-Coder, and DeepSeek-MoE. DeepSeek provides APIs for accessing their models, allowing users to integrate AI capabilities into their applications.
07 Feb 2025
Readmore


DeepSeek v3 is a powerful 671B parameter Mixture-of-Experts (MoE) language model that offers groundbreaking performance. It is an AI-driven LLM with 671B total parameters (37B activated per token) and supports API access, an online demo, and research papers. Pre-trained on 14.8 trillion high-quality tokens, DeepSeek v3 delivers state-of-the-art results across various benchmarks, including mathematics, coding, and multilingual tasks, while maintaining efficient inference. It features a 128K context window and incorporates Multi-Token Prediction for enhanced performance and acceleration.
24 Jan 2025
Readmore