open source

DeepSeek-Coder

Provided by: Framework: PyTorch

DeepSeek‑Coder is a series of open-source code language models developed by DeepSeek AI using PyTorch. Trained from scratch on 2 trillion tokens (87% code, 13% natural language), with model sizes from 1.3B to 33B parameters and a 16K window context. It excels at project‑level code completion, infilling, and supports dozens of programming languages. It consistently leads benchmarks like HumanEval, MultiPL‑E, and MBPP in open-source comparisons.

Model Performance Statistics

13

Views

January 28, 2024

Released

Jul 20, 2025

Last Checked

1.5

Version

Capabilities
  • Code Completion
  • Bug Fixing
  • Doc Generation
Performance Benchmarks
HumanEval74.4%
Repo-Level Coding68.9%
Technical Specifications
Parameter Count
N/A
Training & Dataset

Dataset Used

GitHub, StackOverflow, technical docs

Related AI Models

Discover similar AI models that might interest you

Modelopen source

StarCoder2

StarCoder2

StarCoder2

BigCode

StarCoder2 is a large-scale open-source AI model developed by BigCode for code generation and comprehension tasks. Built with PyTorch and licensed under Apache 2.0, it supports multiple programming languages and is optimized for both code completion and generation. The model is designed to aid developers by automating code writing, improving productivity, and enabling advanced programming assistance.

Code Generationcode-generationdeveloper
13
Modelopen source

DBRX Instruct

DBRX Instruct

DBRX Instruct

Databricks

DBRX Instruct is an open-source large language model developed by Databricks, designed for code generation, reasoning, and tool-assisted problem solving. Featuring a Mixture of Experts (MoE) architecture with 132 billion total parameters—36 billion active per inference—it combines high performance with efficient scaling. DBRX Instruct achieves 74.5% on HumanEval and 84.7% on GSM8K, outperforming many models in programming and logical reasoning benchmarks. Built on Databricks’ enterprise-grade infrastructure, it supports fine-tuning and deployment across Databricks, AWS, and Hugging Face, making it ideal for developers and organizations seeking open, high-performing alternatives to proprietary LLMs.

Code Generationcode-generationllm
0
Modelopen source

Emu2-Chat

Emu2-Chat

Emu2-Chat

Beijing Academy of AI

Emu2-Chat is a conversational AI model designed for engaging and context-aware chat interactions. It is optimized for natural language understanding and generating human-like responses across various domains. Ideal for chatbots, virtual assistants, and customer support automation.

Multimodalconversational
94