open source

StarCoder2

Provided by:

BigCode

• Framework: PyTorch

StarCoder2 is a large-scale open-source AI model developed by BigCode for code generation and comprehension tasks. Built with PyTorch and licensed under Apache 2.0, it supports multiple programming languages and is optimized for both code completion and generation. The model is designed to aid developers by automating code writing, improving productivity, and enabling advanced programming assistance.

StarCoder2 AI Model

Views

May 12, 2024

Released

Jul 20, 2025

Last Checked

2.0

Version

Capabilities

Code Completion
Debugging
Doc Generation

Performance Benchmarks

HumanEval81.5%

MultiPL-E76.9%

Technical Specifications

Parameter Count: N/A

Training & Dataset

Dataset Used

The Stack v2, GitHub public repos

Related AI Models

Discover similar AI models that might interest you

More AI Models

Modelopen source

DeepSeek-Coder

DeepSeek AI

DeepSeek‑Coder is a series of open-source code language models developed by DeepSeek AI using PyTorch. Trained from scratch on 2 trillion tokens (87% code, 13% natural language), with model sizes from 1.3B to 33B parameters and a 16K window context. It excels at project‑level code completion, infilling, and supports dozens of programming languages. It consistently leads benchmarks like HumanEval, MultiPL‑E, and MBPP in open-source comparisons.

Code Generationcode-generationdeveloper

Modelopen source

DBRX Instruct

Databricks

DBRX Instruct is an open-source large language model developed by Databricks, designed for code generation, reasoning, and tool-assisted problem solving. Featuring a Mixture of Experts (MoE) architecture with 132 billion total parameters—36 billion active per inference—it combines high performance with efficient scaling. DBRX Instruct achieves 74.5% on HumanEval and 84.7% on GSM8K, outperforming many models in programming and logical reasoning benchmarks. Built on Databricks’ enterprise-grade infrastructure, it supports fine-tuning and deployment across Databricks, AWS, and Hugging Face, making it ideal for developers and organizations seeking open, high-performing alternatives to proprietary LLMs.

Code Generationcode-generationllm

Modelopen source

Emu2-Chat

Beijing Academy of AI

Emu2-Chat is a conversational AI model designed for engaging and context-aware chat interactions. It is optimized for natural language understanding and generating human-like responses across various domains. Ideal for chatbots, virtual assistants, and customer support automation.

Multimodalconversational

Model Performance Statistics

Dataset Used

Related AI Models

DeepSeek-Coder

DeepSeek-Coder

DBRX Instruct

DBRX Instruct

Emu2-Chat

Emu2-Chat