open source

Jais 30B

Provided by: Framework: Unknown

Jais 30B is an open-source large language model developed by G42 and Cerebras, designed to advance Arabic and bilingual NLP research. Trained on over 116 billion Arabic and English tokens, it delivers 83.4% performance on the Arabic MMLU benchmark and supports cross-lingual reasoning, translation, and text generation. Jais 30B leverages a specialized tokenizer optimized for Arabic script, ensuring accurate morphological understanding and natural context flow. With its bilingual training and cultural adaptation, Jais 30B stands as the most powerful Arabic-English model for developers, researchers, and AI startups focusing on regional NLP solutions.

Model Performance Statistics

0

Views

February 18, 2025

Released

Aug 19, 2025

Last Checked

1.3

Version

Capabilities
  • Arabic NLP
  • Cross-cultural adaptation
  • Bilingual generation
Performance Benchmarks
XTREME-Ar76.8%
Arabic MMLU83.4%
Technical Specifications
Parameter Count
N/A
Training & Dataset

Dataset Used

Arabic web corpus (116B tokens)

Related AI Models

Discover similar AI models that might interest you

Modelfree

Bloom

Bloom

Bloom

BigScience

Bloom is an open-source multilingual transformer model developed by BigScience, designed for a variety of natural language processing tasks across multiple languages.

Natural Language Processingllm
47
Modelopen source

FastChat

FastChat

FastChat

LM Systems

FastChat is a powerful, Apache‑2.0 licensed, open‑source platform by LMSYS for training, serving, and evaluating large‑language‑model chatbots. It supports command‑line, Web‑UI & OpenAI‑compatible APIs, powering tools like Vicuna, Chatbot Arena, and FastChat‑T5.

Natural Language Processingllm
32
Modelfree

Phi-4

Phi-4

Phi-4

Microsoft

Efficient small-scale model with reasoning capabilities

Natural Language Processingllm
29