Category1free & open source
🗣️

Natural Language Processing

NLP models for text classification, sentiment analysis, summarisation, machine translation, named entity recognition, question answering, and conversational AI — 2M+ models on Hugging Face Hub.

4APIs16AI Models
Most Popular In
Text ClassificationSentiment AnalysisSummarisation
Auth Breakdown
API Key75%
OAuth25%
Notable Developers
Hugging FaceGoogleMeta AIOpenAICohere
Updated Jun 12, 2026
Curated by FreeAPIHub editors
Topics:Text ClassificationSentiment AnalysisSummarisationMachine TranslationNamed Entity RecognitionQuestion Answering
20 of 20
Access:
Auth:
Format:
Top Resources
Google Cloud Natural Language API logo

Google Cloud Natural Language API

API · Natural Language Processing
FreemiumOAuth

Google Cloud Natural Language analyses text: sentiment, entities, entity sentiment, syntax and content classification. Send text and get structured insights as JSON, part of Google Cloud's AI services.

3500+ usersNot rated yetView
SYSTRAN Translation API logo

SYSTRAN Translation API

API · Natural Language Processing
Free tierAPI Key

SYSTRAN is a machine-translation API. Translate text between many language pairs, detect the source language, and handle formats like plain text and HTML, returned as JSON, with an API key.

1K+ usersNot rated yetView
Twilio Autopilot API logo

Twilio Autopilot API

API · Natural Language Processing
FreemiumAPI Key

Twilio Autopilot was Twilio's conversational-AI API for building chat and voice bots with natural-language understanding. Twilio retired it in 2023, so it is no longer available - newer NLU and LLM platforms replace it.

5K+ usersNot rated yetView
Google Cloud Translation API logo

Google Cloud Translation API

API · Natural Language Processing
FreemiumAPI Key

The Google Cloud Translation API translates text between languages, detects the source language and lists supported languages, using Google's machine-translation models.

50K+ usersNot rated yetView

Jais 30B

Model · Inception (G42), MBZUAI, Cerebras
Apache 2.0

Jais is a family of open Arabic-English bilingual large language models from Inception (G42), MBZUAI and Cerebras. Built to be the best open Arabic LLM, it brings strong Arabic understanding and generation alongside English.

↓ 380K+Not rated yetView

Mistral 8x22B

Model · Mistral AI
Apache 2.0

Mixtral 8x22B is Mistral AI's open sparse mixture-of-experts model. With 141B total but only ~39B active parameters per token, it pairs strong quality and a 64K context with efficient inference, under a permissive Apache 2.0 licence.

↓ 3M+Not rated yetView
FA

FastChat

🔥 Hot
by LMSYS (UC Berkeley)

FastChat is LMSYS's open platform for training, serving and evaluating large language model chatbots. It powers Vicuna and the Chatbot Arena, and exposes an OpenAI-compatible API server for local models.

Apache 2.0Serving framework (u
View model
ML

MLC-LLM

🔥 Hot
by MLC AI Team (CMU, SJTU, Apache TVM)

MLC LLM is a universal deployment engine that compiles and runs large language models natively on almost any hardware — phones, laptops, browsers and servers — using machine-learning compilation built on Apache TVM.

Apache 2.0Deployment engine (c
View model
MS

Mistral Small 3

🔥 Hot
by Mistral AI · 32768 (128K in v3.1+ ctx

Mistral Small 3 is a 24B open model from Mistral AI built for low latency. Apache-2.0 licensed, it delivers strong reasoning and instruction following at a size that competes with much larger models while staying fast and efficient.

Apache 2.024B
View model
PH

Phi-4

🔥 Hot
by Microsoft Research · 16K ctx

Phi-4 is Microsoft's 14B small language model that delivers reasoning and math performance rivaling far larger models, achieved through heavy use of high-quality synthetic 'textbook' training data. Open weights under MIT.

MIT14B (Phi-4) / 3.8B (
View model
BE

BERT

🔥 Hot
by Google AI · 512 ctx

BERT is Google's landmark bidirectional transformer that reshaped natural language processing. By pretraining on masked-language modelling, it learns deep two-way context and fine-tunes to leading results on understanding tasks.

Apache 2.0BERT-Base 110M / BER
View model
TE

TensorRT-LLM

🔥 Hot
by NVIDIA

TensorRT-LLM is NVIDIA's open-source library for optimising large-language-model inference on NVIDIA GPUs. It compiles models into highly tuned engines with quantisation, batching and kernel fusion for maximum throughput and low latency.

Apache 2.0Inference engine (wo
View model
R1

R1 1776

🔥 Hot
by Perplexity AI · 131K ctx

R1-1776 is Perplexity AI's open post-trained version of DeepSeek-R1. It preserves the model's strong reasoning while removing built-in censorship, producing unbiased, factual answers on sensitive topics. Released openly under MIT.

MIT671B total / 37B act
View model
L2

Llama 2

🔥 Hot
by Meta AI · 4K ctx

Llama 2 is Meta's landmark open large language model, released in 7B, 13B and 70B sizes with chat-tuned variants. Its capable quality and broadly permissive licence made it the foundation of the modern open-LLM ecosystem.

Llama Community License7B / 13B / 70B
View model
T5

T5

🔥 Hot
by Google Research · 512 (Long-T5: 16384) ctx

T5 (Text-to-Text Transfer Transformer) is Google's influential encoder-decoder model that frames every NLP task as text-to-text. From translation to summarization to classification, one unified format handles them all.

Apache 2.060M / 220M / 770M /
View model
YI

Yi-34B

🔥 Hot
by 01.AI · 200K ctx

Yi-34B is a bilingual (English and Chinese) open large language model from 01.AI, strong on reasoning and knowledge for its size, with a 200K-token long-context variant and a permissive licence for commercial use.

Apache 2.06B / 9B / 34B
View model
MA

Mamba-2.8B

🔥 Hot
by Albert Gu (CMU) & Tri Dao (Princeton)

Mamba-2 is a selective state-space model (SSM) from CMU and Princeton that matches Transformer quality on language modelling while scaling linearly with sequence length, released openly under Apache 2.0.

Apache 2.0130M / 370M / 790M /
View model
DI

DBRX Instruct

🔥 Hot
by Databricks · 32K ctx

DBRX is an open mixture-of-experts large language model from Databricks with 132B total parameters but only 36B active per token, giving strong quality with efficient inference and a 32K context window.

Databricks Open Model License132B total / 36B act
View model
QW

Qwen1.5-72B

🔥 Hot
by Alibaba Cloud (Qwen Team) · 32K ctx

Qwen1.5 72B is a large open language model from Alibaba Cloud. Strongly multilingual with a 32K context, it offers competitive reasoning, coding and chat across many languages, and ships with chat-tuned variants and broad framework support.

Qwen License (commercial use allowed)0.5B / 1.8B / 4B / 7
View model
O1

OLMo 1.7

🔥 Hot
by Allen Institute for AI (AI2) · 4K ctx

OLMo is a truly open language model from the Allen Institute for AI (AI2): open weights, open training code, and the open Dolma dataset. It is built for reproducible, transparent LLM research, with 1B and 7B sizes.

Apache 2.01B / 7B / 32B
View model
Showing 20 of 20 resources

At a glance

Compare the top Natural Language Processing APIs

Browse all APIs
APIAccessAuthFormatsRating
SYSTRAN Translation API logo
SYSTRAN Translation API
Free tierAPI KeyRESTJSONView
Twilio Autopilot API logo
Twilio Autopilot API
FreemiumAPI KeyRESTJSONView
Google Cloud Translation API logo
Google Cloud Translation API
FreemiumAPI KeyRESTJSONView

About this category

Natural Language Processing — developer guide

What Is Natural Language Processing?

Natural Language Processing (NLP) is the branch of AI that enables machines to read, understand, and generate human language. NLP models power search engines, content moderation systems, customer support automation, medical record analysis, financial document processing, and the conversational AI assistants used by hundreds of millions daily. The Hugging Face Hub now hosts over 2 million models (October 2025 milestone), with NLP models accounting for 58% of downloads — a testament to how central language understanding is to modern AI applications.

Core NLP Tasks and Applications

  • Text classification — sentiment analysis, spam detection, topic labelling, intent recognition
  • Named entity recognition (NER) — extract names, dates, organisations, locations from unstructured text
  • Summarisation — condense long documents into key points for research and content tools
  • Machine translation — translate text across 100+ language pairs at scale
  • Question answering — extract precise answers from documents without full LLM generation overhead
  • Zero-shot classification — classify text into arbitrary categories without labelled training data

NLP Models to Know in 2026

ModernBERT (December 2024) is the updated BERT replacement — faster, longer context (8K tokens), and trained on fresh data including code. It's the new default for encoder-based NLP tasks like classification and NER. DeBERTa-v3-large still leads many text-classification benchmarks for fine-tuned use. For multilingual NLP, XLM-RoBERTa-XL handles 100 languages. For summarisation, BART-Large-CNN and Pegasus-X remain strong open-source choices. For zero-shot classification, facebook/bart-large-mnli is widely used and free on Hugging Face Inference API.