Ollama API

Ollama is an open-source framework for running large language models such as Llama 2, Mistral, and Qwen locally on your machine. It exposes a REST API on localhost, alongside an endpoint compatible with OpenAI's Chat Completions API, so developers can integrate LLMs into their applications without relying on external servers.

Last Checked: Jul 20, 2025

API Endpoints

Generates text using local language models

Full URL

http://localhost:11434/api/generate

Code Examples

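curl -X POST 'http://localhost:11434/api/generate' -H 'Content-Type: application/json' -d '{"model": "llama2", "prompt": "Explain quantum computing basics", "stream": false}'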

Parameters

{
  "model": "llama2",
  "prompt": "Explain quantum computing basics",
  "stream": false
}
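
The parameters above can also be sent from Python. The following is a minimal sketch using only the standard library; it assumes Ollama is listening on its default local port and that the llama2 model has already been pulled. The helper names build_generate_request and generate are illustrative and not part of Ollama.

```python
import json
import urllib.request

# Assumption: Ollama is running locally on its default port.
OLLAMA_URL = "http://localhost:11434/api/generate"


def build_generate_request(model, prompt, stream=False, url=OLLAMA_URL):
    """Build (but do not send) a POST request carrying the JSON parameters."""
    payload = {"model": model, "prompt": prompt, "stream": stream}
    return urllib.request.Request(
        url,
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def generate(model, prompt, timeout=120):
    """Send a non-streaming request and return the decoded JSON reply."""
    request = build_generate_request(model, prompt, stream=False)
    with urllib.request.urlopen(request, timeout=timeout) as reply:
        return json.loads(reply.read().decode("utf-8"))
```

With a local server running, generate("llama2", "Explain quantum computing basics")["response"] should hold the generated text. Keeping request construction separate from sending makes the payload easy to inspect without a server.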

Example Response

{
  "done": true,
  "model": "llama2",
  "response": "Quantum computing uses qubits...",
  "created_at": "2023-07-18T16:00:00Z"
}
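
A response of this shape can be handled as in the sketch below. It assumes a non-streaming reply with the fields shown in the example; parse_generate_response is a hypothetical helper name, and the timestamp is kept as a plain string rather than parsed.

```python
import json


def parse_generate_response(raw):
    """Extract the generated text from a non-streaming reply body."""
    data = json.loads(raw)
    # "done" signals that generation finished; treat anything else as an error.
    if not data.get("done", False):
        raise ValueError("generation did not finish")
    return {
        "model": data["model"],
        "text": data["response"],
        "created_at": data["created_at"],  # left as the raw timestamp string
    }


# The example response from this page, used as sample input.
sample = (
    '{"done": true, "model": "llama2", '
    '"response": "Quantum computing uses qubits...", '
    '"created_at": "2023-07-18T16:00:00Z"}'
)
result = parse_generate_response(sample)
print(result["text"])
```

Checking the done flag before reading the response field avoids treating a partial reply as a complete answer.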

Version

v1
