- Home
- Categories
- Speech & Audio
- Google Cloud Speech-to-Text API
Google Cloud Speech-to-Text API
The Google Cloud Speech-to-Text API enables developers to convert spoken audio into accurate transcribed text, suitable for building voice assistants and transcription services.
Reference for available routes, request structures, and live examples.
Convert speech to text using Google Cloud Speech-to-Text
https://speech.googleapis.com/v1/speech:recognizecurl -X POST 'https://speech.googleapis.com/v1/speech:recognize' \
-H 'Authorization: Bearer YOUR_API_KEY'{
"audio": {
"content": "base64_encoded_audio"
},
"config": {
"encoding": "LINEAR16",
"sampleRateHertz": 16000
}
}{
"results": [
{
"alternatives": [
{
"transcript": "Hello, world!"
}
]
}
]
}- {'items': ['Building voice‑powered assistants that respond to user commands'Optimized Capability
- 'Transcribing recorded meetingsOptimized Capability
- interviewsOptimized Capability
- or podcasts'Optimized Capability
- 'Live captioning for streaming or video content'Optimized Capability
- 'Analyzing call center audio for sentiment and trends']}Optimized Capability
API Specifications
v1600 requests per minute per project
Use Case: Best For
Not Recommended For
Explore Related APIs
Discover similar APIs to Google Cloud Speech-to-Text API
Async.ai API
The Async.ai API provides developers with advanced voice cloning and text-to-speech capabilities, facilitating the creation of realistic audio experiences in applications.
AssemblyAI
AssemblyAI provides developers with a robust speech-to-text API, offering accurate transcriptions for both audio and video files, suited for media, healthcare, and customer service applications.