Bring Voice AI to your app
Leverage productoin-ready tools to create lifelike speech, clone voices, and transcribe audio with minimal setup
API
RESTful API with comprehensive documentation. Support for text-to-speech, voice cloning, and speech-to-text with low latency and high quality output.
# Text to Speech API
curl -X POST https://api.fish.audio/v1/tts \
-H "Authorization: Bearer $FISH_API_KEY" \
-H "Content-Type: application/json" \
-H "model: s1" \
-d '{"text": "Hello! Welcome to Fish Audio."}' \
--output welcome.mp3
Python SDK
Official Python SDK with async support, streaming capabilities, and comprehensive type hints for a seamless development experience.
# Install
pip install fish-audio-sdk
# Usage
from fishaudio import FishAudio
from fishaudio.utils import save
client = FishAudio(api_key="your_api_key_here")
audio = client.tts.convert(text="Hello! Welcome to Fish Audio.")
save(audio, "welcome.mp3")
API Pricing
Simple, transparent pricing with pay-as-you-go model. No hidden fees, no minimum commitments. Scale as you grow.
| Model Type | Model Name | Pricing |
|---|---|---|
| TTS | speech-1.5 | $15.00 / million UTF-8 bytes |
| TTS | speech-1.6 | $15.00 / million UTF-8 bytes |
| TTS | s1 | $15.00 / million UTF-8 bytes |
| ASR | transcribe-1 | $0.36 / hour |