# Fish Audio Text-to-Speech

For best results, it is recommended to upload reference audio using the [Voice Cloning](/docs/models/reference-fish-audio-voice-cloning) API before using this API. This will improve voice quality and reduce latency.

Fish Audio converts text to speech.

Supported audio formats:

* WAV / PCM
  * Sample rates: 8kHz, 16kHz, 24kHz, 32kHz, 44.1kHz
  * Default sample rate: 44.1kHz
  * 16-bit, mono
* MP3
  * Sample rates: 32kHz, 44.1kHz
  * Default sample rate: 44.1kHz
  * Mono
  * Bitrates: 64kbps, 128kbps (default), 192kbps
* Opus
  * Sample rate: 48kHz
  * Default sample rate: 48kHz
  * Mono
  * Bitrates: -1000 (auto), 24kbps, 32kbps (default), 48kbps, 64kbps

## Request Headers

* Enum: `application/json`
* Bearer authentication format: `Bearer {{API Key}}`.

## Request Body

* The text to be converted to speech.
* Controls the randomness of speech generation. Higher values (e.g., 1.0) make the output more random, lower values (e.g., 0.1) make it more deterministic. We recommend `0.9` for the `s1` model. Required range: `0 <= x <= 1`
* Controls diversity through nucleus sampling. Lower values (e.g., 0.1) make the output more focused, higher values (e.g., 1.0) allow more diversity. We recommend `0.9` for the `s1` model. Required range: `0 <= x <= 1`
* Reference audio for the voice. This requires MessagePack serialization, which will override `reference_voices` and `reference_texts`.
  * Reference audio file.
  * Reference text corresponding to the audio.
* Reference model ID for the voice.
* Prosody control for the voice.
  * Voice speed control.
  * Voice volume control.
* Chunk length for the voice. Required range: `100 <= x <= 300`
* Whether to normalize the voice. This will reduce latency but may decrease performance on numbers and dates.
* Format for the voice.
  Possible values: `wav`, `pcm`, `mp3`, `opus`
* Sample rate for the voice.
* MP3 bitrate for the voice. Possible values: `64`, `128`, `192`
* Opus bitrate for the voice. Possible values: `-1000`, `24`, `32`, `48`, `64`
* Latency setting for the voice. `balanced` will reduce latency but may result in decreased performance. Possible values: `normal`, `balanced`

## Response

The API will return an audio stream in the format specified by the `format` parameter (Default: mp3).
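
The constraints listed on this page (sample rates per format, bitrate choices, numeric ranges) can be checked on the client before a request is sent. The following is a minimal sketch, not the official client. Apart from `format`, the JSON field names used here (`text`, `sample_rate`, `temperature`, `top_p`, `chunk_length`, `mp3_bitrate`, `opus_bitrate`, `latency`) are assumptions, since this page does not spell them out. The header names `Content-Type` and `Authorization` are likewise assumed from the `application/json` and Bearer descriptions above, and the endpoint URL is left to the caller because it is not given here.

```python
import json
import urllib.request

# Allowed values copied from the lists on this page (rates in Hz).
SAMPLE_RATES = {
    "wav": {8000, 16000, 24000, 32000, 44100},
    "pcm": {8000, 16000, 24000, 32000, 44100},
    "mp3": {32000, 44100},
    "opus": {48000},
}
MP3_BITRATES = {64, 128, 192}
OPUS_BITRATES = {-1000, 24, 32, 48, 64}
LATENCY_MODES = {"normal", "balanced"}


def build_tts_payload(text, audio_format="mp3", sample_rate=None,
                      temperature=None, top_p=None, chunk_length=None,
                      mp3_bitrate=None, opus_bitrate=None, latency=None):
    """Validate inputs against the documented ranges and return a dict.

    All keys except "format" are ASSUMED field names (hypothetical).
    Optional values left as None are omitted so the server defaults apply.
    """
    if audio_format not in SAMPLE_RATES:
        raise ValueError(f"unsupported format: {audio_format!r}")
    payload = {"text": text, "format": audio_format}

    if sample_rate is not None:
        if sample_rate not in SAMPLE_RATES[audio_format]:
            raise ValueError(f"{sample_rate} Hz is not listed for {audio_format}")
        payload["sample_rate"] = sample_rate

    # temperature and top_p share the documented range 0 <= x <= 1.
    for name, value in (("temperature", temperature), ("top_p", top_p)):
        if value is not None:
            if not 0 <= value <= 1:
                raise ValueError(f"{name} must satisfy 0 <= x <= 1")
            payload[name] = value

    if chunk_length is not None:
        if not 100 <= chunk_length <= 300:
            raise ValueError("chunk_length must satisfy 100 <= x <= 300")
        payload["chunk_length"] = chunk_length

    # A bitrate is only included when it matches the chosen format.
    if audio_format == "mp3" and mp3_bitrate is not None:
        if mp3_bitrate not in MP3_BITRATES:
            raise ValueError(f"unsupported MP3 bitrate: {mp3_bitrate}")
        payload["mp3_bitrate"] = mp3_bitrate
    if audio_format == "opus" and opus_bitrate is not None:
        if opus_bitrate not in OPUS_BITRATES:
            raise ValueError(f"unsupported Opus bitrate: {opus_bitrate}")
        payload["opus_bitrate"] = opus_bitrate

    if latency is not None:
        if latency not in LATENCY_MODES:
            raise ValueError(f"unsupported latency mode: {latency!r}")
        payload["latency"] = latency
    return payload


def build_request(url, api_key, payload):
    """Prepare (but do not send) a JSON POST request.

    Header names are assumptions based on the Request Headers section.
    """
    return urllib.request.Request(
        url,
        data=json.dumps(payload).encode("utf-8"),
        headers={
            "Content-Type": "application/json",
            "Authorization": f"Bearer {api_key}",
        },
        method="POST",
    )
```

To send the request, pass the result of `build_request` to `urllib.request.urlopen` and write the bytes from `response.read()` to a file whose extension matches the chosen `format`. The URL passed to `build_request` must come from the endpoint reference for this API.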