Fish Audio is a cost play with a licensing asterisk
At roughly $15 per million characters, Fish Audio can be about 10x cheaper than ElevenLabs while still ranking near the top on independent expressiveness benchmarks and cloning from a ~15-second sample. The catch is licensing: commercial use of the open weights requires a paid license, so confirm the terms before shipping revenue content. For high-volume TTS where budget dominates, it is a serious contender.

