Curated AI tools directory

All AI Tools

Browse AI tools for writing, image generation, video, office work, coding, and research.

Search results

The full catalog stays searchable and filterable without overpowering the decision paths above.

24 tools shown
ElevenLabs

An AI voice platform for text-to-speech, voice cloning, dubbing, narration, and multilingual audio generation.

Popular | Free tier | Text to speech
Fish Audio

A low-cost text-to-speech platform with open-weights voice cloning from a short sample, fine-grained emotion control, and 80+ language support.

New | Free tier | Text to speech
Cartesia

An ultra-low-latency text-to-speech API (Sonic) built for real-time conversational voice agents, billed per character with instant voice cloning.

New | Free tier | Realtime TTS
OpenAI TTS

OpenAI's text-to-speech API with preset natural voices and steerable tone, billed per token/character, with no voice cloning.

~$15/1M chars | Text to speech
Azure AI Speech (TTS)

Microsoft Azure's enterprise text-to-speech with 100+ languages and locales, neural and HD voices, custom voice options, Speech SDK/REST access, and compliance-grade infrastructure.

Free tier | Enterprise TTS
Chatterbox (Resemble AI)

An open-source (MIT) text-to-speech model family from Resemble AI with voice cloning from a few seconds of audio and competitive quality, free for commercial use.

New | Free tier | Open source TTS
Deepgram

A real-time speech-to-text platform (Nova/Flux) built for low-latency voice agents, with batch and streaming transcription and per-minute pricing.

Popular | Free tier | Speech to text
AssemblyAI

A speech-to-text API (Universal-3 Pro, Universal-2, and streaming models) pairing transcription with speech intelligence such as summaries, sentiment, topic detection, and speaker labels.

Free tier | Speech to text
OpenAI Whisper

OpenAI's open-source speech recognition model family supporting 99+ languages, widely used as an accuracy benchmark and free to self-host.

Popular | Free tier | Speech to text
Google Cloud Speech-to-Text

Google Cloud's enterprise speech recognition API with broad language coverage and streaming and batch transcription, running on Google's infrastructure.

Free tier | Speech to text
ElevenLabs Scribe

ElevenLabs' speech-to-text model (Scribe v2) for accurate multilingual transcription and real-time use, complementing its TTS platform.

New | Free tier | Speech to text
Suno

An AI music creation platform for generating songs, vocals, instrumentals, and creative audio from prompts.

New | Free tier | AI music
Udio

An AI music generator for creating songs, instrumental ideas, vocals, and shareable audio experiments.

New | Free tier | Music generator
Murf

An AI voice generator for studio-quality voiceovers, presentations, training videos, ads, and product explainers.

Free tier | Voiceover
Krisp

An AI meeting audio tool for noise cancellation, voice clarity, meeting notes, and call productivity.

Free tier | Noise cancellation
Adobe Podcast

Adobe's AI audio tool for enhancing speech, cleaning recordings, and improving podcast or voice content quality.

Free tier | Speech enhance
AIVA

An AI music composition platform for scores, instrumentals, and licensing-aware composer workflows.

New | Free tier | AI composer
SOUNDRAW

An AI background-music generator focused on royalty-free commercial tracks, editing, distribution, and API/enterprise paths.

New | Paid plans | Background music
Mubert

An AI music API and generation platform positioned around licensed/partner content and commercially safer background generation.

New | API tiers | Music API
OpenAI Realtime API

OpenAI's realtime audio API for building low-latency voice interactions, live speech conversations, and multimodal agent experiences.

New | Check pricing | Realtime audio
Retell AI

A platform for building, testing, deploying, and monitoring inbound and outbound AI phone agents with telephony, tools, and analytics.

New | Check pricing | AI phone agents
Bland AI

An enterprise voice AI platform for building, running, and monitoring inbound and outbound AI phone agents at scale.

New | Check pricing | Voice AI
Rasa Voice

Rasa's enterprise voice experience platform for realtime conversations with turn-taking, interruptions, and ASR/TTS provider control.

New | Check pricing | Enterprise voice
Inworld

A realtime voice and AI character platform with streaming TTS, STT, voice cloning, and API layers for voice-first applications.

New | Check pricing | Realtime voice