🗣️ Speech AI tools

SpeechPulse is a comprehensive voice typing software that uses Whisper voice recognition to enable real-time speech-to-text conversion across all applications, supporting 99 languages and offline processing for enhanced privacy.
- Pay Once

TTS Voice Wizard offers high-quality speech recognition and synthesis with a wide range of voices and language support. It integrates with various services and provides features like VRChat interaction and heart rate sharing.
- Free

AssemblyAI is a comprehensive speech-to-text platform offering advanced AI models for voice data processing, including real-time transcription, speaker diarization, and speech understanding capabilities with up to 95% accuracy.
- Freemium

Voice.ai offers a free real-time AI voice changer and a comprehensive ecosystem of AI voice tools for gaming, streaming, and communication.
- Freemium
- API

Talkscriber is a secure and cost-effective enterprise-grade speech-to-text platform, delivering high accuracy and advanced features like emotion and purchase intent detection.
- Usage Based

AppTek.ai is a global leader in AI and ML technologies specializing in speech recognition, neural machine translation, and language processing solutions. Their platform delivers enterprise-grade language technologies across multiple industries using advanced neural networks and machine learning.
- Contact for Pricing

Open Voice OS is an open-source voice AI platform that enables developers to create custom voice-controlled interfaces with privacy-focused features, NLP capabilities, and a customizable UI. It supports multiple platforms and devices, making it ideal for DIY smart speaker projects.
- Free

Akkadu provides real-time AI subtitles for videos, live streams, webinars, and video conferences in over 90 languages. An effective tool to make content accessible across various languages.
- Usage Based

Deepgram provides APIs for speech-to-text, text-to-speech, and speech-to-speech voice agents, enabling developers to build voice AI products and features.
- Usage Based

Dictation.io is a web-based speech recognition tool that accurately transcribes voice to text in real-time, supporting multiple languages and voice commands within Google Chrome.
- Free

Accent Guesser is a free online tool that uses AI to analyze your speech and identify your accent characteristics. Get instant, accurate results and insights for pronunciation improvement.
- Free

Vext is an AI-powered speech-to-text tool that provides real-time captions and translations for meetings, events, and videos. It ensures seamless communication and accessibility across various platforms.
- Free

Vocol AI empowers individuals and enterprises to efficiently transform voice data into text and actionable insights, enhancing collaboration and productivity.
- Freemium
- API

Wavify is a platform for on-device speech AI, enabling software engineers to embed features like speech recognition and wake word detection into any software.
- Freemium
- From 150$

Speech Intellect offers real-time speech-to-text and text-to-speech solutions using a unique AI-focused mathematical theory, "Sense Theory," for enhanced understanding and generation of human-like voice.
- Usage Based

Voicetapp is a comprehensive AI platform offering speech-to-text transcription, content writing, voiceover generation, and YouTube-to-blog conversion capabilities with multilingual support and up to 99% accuracy.
- Paid
- From 12$

Speechmatics offers enterprise-grade APIs for Automatic Speech Recognition (ASR) and building Conversational AI products, delivering top transcription accuracy and supporting over 50 languages.
- Freemium

Picovoice is a platform for building voice-enabled applications with on-device voice AI and local LLMs, ensuring privacy, low latency, and efficiency.
- Freemium

Open Voice OS is an open-source voice AI platform that enables developers to create custom voice-controlled interfaces with NLP capabilities, focusing on privacy, security, and multi-platform compatibility.
- Free

WhisperUI is a web-based speech-to-text conversion tool that leverages OpenAI's Whisper ASR system to transcribe audio files into text and SRT formats with high accuracy across multiple languages.
- Freemium

Lemonfox.ai provides a cost-effective, high-quality speech-to-text API with features like speaker recognition and support for over 100 languages. It also offers LLM Chat and SDXL Image APIs.
- Paid
- From 5$

Duzo.ai is an AI-powered platform for content translation, voice cloning, lip-syncing, and subtitle generation, helping creators reach a global audience across 29+ languages.
- Freemium
- From 22$

LumenVox provides AI-powered speech recognition and voice authentication solutions for businesses, offering automatic speech recognition, call progress analysis, voice biometrics, and neural text-to-speech capabilities.
- Contact for Pricing

Murf AI is a versatile and powerful text to speech software ideal for education, marketing, corporate coaching, podcasting, animation, customer support, and more. With over 120+ voices in 20+ languages, users can create studio-quality voice overs in minutes for videos, presentations, podcasts, and other professional uses.
- Freemium
- From 19$
- API

Muchtodo is an AI-powered task management platform that converts voice input into projects, tasks, and notes across 57 languages, helping users save time and boost productivity.
- Free Trial
- From 3$

US-based AI startup ClearCypherAI excels in creating advanced multilingual, multimodal, real-time voice intelligence solutions, including text-to-audio, audio-to-text, and audio-to-audio conversions.
- Contact for Pricing
- API

A voice-powered AI to-do list manager that converts spoken tasks into organized lists in under 10 seconds, supporting 57 languages and offering intelligent task recommendations.
- Freemium
- From 3$

VoiceGPT is a specialized Android browser with voice capabilities that enhances accessibility to AI platforms like ChatGPT, Bing AI, and Bard through speech recognition and text-to-speech features, supporting 67+ languages.
- Freemium

Resemble AI offers a powerful voice AI generator that allows users to create realistic human-like voiceovers in seconds. It enables features like text to speech, speech to speech, neural audio editing, and language dubbing.
- Free Trial
- API

InteliConvo is an AI-powered speech analytics platform that analyzes customer conversations to improve sales, collections, customer experience, and compliance.
- Free Trial

Gladia is a comprehensive speech-to-text API platform offering real-time and asynchronous transcription services with multilingual support, featuring <300ms latency and advanced audio intelligence add-ons.
- Freemium

Fluent.ai provides unique speech-to-intent technology offering offline, noise-robust speech recognition that supports any language and accent.
- Contact for Pricing

Cockatoo is an AI-powered transcription tool that converts audio and video files to text with 99.8% accuracy in over 90 languages, processing 1 hour of content in just 2-3 minutes.
- Freemium
- From 9$

LinguaPeak offers AI-driven IELTS speaking practice with personalized feedback and real-time analysis to help users improve their scores. It provides mock exams, detailed analytics, and multi-accent training.
- Freemium
- From 20$
Featured Tools
Join Our Newsletter
Stay updated with the latest AI tools, news, and offers by subscribing to our weekly newsletter.
More Categories
-
👤
Personalization
-
🎞️
Video Editing
-
💹
Finance
-
🔊
Audio
-
💬
Chat
-
🎥
Video
-
🔓
OpenSource AI tools
-
🎵
Audio Generator
-
🖼️
Image Editing
Didn't find tool you were looking for?