About us
English
Turn your text into a natural voice
Transform your text into an engaging podcast recording
Turn your text into a compelling story
Voice
Xavier
Tone
💼
By using the product, you agree to our Terms of Service and have read our Privacy Policy.
Trusted by Millions Worldwide
4.4
2,100+ reviews on G2
4.4
8,200+ reviews on Capterra
4.4
73,000+ reviews on App Store
248M
Registered Users
5B
Notes Created
2M
Notes Created Daily
Frequently Asked Questions
The tool converts text to natural-sounding audio. Simply type or upload your notes and choose parameters to generate audio.
Yes, the tool works in any modern web browser like Chrome, Firefox, Safari, and Edge. No installation is required.
Yes, free users can generate a 10-second audio preview and replay it within the browser. Full audio features require logging in.
Once logged in, you can download the generated audio file in .m4a format directly from the browser to your device.
Select the Calm tone in Standard mode. Voices like Ember and Zoe pair well to produce gentle, soothing audio.
In Standard mode, choose from 10 voices, each with unique qualities like Xavier (balanced) or Hudson (melodic).
Use the free web app to convert short text into a 10-second audio preview. Log in for full access and downloads.
While specific extensions aren't provided, the tool functions effectively as a web app accessible in any browser.
No, an internet connection is required for generation since processing happens online. Downloaded files, however, can be played offline.
Yes, each uploaded file can be up to 100 MB. This applies to audio, video, and text files.
Generated audio can last up to 60 minutes. Ensure the total content, typed or transcribed, fits within this duration.
In Standard mode, select from preset tones or describe a custom tone for varied audio delivery that suits your notes.
You can upload text files (.txt, .md, .csv), images (OCR extraction), and audio/video files for transcription.
No, editing isn't available in the tool. You can download and edit with external software if needed.
Audio generation generally takes up to one minute, depending on the text size and processing requirements.