Generate Voice
Generate realistic speech from text
Summary
The Voice Node converts text into natural-sounding speech using AI text-to-speech models. Write or connect a prompt, select a voice, and the node generates an audio output that sounds like a real person speaking. Use it for voiceovers, narration, dialogue, or any workflow that needs spoken audio from text.
How to Use
Add the Node:
Click the Add (+) button and select Voice from the Audio node category.
Write Your Text:
Type what you want spoken into the prompt field, or connect a Prompt or AI Copilot node.
Select a Voice:
Choose a voice from the Voice dropdown (e.g., Roger). Each voice has a distinct tone, pitch, and character.
Run:
Click Run, and the AI generates an audio file of the selected voice speaking your text.
Choosing the Right Settings
Voice
Dropdown (e.g., Roger)
Selects the voice character used for speech generation. Different voices vary in tone, gender, accent, and style.
Prompt
Text Input
The text content that will be converted into speech.
Stability
Slider (0–100%)
Controls how consistent the voice sounds across the output.
Similarity Boost
Slider (0–100%)
Controls how closely the output matches the selected voice.
Speed
Slider
Adjusts the speaking pace of the generated audio.
Timestamps
Checkbox
When enabled, returns word-level or sentence-level timestamps alongside the audio output.
Sample Use Cases
Voiceovers for AI-Generated Videos
Generate a voiceover and connect it to a Combine Audio & Video node to add narration to any video in your workflow.
Multilingual Audio Content
Write the same script in multiple languages and generate a voice for each — perfect for localizing video content without hiring voice actors.
Podcast and Audio Previews
Quickly generate audio previews of scripts, blog posts, or ad copy to hear how they sound before recording with a real voice.
Audio Models
Visit Audio Models to explore all available models and find the one that fits your audio needs.
Last updated

