Speak
The Speak node outputs text to the user via voice during a conversation. The node can deliver responses using text-to-speech or pre-recorded audio files.

When to Use This Node
Greet users at the start of a conversation and introduce the Virtual Agent's capabilities
Deliver information without expecting user input (e.g., opening hours, policy details)
Respond to classified intents where the conversation ends after the agent's response
Provide small talk responses to create a more natural conversational experience
Setting Up the Speak Node
Node Configurations
Response Text: The message the agent will speak to the user. Enter text directly or use parameters to insert dynamic content.
Multiple Responses: Add additional response variations. The agent randomly selects one per interaction to create conversational variety.
Use Recording: Upload a human voice recording to replace text-to-speech output.
Supported file types: .wav, .mp3, .ogg
Maximum file size: 4MB
Recordings can be uploaded directly in the node or selected from the Recordings property
Use Parameter: Select a parameter associated with an entity that has recordings. The agent will play the recording linked to the parameter's value.
Pro Tip
Use clear, concise language in your prompts. Users need to understand the agent's message quickly, especially in voice conversations where they cannot re-read text.
When to Use Recording vs Parameter
If you want to include a recording in a Speak node you have two options:
Use Recording - Use a human voice recording that was either uploaded to the Recordings property or right there on the spot to be given out as the agent's response.
Use Parameter - If you have collected a parameter that is associated with an entity with recordings, you can select this parameter here. The agent will then give out the parameter value with the recording you have previously uploaded to the entity.

Last updated
Was this helpful?