Speak

The Speak node outputs text to the user via voice during a conversation. The node can deliver responses using text-to-speech or pre-recorded audio files.

When to Use This Node

  • Greet users at the start of a conversation and introduce the Virtual Agent's capabilities

  • Deliver information without expecting user input (e.g., opening hours, policy details)

  • Respond to classified intents where the conversation ends after the agent's response

  • Provide small talk responses to create a more natural conversational experience

Setting Up the Speak Node

1. Add the Speak node to your flow

Drag the Speak node from the node panel onto the canvas and connect it to the previous node (typically the Start node for greetings, or a Classification node for intent-based responses).

2. Enter your response text

Type the message the agent will speak to the user (e.g., "Welcome to customer support. How can I help you today?" or "Our office hours are Monday through Friday, 9 AM to 5 PM"). Keep the message clear and concise, because users cannot re-read voice output.

3. Add response variations (optional)

Create multiple versions of the same message. The agent randomly selects one each time the node triggers, which makes conversations feel more natural and less robotic.

4. Choose output method (optional)

Toggle Use Recording to upload a human voice recording (.wav, .mp3, or .ogg, max 4MB), or toggle Use Parameter to play a recording associated with a parameter value from an entity.

5. Connect to the next node

Link the Speak node output to the next step in your flow (e.g., a Listen node to capture the user's response), or leave it unconnected to end the conversation after the agent speaks.
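
The random selection described in step 3 can be pictured with a short Python sketch. This is only an illustration of the behavior, not the platform's implementation: the function name pick_response and the second greeting are hypothetical, and the sketch assumes each variation is equally likely to be chosen.

```python
import random


def pick_response(variations, rng=random):
    """Return one response variation chosen at random (hypothetical helper)."""
    if not variations:
        raise ValueError("at least one response is required")
    # Assumption: every variation has the same chance of being selected.
    return rng.choice(variations)


greetings = [
    "Welcome to customer support. How can I help you today?",
    # Illustrative variation, not taken from the product.
    "Hello, you have reached customer support. What can I do for you?",
]

print(pick_response(greetings))
```

Because any variation may be spoken on a given turn, each one should be able to stand on its own and carry the same information.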

Node Configurations

  • Response Text: The message the agent will speak to the user. Enter text directly or use parameters to insert dynamic content.

  • Multiple Responses: Add additional response variations. The agent randomly selects one per interaction to create conversational variety.

  • Use Recording: Upload a human voice recording to replace text-to-speech output.

    • Supported file types: .wav, .mp3, .ogg

    • Maximum file size: 4MB

    • Recordings can be uploaded directly in the node or selected from the Recordings property

  • Use Parameter: Select a parameter associated with an entity that has recordings. The agent will play the recording linked to the parameter's value.
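
The Use Recording limits listed above can be checked before uploading a file. The following Python sketch is one way to do that locally. The helper name check_recording is hypothetical, and the sketch assumes that "4MB" means 4 × 1024 × 1024 bytes, which this page does not specify.

```python
from pathlib import Path

# File types listed for Use Recording.
ALLOWED_EXTENSIONS = {".wav", ".mp3", ".ogg"}

# Assumption: "4MB" is read as 4 * 1024 * 1024 bytes.
MAX_BYTES = 4 * 1024 * 1024


def check_recording(filename, size_bytes):
    """Return a list of problems; an empty list means both checks pass."""
    problems = []
    ext = Path(filename).suffix.lower()
    if ext not in ALLOWED_EXTENSIONS:
        problems.append(f"unsupported file type: {ext or '(none)'}")
    if size_bytes > MAX_BYTES:
        problems.append(f"file too large: {size_bytes} bytes")
    return problems


print(check_recording("greeting.wav", 250_000))
print(check_recording("greeting.flac", 250_000))
```

The check only looks at the file name and size. It does not inspect the audio encoding, so a file that passes here may still be rejected for other reasons.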


Pro Tip

When to Use Recording vs Parameter

If you want to include a recording in a Speak node, you have two options:

Use Recording - Play a human voice recording as the agent's response. Select a recording that was uploaded to the Recordings property, or upload one directly in the node.

Use Parameter - If you have collected a parameter that is associated with an entity with recordings, you can select that parameter here. The agent then plays the recording you previously uploaded to the entity for the parameter's value.
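
Conceptually, Use Parameter behaves like a lookup from the collected parameter value to the recording uploaded for that value. The Python sketch below illustrates the idea. The entity contents, file names, and the function recording_for are all hypothetical, and returning None for a value without a recording is an assumption of the sketch, since this page does not say what the agent does in that case.

```python
# Hypothetical entity: each value is linked to a recording uploaded for it.
CITY_RECORDINGS = {
    "Berlin": "berlin.wav",
    "Vienna": "vienna.wav",
}


def recording_for(parameter_value, entity_recordings):
    """Return the recording linked to the value, or None if there is none."""
    # Assumption: a missing recording is reported as None in this sketch.
    return entity_recordings.get(parameter_value)


print(recording_for("Berlin", CITY_RECORDINGS))
```

By contrast, Use Recording always plays the same file regardless of what the user said, so it suits fixed messages, while Use Parameter suits responses that depend on a collected value.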
