$dialer.getTtsConfig
This method returns the TTS provider settings of the phone channel used for the current call.
Syntax
The method is called without arguments:
$dialer.getTtsConfig();
The method returns a settings object.
The properties of this object are different based on the provider.
If the provider supports no additional settings, the method returns an empty object {}
.
$dialer.setTtsConfig
method.Provider settings
Google
-
lang
- Synthesized speech language.
-
voice
- Speech synthesis voice.
tipSee the complete list of languages and voices for synthesis in the Google documentation. -
- Voice pitch.
Takes an integer or float value from
-20
to20
, where-20
means a 20-halftone decrease from the original tone, and20
means the same increase.
- Voice pitch.
Takes an integer or float value from
-
- Synthesized speech rate.
Takes an integer or float value from
0.25
to4
, where1
is the normal voice speed.
- Synthesized speech rate.
Takes an integer or float value from
-
- Volume increase in dB relative to the normal voice volume.
Takes an integer or float value from
-96
to16
. When set to6
, the volume is approximately twice as high as normal.
- Volume increase in dB relative to the normal voice volume.
Takes an integer or float value from
Yandex
The list of available settings depends on the TTS version.
You can get the current version during a call via $dialer.getTtsConfig
.
If the useV3
parameter is set to true
, then the third version of the Yandex SpeechKit protocol is used.
Select a version in the connection settings.
Do not change the value of the useV3
parameter via $dialer.setTtsConfig
as this can cause TTS errors.
- v1
- v3
lang
- Speech synthesis language.
voice
- Speech synthesis voice.
speed
- Synthesized speech rate.
Takes an integer or float value from
0.1
to3
, where1
is the normal voice speed.
- Synthesized speech rate.
Takes an integer or float value from
emotion
-
Speaker role, which is a characteristic of the voice. For example, the speaker can sound friendlier or whisper.
cautionIf you use a role that is not supported for the current voice, a TTS error will occur.
-
-
lang
- Speech synthesis language.
-
voice
- Speech synthesis voice.
-
speed
- Synthesized speech rate.
Takes an integer or float value from
0.1
to3
, where1
is the normal voice speed.
- Synthesized speech rate.
Takes an integer or float value from
-
volume
- Synthesized speech loudness relative to full scale (LUFS).
Takes an integer or float value from
-145
to0
. The recommended value range is from−20
to−16
LUFS.
- Synthesized speech loudness relative to full scale (LUFS).
Takes an integer or float value from
-
role
-
A characteristic of the voice. For example, the speaker can sound friendlier or whisper.
cautionIf you use a role that is not supported for the current voice, a TTS error will occur.
-
-
useVariables
- If the setting is enabled, speech synthesis is done via Yandex SpeechKit Brand Voice Adaptive, which supports variables.
See the complete list of languages, voices, and roles in the Yandex documentation.
Azure
-
language
- Synthesized speech language.
-
voiceName
- Speech synthesis voice.
tipSee the complete list of languages and voices for synthesis in the Azure documentation. -
sampleRate
- Sample rate.
Aimyvoice
voice
- Speech synthesis voice.
3iTech
model
- Speech synthesis language model.
sampleRate
- Sample rate.
speed
- Synthesized speech rate.
tone
- Voice tone.
ElevenLabs
-
voiceId
-
Voice ID. You can copy it in the ElevenLabs interface.
noteThe ElevenLabs website is not available for Russian IP addresses.
-
-
modelId
- Model for speech synthesis.
-
stability
- Controls voice stability and synthesis variation. At low values the voice sounds emotional, at high values it sounds monotonous. Accepts values from
0
to1
.
- Controls voice stability and synthesis variation. At low values the voice sounds emotional, at high values it sounds monotonous. Accepts values from
-
similarityBoost
- Controls how closely the AI should imitate the original voice. Accepts values from
0
to1
.
- Controls how closely the AI should imitate the original voice. Accepts values from
-
style
- Amplifies the style of the original voice. Accepts values from
0
to1
. Values above0
require more resources and might increase the latency.
- Amplifies the style of the original voice. Accepts values from