Google Text-To-Speech and Speech-To-Text

The Google TTS/STT integration allows you to leverage Google's advanced speech recognition and text-to-speech services within NethVoice. These services can be used to create dynamic voice announcements and enable custom speech recognition scenarios.

Setup and Authentication

Obtaining Google API Credentials

To use Google's TTS and STT services, you need to obtain authentication credentials from the Google Developer Platform:

1. Visit the Google Developer Console
2. Create or select an existing project
3. Enable the required APIs:
   - Google Cloud Text-to-Speech API
   - Google Cloud Speech-to-Text API
4. Create a service account and generate a JSON key file
5. Download the JSON credentials file to your local machine

For detailed instructions on obtaining your credentials, refer to the links inside the page.
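Before uploading, it can help to confirm that the downloaded file really is a service account key. The following is a minimal sketch, not part of NethVoice: the function name `check_service_account_key` is hypothetical, and the check only looks for a handful of fields that Google service account key files normally contain.

```python
import json

# Fields normally present in a Google Cloud service account key file.
# This is a subset chosen for illustration, not an exhaustive schema.
REQUIRED_FIELDS = ("type", "project_id", "private_key", "client_email")


def check_service_account_key(raw_json: str) -> list[str]:
    """Return a list of problems found in the key file text (empty if none)."""
    try:
        data = json.loads(raw_json)
    except json.JSONDecodeError as exc:
        return [f"not valid JSON: {exc.msg}"]
    if not isinstance(data, dict):
        return ["top-level JSON value is not an object"]

    # Report every required field that is absent or empty.
    problems = [f"missing field: {name}" for name in REQUIRED_FIELDS if not data.get(name)]
    # Other credential types (for example user credentials) carry a different "type".
    if data.get("type") not in (None, "", "service_account"):
        problems.append(f"unexpected type: {data['type']!r}")
    return problems
```

Calling `check_service_account_key(open("key.json").read())` (the file name is only an example) returns an empty list when the basic structure looks right. It does not verify that the key is still active or that the APIs are enabled.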

Uploading Credentials to NethVoice

To enable Google services in NethVoice:

1. Navigate to Applications > Cloud services
2. Click the Upload button
3. Select the JSON credentials file downloaded from the Google Developer Platform
4. The credentials will be stored securely in NethVoice

Once credentials are uploaded, both TTS and STT features become available.

Google Text-To-Speech (TTS)

Overview

Google TTS allows you to generate high-quality voice recordings from text. This is particularly useful for creating professional voice announcements, IVR prompts, and call queue recordings without requiring manual voice recording.

Using TTS in VisualPlan

You can use Google TTS anywhere in NethVoice where you can add a recording:

- Announcements: Create voice announcements for calls
- IVR (Interactive Voice Response): Build voice menus with dynamic prompts
- CQR (Call Queue Recordings): Generate professional queue messages

Creating a TTS Recording

1. Open the dialog for adding a recording in the feature you want to configure (Announcements, IVR, CQR)
2. If credentials have not been uploaded yet, the dialog offers the option to upload your Google API key
3. If credentials are already configured, two dropdown menus appear:
   - Language: Select the language for the voice
   - Voice: Select the specific voice variant
4. Enter your message text in the provided text field
5. Click the Play button (speaker icon) to preview the generated voice
6. Once the recording sounds as desired:
   - Enter a Name for the recording
   - Enter a Description
   - Click Save to store it as a system recording

The recording can now be used throughout NethVoice wherever recordings are supported.
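For readers curious about what the Language, Voice, and message text fields correspond to on Google's side, the sketch below builds a request body for the public Google Cloud Text-to-Speech REST method `text:synthesize`. It illustrates the Google API only. How NethVoice calls the service internally is not described here, and the function names and the choice of `LINEAR16` output are assumptions.

```python
import base64

# Public REST endpoint of the Google Cloud Text-to-Speech API (v1).
SYNTHESIZE_URL = "https://texttospeech.googleapis.com/v1/text:synthesize"


def build_synthesize_request(text: str, language_code: str, voice_name: str) -> dict:
    """Build the JSON body for a text:synthesize call (no network access here)."""
    if not text.strip():
        raise ValueError("message text must not be empty")
    return {
        "input": {"text": text},
        # Language and voice, as chosen in the two dropdown menus.
        "voice": {"languageCode": language_code, "name": voice_name},
        # Assumption: uncompressed 16-bit PCM output is requested.
        "audioConfig": {"audioEncoding": "LINEAR16"},
    }


def decode_audio(response_body: dict) -> bytes:
    """The API returns the audio base64-encoded in the "audioContent" field."""
    return base64.b64decode(response_body["audioContent"])
```

For example, `build_synthesize_request("Welcome to our company.", "en-US", "en-US-Wavenet-D")` yields a body that could be sent as an authenticated POST to `SYNTHESIZE_URL`. The voice name is only an example and must match a voice available for the chosen language.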

Google Speech-To-Text (STT)

Overview

Google STT provides advanced speech recognition capabilities for custom NethVoice implementations and integrations.

Custom Implementation

STT is primarily used through custom API implementations and integrations. For specific use cases and integration guidance, please contact Nethesis support by opening a ticket through the helpdesk.

Support Resources:

- Open a support ticket for custom STT implementation guidance
- Discuss your specific use case and requirements with the support team
- Receive expert recommendations for your particular scenario
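When preparing a support request, it may be useful to know roughly what a speech recognition call looks like on Google's side. The sketch below builds a request body for the public Google Cloud Speech-to-Text REST method `speech:recognize` and extracts transcripts from a response. It is not the NethVoice integration itself: the function names are hypothetical, and the 8000 Hz `LINEAR16` settings are an assumption about narrowband telephony audio.

```python
import base64

# Public REST endpoint of the Google Cloud Speech-to-Text API (v1).
RECOGNIZE_URL = "https://speech.googleapis.com/v1/speech:recognize"


def build_recognize_request(audio: bytes, language_code: str,
                            sample_rate_hz: int = 8000) -> dict:
    """Build the JSON body for a speech:recognize call (no network access here)."""
    return {
        "config": {
            # Assumption: raw 16-bit PCM at a telephony sample rate.
            "encoding": "LINEAR16",
            "sampleRateHertz": sample_rate_hz,
            "languageCode": language_code,
        },
        # Audio is sent inline as base64 text.
        "audio": {"content": base64.b64encode(audio).decode("ascii")},
    }


def best_transcripts(response_body: dict) -> list[str]:
    """Return the first alternative's transcript for each recognized segment."""
    return [
        result["alternatives"][0]["transcript"]
        for result in response_body.get("results", [])
        if result.get("alternatives")
    ]
```

A response with no recognized speech may omit `results` entirely, which is why `best_transcripts` falls back to an empty list.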

Cost Considerations

Both Google TTS and STT services are subject to Google Cloud pricing. Usage costs depend on:

- Number of characters processed (TTS)
- Duration of audio processed (STT)
- API call frequency

Refer to the Google Cloud Pricing page for current rates and usage limits.
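As a rough planning aid, the two main cost drivers can be turned into a simple estimate. This is a sketch under stated assumptions: the function names are hypothetical, the rates are parameters you must fill in from the current Google Cloud pricing page, and Google's actual billing rules (free tiers, rounding, voice type) are not modeled.

```python
def estimate_tts_cost(texts: list[str], price_per_million_chars: float) -> tuple[int, float]:
    """Return (total characters, estimated cost) for a batch of TTS messages."""
    chars = sum(len(text) for text in texts)
    return chars, chars * price_per_million_chars / 1_000_000


def estimate_stt_cost(durations_seconds: list[float], price_per_minute: float) -> float:
    """Return the estimated cost for a batch of audio clips sent to STT."""
    minutes = sum(durations_seconds) / 60
    return minutes * price_per_minute
```

For instance, `estimate_tts_cost(prompts, rate)` with `prompts` holding the text of every IVR message gives the character total alongside the estimate, which makes it easy to see how much a reworded or regenerated prompt adds.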

