Google Text-To-Speech and Speech-To-Text
The Google TTS/STT integration lets you use Google's speech recognition and text-to-speech services within NethVoice. Use them to generate voice announcements from text and to build custom speech recognition scenarios.
Setup and Authentication
Obtaining Google API Credentials
To use Google's TTS and STT services, you need to obtain authentication credentials from the Google Developer Platform:
- Visit the Google Developer Console
- Create or select an existing project
- Enable the required APIs:
  - Google Cloud Text-to-Speech API
  - Google Cloud Speech-to-Text API
- Create a service account and generate a JSON key file
- Download the JSON credentials file to your local machine
For detailed instructions on obtaining your credentials, refer to the links inside the page.
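Before uploading, it can help to confirm that the downloaded file really is a service account key. The following is a minimal sketch, not part of NethVoice: the function name `check_service_account_key` is hypothetical, and the list of checked fields is a subset of what Google's service account key files normally contain.

```python
import json

# Fields normally present in a Google service account JSON key file.
REQUIRED_FIELDS = ("type", "project_id", "private_key", "client_email")


def check_service_account_key(raw_json):
    """Return a list of problems found in the key file text (empty if none)."""
    try:
        data = json.loads(raw_json)
    except json.JSONDecodeError as exc:
        return [f"not valid JSON: {exc.msg}"]
    if not isinstance(data, dict):
        return ["top-level JSON value is not an object"]

    # Report every required field that is absent or empty.
    problems = [f"missing field: {name}" for name in REQUIRED_FIELDS if not data.get(name)]

    # Other credential types (for example "authorized_user") are not service account keys.
    key_type = data.get("type")
    if key_type and key_type != "service_account":
        problems.append(f"unexpected type: {key_type!r}")
    return problems
```

To use it, read the downloaded file as text and pass the content to `check_service_account_key`; an empty list means the basic structure looks right. It does not verify that the key is still active or that the APIs are enabled.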
Uploading Credentials to NethVoice
To enable Google services in NethVoice:
- Navigate to Applications > Cloud services
- Click the Upload button
- Select the JSON credentials file downloaded from the Google Developer Platform
- The credentials will be stored securely in NethVoice
Once credentials are uploaded, both TTS and STT features become available.
Google Text-To-Speech (TTS)
Overview
Google TTS allows you to generate high-quality voice recordings from text. This is particularly useful for creating professional voice announcements, IVR prompts, and call queue recordings without requiring manual voice recording.
Using TTS in VisualPlan
You can use Google TTS anywhere in NethVoice where you can add a recording:
- Announcements: Create voice announcements for calls
- IVR (Interactive Voice Response): Build voice menus with dynamic prompts
- CQR (Call Queue Recordings): Generate professional queue messages
Creating a TTS Recording
- Open the dialog for adding a recording in the feature you want to use (Announcements, IVR, or CQR)
- If credentials have not been uploaded yet, the dialog offers the option to upload your Google credentials file
- If credentials are already configured, two dropdown menus will appear:
  - Language: Select the language for the voice
  - Voice: Select the specific voice variant
- Enter your message text in the provided text field
- Click the Play button (speaker icon) to preview the generated voice
- After confirming the recording sounds as desired:
  - Enter a Name for the recording
  - Enter a Description
  - Click Save to store it as a system recording
The recording can now be used throughout NethVoice wherever recordings are supported.
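For readers curious how the Language, Voice, and message text choices relate to Google's service, the sketch below builds the JSON body accepted by the public Google Cloud Text-to-Speech REST method `text:synthesize` and decodes its reply. This illustrates the Google API only; how NethVoice calls it internally is not documented here, and the helper names `build_tts_request` and `decode_audio` are hypothetical.

```python
import base64
import json

# Public REST endpoint of the Google Cloud Text-to-Speech API (v1).
TTS_ENDPOINT = "https://texttospeech.googleapis.com/v1/text:synthesize"


def build_tts_request(text, language_code, voice_name, encoding="LINEAR16"):
    """Build the JSON body for a text:synthesize call.

    language_code and voice_name correspond to the Language and Voice
    choices; valid values come from Google's published voice list.
    """
    if not text.strip():
        raise ValueError("message text must not be empty")
    return {
        "input": {"text": text},
        "voice": {"languageCode": language_code, "name": voice_name},
        "audioConfig": {"audioEncoding": encoding},
    }


def decode_audio(response_body):
    """Extract the audio bytes from a text:synthesize JSON response.

    Google returns the audio as a base64 string in the "audioContent" field.
    """
    return base64.b64decode(json.loads(response_body)["audioContent"])
```

Sending the request additionally requires an OAuth 2.0 access token derived from the service account key, which is omitted here to keep the sketch self-contained.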
Google Speech-To-Text (STT)
Overview
Google STT provides advanced speech recognition capabilities for custom NethVoice implementations and integrations.
Custom Implementation
STT is primarily used through custom API implementations and integrations. For specific use cases and integration guidance, please contact Nethesis support by opening a ticket through the helpdesk.
Support Resources:
- Open a support ticket for custom STT implementation guidance
- Discuss your specific use case and requirements with the support team
- Receive expert recommendations for your particular scenario
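As orientation before discussing a custom implementation with support, the sketch below shows the general shape of a request to the public Google Cloud Speech-to-Text REST method `speech:recognize` and how transcripts are read from its reply. It is an assumption-laden illustration, not the NethVoice integration: the helper names are hypothetical, and the 8000 Hz default is only a guess at typical telephony audio.

```python
import base64

# Public REST endpoint of the Google Cloud Speech-to-Text API (v1).
STT_ENDPOINT = "https://speech.googleapis.com/v1/speech:recognize"


def build_stt_request(audio_bytes, language_code, sample_rate_hz=8000):
    """Build the JSON body for a speech:recognize call on 16-bit PCM audio."""
    return {
        "config": {
            "encoding": "LINEAR16",
            "sampleRateHertz": sample_rate_hz,  # assumed telephony rate
            "languageCode": language_code,
        },
        # Audio is sent inline as a base64 string.
        "audio": {"content": base64.b64encode(audio_bytes).decode("ascii")},
    }


def best_transcripts(response):
    """Return the top transcript of each result in a recognize response."""
    return [
        result["alternatives"][0]["transcript"]
        for result in response.get("results", [])
        if result.get("alternatives")
    ]
```

A response with no recognized speech may omit `results` entirely, which is why `best_transcripts` falls back to an empty list.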
Cost Considerations
Both Google TTS and STT services are subject to Google Cloud pricing. Usage costs depend on:
- Number of characters processed (TTS)
- Duration of audio processed (STT)
- API call frequency
Refer to the Google Cloud Pricing page for current rates and usage limits.
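The first two cost factors can be combined into a rough monthly estimate. The sketch below is a simplification under stated assumptions: the function name is hypothetical, the rates are parameters you must fill in from the Google Cloud Pricing page, and free tiers, billing increments, and per-voice price differences are ignored.

```python
def estimate_monthly_cost(tts_characters, stt_seconds,
                          tts_rate_per_million_chars, stt_rate_per_minute):
    """Estimate a monthly cost from usage volumes and caller-supplied rates.

    Rates are not built in: take them from the current Google Cloud
    pricing for the voices and recognition models you actually use.
    """
    if min(tts_characters, stt_seconds) < 0:
        raise ValueError("usage values must not be negative")
    tts_cost = tts_characters / 1_000_000 * tts_rate_per_million_chars
    stt_cost = stt_seconds / 60 * stt_rate_per_minute
    return round(tts_cost + stt_cost, 2)
```

For TTS, the character count is simply the length of the message text, so `len(message)` summed over all generated recordings gives the first argument.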