- SDKs (16)
- Articles (6)
- How To (2)
- Sample Source Code (8)
- Libraries (0)
- Developers (0)
- Followers (32)
- Changelog (32)
Google has announced updates to its Cloud Speech to Text and Text to Speech APIs that include additional languages, voices, and more affordable pricing models. The goal of these updates is to help developers build better intelligent voice applications and make the products more widely available.
Google has announced the general availability of Cloud Text-to-Speech and a beta release of Cloud Text-to-Speech Audio Profiles. The company also announced updates to Cloud Speech-to-Text which include the addition of multi-channel recognition, speaker diarization, and language auto-detect.
In this, developer-blogger Alex Kras shows us how to overcome the 60 second audio file limitation of the free tier of Google's Cloud Speech API by taking a longer audio file, breaking it up into short chunks, and then cycling through those chunks to make a complete transcription.
Google has announced updates to the Cloud Speech API which include the addition of word-level timestamps which means that timestamp information is now available for each word in the transcript. Also, the API now supports long-form audio files up to three hours long and 30 additional languages.
The last couple of years have seen a large number of organizations and developers rush towards getting familiar with Machine Learning fundamentals and coming to grips with what it takes to integrate it into their applications. While you can definitely build out your own Machine Learning platform,
Nineteen APIs have been added to the ProgrammableWeb directory in categories such as Recognition, Telephony, and Payments. Highlighted today is the Google Cloud Speech API which can return text results from speech and recognize languages, among other functions. Here's a look at what's new.