July 13, 2018
Related Articles (135)
Google has announced updates to the Cloud Speech API which include the addition of word-level timestamps which means that timestamp information is now available for each word in the transcript. Also, the API now supports long-form audio files up to three hours long and 30 additional languages.
ProgrammableWeb first covered Animetrics last year as its Facial Recognition API was picking up speed in the government and law enforcement space. Expanding on its initial success, Animetrics has now released a commercial version of the API: FaceR API. This is Animetrics first taste of the commercial space.
Google has announced the general availability of Cloud Text-to-Speech and a beta release of Cloud Text-to-Speech Audio Profiles. The company also announced updates to Cloud Speech-to-Text which include the addition of multi-channel recognition, speaker diarization, and language auto-detect.