Google has announced the general availability of Cloud Text-to-Speech and a beta release of Cloud Text-to-Speech Audio Profiles. The company also announced updates to Cloud Speech-to-Text which include the addition of multi-channel recognition, speaker diarization, and language auto-detect.
Deep Learning has applications in many fields. In this example, it is put to use across three APIs to predict what music genre an album is based on the cover artwork.
Three APIs have been added to the ProgrammableWeb directory today in Cameras, Sports, and Email categories. Also added were several Kairos SDKs. Here's a summary of what's new.