This SDK provides an interface to the DOCOMO Speech Recognition API, enabling speech recognition to be added to Android applications. The SDK uses language processing technology from NTT IT and supports Android versions up to 4.2. It allows a recording time of up to 15 seconds. The API accepts speech in an audio file and transcribes the detected speech into text.
Google has announced the general availability of Cloud Text-to-Speech and a beta release of Cloud Text-to-Speech Audio Profiles. The company also announced updates to Cloud Speech-to-Text, which include the addition of multi-channel recognition, speaker diarization, and automatic language detection.
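As a rough illustration of how the three Speech-to-Text additions might be switched on, the sketch below builds the JSON body for a REST `speech:recognize` request. The field names follow the public `RecognitionConfig` reference as best understood here. The helper name `build_recognition_request`, the storage URI, and the chosen languages and speaker counts are all hypothetical, and field availability can differ between API versions.

```python
import json


def build_recognition_request(gcs_uri, primary_language, alternative_languages,
                              channel_count=2, max_speakers=2):
    """Build a request body for Cloud Speech-to-Text (hypothetical helper).

    Nothing is sent over the network; the function only assembles a dict.
    """
    config = {
        "languageCode": primary_language,
        # Language auto-detect: the service picks among these candidates.
        "alternativeLanguageCodes": list(alternative_languages),
        # Multi-channel recognition: transcribe each audio channel separately.
        "audioChannelCount": channel_count,
        "enableSeparateRecognitionPerChannel": True,
        # Speaker diarization: label words with the speaker who said them.
        "diarizationConfig": {
            "enableSpeakerDiarization": True,
            "maxSpeakerCount": max_speakers,
        },
    }
    return {"config": config, "audio": {"uri": gcs_uri}}


if __name__ == "__main__":
    # The bucket path below is a placeholder, not a real resource.
    body = build_recognition_request(
        "gs://example-bucket/call.wav", "en-US", ["es-ES", "fr-FR"])
    print(json.dumps(body, indent=2))
```

Keeping the request as a plain dictionary makes it easy to inspect before it is posted with any HTTP client and an authenticated session.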
Google’s Cloud Vision API gives developers quick and simple access to sophisticated image recognition software. The tool uses machine learning to identify the contents of images and classify them across thousands of categories; in this case, it is used to determine traffic volume.
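A minimal sketch of a label detection call may help make this concrete. It assembles the JSON body for the Cloud Vision `images:annotate` REST method and pulls label names out of a response. The helper names `build_label_request` and `labels_above`, and the 0.5 score threshold, are assumptions for illustration; how the labels are turned into a traffic volume estimate is not described in the source and is left out.

```python
import base64

# REST endpoint for the Cloud Vision annotate method.
VISION_ENDPOINT = "https://vision.googleapis.com/v1/images:annotate"


def build_label_request(image_bytes, max_results=10):
    """Build an images:annotate body asking for LABEL_DETECTION."""
    encoded = base64.b64encode(image_bytes).decode("ascii")
    return {
        "requests": [
            {
                "image": {"content": encoded},
                "features": [
                    {"type": "LABEL_DETECTION", "maxResults": max_results}
                ],
            }
        ]
    }


def labels_above(response, min_score=0.5):
    """Return label descriptions whose score meets the threshold."""
    labels = []
    for result in response.get("responses", []):
        for annotation in result.get("labelAnnotations", []):
            if annotation.get("score", 0.0) >= min_score:
                labels.append(annotation["description"])
    return labels


if __name__ == "__main__":
    # A hand-written response in the documented shape, not real API output.
    sample = {"responses": [{"labelAnnotations": [
        {"description": "Car", "score": 0.9},
        {"description": "Tree", "score": 0.3},
    ]}]}
    print(labels_above(sample))
```

The request body would be posted to `VISION_ENDPOINT` with valid credentials; the parsing helper works on the decoded JSON response.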
AIH Technology has announced a new inclusive Facial Recognition as a Service API. The company’s facial recognition algorithm is designed to be ethnicity-neutral, something many facial recognition providers have failed to accomplish. The API is currently available through the Microsoft Azure Marketplace.