The service converts recorded speech to text. It can also identify the language spoken in a recorded speech sample and synchronize speech with its corresponding text. It handles content in many European languages as well as Mandarin and Arabic. Geographic redundancy helps provide effective service across countries and regions.
API methods support real-time or batch processing of automated transcription jobs, from submission of an audio sample through delivery of the text. Methods also support basic detection of the language spoken in audio samples and synchronization of audio with the corresponding text transcript.
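As a rough illustration of the batch path described above (submit an audio sample, get text back), here is a minimal Python sketch. The listing does not document the endpoint, parameter names, or response format, so everything specific here is an assumption: the `api.example.com` URL, the `mode` and `language` parameters, and the `{"words": [...]}` JSON reply are all hypothetical placeholders. The request is only built, never sent.

```python
import json
import urllib.request
from typing import Optional
from urllib.parse import urlencode

# Hypothetical endpoint: the source does not give the real URL.
BASE_URL = "https://api.example.com/transcribe"


def build_batch_request(audio: bytes, language: Optional[str] = None) -> urllib.request.Request:
    """Build (but do not send) an HTTPS POST that submits one audio sample."""
    # "mode" and "language" are assumed parameter names, not documented ones.
    params = {"mode": "batch"}
    if language is not None:
        # Omitting the language would leave detection to the (assumed) service.
        params["language"] = language
    url = f"{BASE_URL}?{urlencode(params)}"
    return urllib.request.Request(
        url,
        data=audio,
        headers={"Content-Type": "audio/wav"},
        method="POST",
    )


def extract_text(response_body: str) -> str:
    """Join the words of an assumed JSON reply: {"words": [{"text": ...}, ...]}."""
    payload = json.loads(response_body)
    return " ".join(word["text"] for word in payload["words"])
```

In real use the request would be passed to `urllib.request.urlopen` and the reply body handed to `extract_text`; the actual parameter names and response schema should be taken from the provider's own documentation.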
Of the many APIs we published this week, eleven were highlighted on the blog by our team of writers. In this post, we’ll shine a spotlight on those eleven, which include the LairOut API. LairOut, in a nutshell, is a web-based event location directory that tells users about events happening near them. All users have to do is enter a location and date, and the site will list events taking place around that location on or near that date. The API simply makes this functionality available to developers via API calls.
If it's speech-to-text services you're after, Vocapia Research is in the game, offering a variety of speech technology services and working with its clients to adapt, fine-tune, or create products that are relevant for specific applications. One of Vocapia's products is the VoxSigma speech-to-text software suite, which is suitable for many languages and a variety of audio data types. The VoxSigma API is available as a Web service via a REST API over HTTPS.
Computer Vision APIs are a bundle of cognitive services that let you classify images with labels, detect scenes, subjects, and objects, analyze and describe an image with captions, perform...