The Latest News On The API Economy
Searching: No Search Term , Filtered By Category: "Text-to-Speech", Category: "Audio", Category: "Media", Category: "Voice"
According to a report published by TechCrunch, Snapchat is secretly planning to launch a developer platform, its first ever. Dubbed Snapkit, prototypes of the platform suggest it will give developers access to Snap's camera software so that they can integrate it into their applications.
Twilio has announced the launch of a public beta for its Recording Composition API. The Recording Composition API allows developers to transcode, combine and mix Group Room video recordings, which are stored in separate audio and video tracks, programmatically, eliminating manual complexity.
Microsoft has unveiled a Speech Devices SDK that allows manufacturers of microphone-enabled devices to integrate their devices with the cloud-based Microsoft Speech service. The Speech Devices SDK consists of a pre-tuned library that is paired with a microphone-enabled device.
Microsoft announced that it is launching a unified API for all of its AI speech services at its annual Build conference this week in Seattle. The new API, which is part of the company's Azure platform, will combine into a single API four APIs that are currently made available as separate services.
Google has announced a large overhaul of its Cloud Speech-to-Text product (formerly the Google Cloud Speech API). Google Cloud Speech-to-Text now supports a selection of pre-built models, automatic punctuation, recognition metadata, and standard service level agreement (SLA).
Amazon has launched a beta version of a Gadgets Skill API that allows developers to build game skills for Echo Buttons that work in conjunction with Amazon's Echo, Echo Dot, Echo Show, Echo Plus and Echo Spot devices. Using the new API, developers can create new kinds of gaming experiences.
Instagram has deprecated much of its public API effective immediately. In a changelog published today, the popular Facebook-owned social media sharing platform announced that the endpoints for Follows and Relationships, Public Content Commenting, Likes and User Search have been removed.
Amazon has added new functionality to its Video Skill API, which allows the company's voice-driven virtual assistant, Alexa, to interface with video content and services. The new functionality consists of recording, launcher and state reporting capabilities that will work with numerous services.
VoiceBase, a provider of AI-powered speech analytics, yesterday announced an update to its Speech Analytics API. The updated API allows organizations to evaluate and categorize voice calls, as well as to build scorecards, reports, dashboard and key performance indicators (KPIs).
Ozonetel has announced a speech API for its KooKoo telephony platform. Developers can use the tag to apply natural language processing to their IVR implementations. To demonstrate the offering, the company has published a simple, but powerful interactive demo to showcase.
Amazon today announced the beta launch of a new version of the Alexa Skills Kit (ASK) developer console that aims to streamline the experience of creating, managing and publishing Alexa skills. The new version of the developer console breaks the Alexa skill development lifecycle into four phases.
Last week Amazon announced that it is adding cooking capabilities to the Smart Home Skill API, which enables Alexa to control and check the status of cloud-connected devices. Initially, the API will be extended to allow Alexa to control microwave ovens but support for other devices is coming.
In this, developer-blogger Alex Kras shows us how to overcome the 60 second audio file limitation of the free tier of Google's Cloud Speech API by taking a longer audio file, breaking it up into short chunks, and then cycling through those chunks to make a complete transcription.
Eleven APIs have been added to the ProgrammableWeb directory in categories including Internet of Things, Location, and Blockchain. Featured is the Sony Audio Control API, which developers can use to create applications that can control the audio of certain Sony devices. Here's a look at what is new.
Sony recently launched its first Audio Control API. That API allows third party apps to control certain audio features of compatible Sony devices. While this is the first Audio API play from Sony, the same strategy was used in a camera scenario through the Sony Camera Remote API.
EBU opened free API access to its database of 240+ quality control tests. Media industry participants have long requested API access to such tests and the industry group answered with the help of members. The data has been freely available via EBU.IO/QC; however, API access streamlines the process.
In an effort to woo developers to build features for Google Assistant, the search giant's virtual personal assistant offering, Google this week announced a number of new functionality that is available or will be coming to its Google Assistant platform. This includes a new push notification API.
Amazon is testing a self-serve advertising API that will allow brands to automate and manage their Amazon advertising campaigns. According to Digiday, the API will offer brands a "systematic approach to adjusting and reporting campaigns and offer capabilities to automate the ways campaigns work."
WebRTC 1.0 has become a browser standard for realtime communications. Despite its widespread use, the API remained a W3C Candidate Recommendation. The API is now stable and considered feature complete. With its new designation, W3C calls for wide implementation and is working on future versions.
Twilio has introduced two new APIs for enabling multi-user augmented reality (AR) applications; the DataTrack API and the Media Sync API. The APIs allow developers to build apps that provide a more immersive environment for users and an AR experience that immediately reacts to changes.