The Latest News On The API Economy
Recent technology backed by machine learning is making it possible to recognize people and objects from images, videos, sounds, body parts, speech, phrases, characters, and even typing. Developers can find more than two hundred Recognition APIs on ProgrammableWeb. Here are some top choices.
Twelve APIs have been added to the ProgrammableWeb directory in categories including Shipping, Identity, and Blockchain. Featured today are an API that identifies car make and model from a photograph and an API that determines risk based on buyer reputation. Here's a rundown of the latest additions.
Eighteen APIs have been added to the ProgrammableWeb directory in categories including Data, Blockchain, and Voice. Highlights include the Lyrebird API which allows developers to integrate a user's artificial voice into an application. Here's a rundown of the latest additions.
Don't let the name fool you: VoiceSell will not sell your voice data to third-party entities. In fact, quite the opposite. The company, which emerged from stealth today, offers e-commerce vendors a secure way to add voice support to their web sites. The idea is to improve accessibility and security.
Vonage is bolstering its cloud communication suite with the APIs it purchased through its Nexmo acquisition. A new feature for its Vonage Business Cloud platform, called Number Programmability, leverages the Nexmo API suite to empower Vonage numbers with a host of new features.
RingCentral has further integrated its communications features within enterprise systems. RingCentral Embeddable uses RingCentral's core voice and SMS features and allows access to those features within third-party apps. The new e-Discovery and message retention APIs allow better compliance reporting.
The Linux Foundation's Automotive Grade Linux project is an open source project aiming to build an industry standard for infotainment and telematics. The project has released its latest version, Unified Code Base 7.0. The new release moves the project closer to speech-enabling all vehicle apps.
This week Microsoft announced its February updates to the Cognitive Services Speech SDK, currently in version 1.3. The update adds selection of the desired input microphone through the AudioConfig class, beta support for Unity, and new sample code.
Google has announced updates to its Cloud Speech to Text and Text to Speech APIs that include additional languages, voices, and more affordable pricing models. The goal of these updates is to help developers build better intelligent voice applications and make the products more widely available.
Neurotechnology has introduced a lite version of its VeriLook SDK. The new Face Verification SDK has a smaller library and targets mobile technology and apps that need enhanced security (e.g., banks, e-commerce, login providers, and payment processors). The SDK is available across most platforms.
The Cognitive Computing section of our Most Interesting APIs in 2018 series includes APIs from categories such as AI, ML, Recognition, Predictions, Sentiment and more that aim to make computers think like humans. Dozens of APIs were added in these areas this past year. Here are some highlights.
Qualcomm may have just made life easier for Bluetooth headset makers looking to integrate with Amazon's Alexa. Qualcomm's new Smart Headset Platform includes both a reference design and a development kit that allows Bluetooth headset makers to easily integrate their headsets with Alexa.
Five APIs and a notable SDK have been added to the ProgrammableWeb directory in categories including Auto, Voice, Business, Mapping, and Currency. Featured today are the MarketFactory Whisperer API for FX trade aggregation and the Alexa Auto SDK. Here's a look at what's new.
Samsung is preparing to launch an API and SDK for Bixby, its new voice assistant that is integrated into its new Galaxy Home smart speaker. The Bixby API will allow the tech to be integrated into other apps, while the SDK will give developers the ability to make apps with Bixby.
Google has announced the general availability of Cloud Text-to-Speech and a beta release of Cloud Text-to-Speech Audio Profiles. The company also announced updates to Cloud Speech-to-Text which include the addition of multi-channel recognition, speaker diarization, and language auto-detect.
In its effort to "deliver a developer-centric platform", eBay today announced that it is making its image recognition technology available to developers through an Image Search API. The API gives developers the ability to implement image search functionality into their apps.
Amazon today announced the open source release of its Alexa Auto SDK, which lets automakers integrate its Alexa service into their vehicles. Voice-based services like Amazon Alexa are a perfect fit for automobiles, as consumers are in many cases required by law to use hands-free devices.
Twilio has announced the launch of a public beta for its Recording Composition API. The API allows developers to programmatically transcode, combine, and mix Group Room video recordings, which are stored in separate audio and video tracks, eliminating manual complexity.
Microsoft has unveiled a Speech Devices SDK that allows manufacturers of microphone-enabled devices to integrate their devices with the cloud-based Microsoft Speech service. The Speech Devices SDK consists of a pre-tuned library that is paired with a microphone-enabled device.
Microsoft announced at its annual Build conference in Seattle this week that it is launching a unified API for all of its AI speech services. The new API, which is part of the company's Azure platform, will combine four APIs that are currently offered as separate services into a single API.