The Latest News On The API Economy
Google has announced updates to the Cloud Text-to-Speech API that will provide developers with 12 new languages/variants as well as 76 new voices. The API now includes 95 WaveNet voices, which the company claims are far more natural-sounding.
Recent technology backed by machine learning is making it possible to recognize people and objects from images, videos, sounds, body parts, speech, phrases, characters, even typing. Developers can find more than 200 APIs for Recognition on ProgrammableWeb. Here are some top choices.
Google has announced updates to its Cloud Speech to Text and Text to Speech APIs that include additional languages, voices, and more affordable pricing models. The goal of these updates is to help developers build better intelligent voice applications and make the products more widely available.
For this installment of the Most Interesting API series, we focus on APIs added to our directory in categories including Sports, Music, Games, News Services, Video, Gambling, eSports, Emoji, Comics, Animation, Memes & Gifs, Photography, Books, Poetry, and Podcasts. Have a look at our list below.
Ten more APIs have been added to the ProgrammableWeb directory in categories including eCommerce, Auto, and Marketing. Highlights include the Sonos API for home sound automation and the Arcentry API for architectural diagram creation. Here's a look at what is new for developers.
Tunity today announced the release of the Tunity SDK for Audio, which allows for the implementation of white-labeled versions of its TV audio streaming technology. Tunity's streaming tech lets a business's customers hear live audio from muted televisions on their own devices.
Google has announced a large overhaul of its Cloud Speech-to-Text product (formerly the Google Cloud Speech API). Google Cloud Speech-to-Text now supports a selection of pre-built models, automatic punctuation, recognition metadata, and a standard service level agreement (SLA).
Eleven APIs have been added to the ProgrammableWeb directory in categories including Internet of Things, Location, and Blockchain. Featured is the Sony Audio Control API, which developers can use to create applications that can control the audio of certain Sony devices. Here's a look at what is new.
Sony recently launched its first Audio Control API. That API allows third-party apps to control certain audio features of compatible Sony devices. While this is the first Audio API play from Sony, the same strategy was used in a camera scenario through the Sony Camera Remote API.
WebRTC 1.0 has become a browser standard for realtime communications. Despite its widespread use, the API remained a W3C Candidate Recommendation. The API is now stable and considered feature complete. With its new designation, W3C calls for wide implementation and is working on future versions.
Twilio has introduced two new APIs for enabling multi-user augmented reality (AR) applications: the DataTrack API and the Media Sync API. The APIs allow developers to build apps that provide a more immersive environment for users and an AR experience that immediately reacts to changes.
As the week kicks into high gear, here is another edition of ProgrammableWeb's "In other API Economy News". Today we look at the Audioburst API, billed as the Google for audio; Status.im's API for creating decentralized chatbots; Movesense's announcement of its motion sensor and SDK; and more.
Nvidia has released Audio and 360 Video SDKs to help simplify VR development. The NVIDIA VRWorks Audio SDK delivers realistic audio in VR environments. The NVIDIA VRWorks 360 Video SDK allows developers to capture and stitch together video feeds into a single 360-degree panoramic video.
Twenty-seven APIs have been added to the ProgrammableWeb directory in categories including Compliance, Air Travel, and Music. Highlights include several APIs for Cisco's IoT services and several for Imagga Technologies' image management services. Here's a rundown of what is new.
Microsoft has joined the likes of Google and Mozilla with its release of the WebRTC 1.0 API. The release enables real-time audio, video, chat, and file sharing across platforms and browsers. With the first release, Microsoft is focused on delivering RTC functionality in existing, legacy websites.
Since Microsoft launched its Bot Framework earlier this year, it has consistently added features to empower developers to create what Microsoft calls "the next great conversational experiences." The latest API additions include the Computer Vision, Bing Speech, and Bing Image Search APIs.
Microsoft announced that its Edge browser will support Speech Synthesis APIs with the next Windows 10 update. The support will allow websites to convert text to speech in a customized manner. Further, Microsoft will support Speech Synthesis Markup Language (SSML) to enhance developer control.
According to Gartner, there will be nearly 21 billion devices connected to the Internet by the year 2020. That’s a lot of devices that need to be connected in useful ways. This article shows you the steps to make a Raspberry Pi speak from anywhere in the world using Node.js and PubNub.
A year after YouTube introduced 360-degree video, it has added a live-streaming option. YouTube enhances immersive video experiences by supplementing 360-degree live streaming with spatial listening (a feature that mimics in-person listening by accounting for depth, intensity, and distance).
Building on Fortytwo’s expertise in the A2P messaging arena, Fortytwo’s Voice API enables clients to deliver messages to a customer’s mobile phone or landline, either with a pre-recorded audio file or by using Text-To-Speech, which converts messages into a male or female voice in up to fifteen languages.