OpenAI launches new voice intelligence features in its API

bicycledays

OpenAI announced Thursday that its API will now include a number of new voice intelligence features designed to help developers create apps that can talk, transcribe, and translate conversations with users.

The company’s new GPT‑Realtime‑2 is another voice model, built to create a realistic vocal simulation that can converse with users. However, unlike its predecessor (GPT-Realtime-1.5), this one is built with GPT‑5‑class reasoning, which OpenAI says is meant to handle more complicated requests from users.

The company is also launching GPT‑Realtime‑Translate, which, just as it sounds, is designed to provide real-time translation services that “keep pace” with the user, conversationally. The feature supports more than 70 input languages (that is, the languages it can comprehend) and 13 output languages (the languages it relays to the speaker).

Finally, the company has also launched a new transcription capability, GPT-Realtime-Whisper, which gives users live speech-to-text that is captured as interactions occur.

“Together, the models we’re launching move real-time audio from simple call-and-response toward voice interfaces that can actually do work: listen, reason, translate, transcribe, and take action as a conversation unfolds,” the company said.

Who will these updates be good for? Companies that want to expand customer service capabilities are an obvious target. However, OpenAI also notes that its new features can help in a wide array of areas, including education, media, events, and creator platforms, among others.

As useful as these tools seem from an enterprise perspective, it also seems plausible that they could be misused. The company said it has built guardrails to stop its new features from being abused to create spam, fraud, or other forms of online abuse. Certain triggers have been embedded in the system so that “conversations can be halted if they are detected as violating our harmful content guidelines,” OpenAI said.

All of the new voice models are available in OpenAI’s Realtime API. Translate and Whisper are billed by the minute, while GPT-Realtime-2 is billed by token consumption.
