ZDNET’s key takeaways
- Talking to ChatGPT feels more collaborative than typing.
- It shines for brainstorming, prep, and translation.
- Usage limits can interrupt productivity mid-session.
Over the past year or so, I’ve had the opportunity to talk with OpenAI. Most of the time, these conversations have been about programming or coding-related capabilities, but recently, I had a discussion about Voice Mode.
OpenAI wanted to get across a number of things you can do with it. The last time I took a look at Voice Mode, I was assessing its technical knowledge. The AI fabricated answers in order to provide immediate responses, which, at the time, nerfed its usefulness to me.
After that fail, I kind of ignored the capability. But the suggestions that OpenAI made about how to get the most out of Voice Mode play to its strengths and made me want to check it out again. So, I decided to have a conversation in Voice Mode about Voice Mode.
That conversation proved to be very productive, and now I’m going to spotlight seven things you can do with Voice Mode that’ll increase your productivity. Read on to the end because there’s a bonus, an eighth idea that itself is worth the price of admission.
(Disclosure: Ziff Davis, ZDNET’s parent company, filed an April 2025 lawsuit against OpenAI, alleging it infringed Ziff Davis copyrights in training and operating its AI systems.)
Using Voice Mode
Voice Mode runs on mobile devices, as well as in your browser. On mobile, there are two ChatGPT widgets available for the lock screen. One widget opens the app, and one launches ChatGPT Voice. If you choose the Voice widget, you’ll have immediate one-tap access to Voice Mode right from your lock screen.
I found the in-browser desktop capability really interesting because it works when the tab is backgrounded. Just tap the small icon on the right of the chat bar.
So, for example, I was able to have an email message open in the Gmail tab of Chrome while talking with ChatGPT Voice Mode. I found that quite helpful, especially as I was going through the email thread I had with OpenAI on this topic.
Everything that you discuss by voice becomes a ChatGPT transcript. That allows you to go back and cut and paste from the chat transcript into other documents or your email.
I did notice that when I used Voice Mode for a productivity discussion, the resultant transcript was more chatty. You’ll have to do a little more digging in your transcript to find the gems that you want to paste into other things, but it does work.
You can also use Voice Mode for pure dictation by telling it “start dictation” and “stop dictation.” That allows Voice Mode to capture your exact words and then save them to the chat.
You can choose from a variety of voices.
I chose Vale, which has a British accent and a Mary Poppins feel to it. If you want another British accent, there’s also Arbor, which has a little bit more of an Artful Dodger feel to it. There are also more cheerful or more business-like American-accent voices that you can choose.
Voice Mode uses that should get you started
The following ideas were provided by OpenAI, but I added an additional idea as a bonus. Keep in mind that there are many more ways you can use Voice Mode as a productivity pal, but this list should get you thinking.
1. Instant translation while traveling
One of the conceits old-school science fiction like Star Trek, Doctor Who, and Farscape used was that everyone spoke English, even aliens from distant planets. The way protagonists communicated with aliens was some kind of universal translator, a device that converted spoken word from one language to another.
Now, however, your phone can be a universal translator. Just open up the ChatGPT app and let it translate for you. You can tell it to translate into the language of your choice. Then, when you say something in English, it will say something that means the same thing in the target language. What makes this such a practical tool is that it can translate what the other person is saying in that language back to English.
OpenAI doesn’t directly list the number of languages, nor does it specify the languages it can translate. However, as this Instagram clip shows, it’s certainly possible to use ChatGPT to help communicate while traveling.
As you might imagine, I found some counterproductive ways to make this feature entertain me. Translating into conversational Klingon seemed like table stakes, and it managed it. The AI failed at translating into Shyriiwook, the language of Wookiees. Apparently, it’s too guttural for the AI to manage. However, I was able to translate spoken English into Huttese, the language spoken by Hutts like Jabba and widely understood in the Star Wars underworld.
2. Talk it out to organize your thoughts
I often find that using another person as a sounding board can help me organize my thoughts. As it turns out, you can also use ChatGPT’s Voice Mode to do the same thing.
When in Voice Mode, you can give it prompts like “I’m going to ramble for 60 seconds about what’s stressing me out. Then summarize what you heard and give me 3 next steps,” or “I have a busy day, help me build a to-do list from everything I’m about to tell you.”
I used Voice Mode to talk through some of the ideas in this article. Although I was eventually cut off in my conversation (more on that below), I asked it to go through what we’d talked about and extract the questions I’d asked it as a quick bullet list. It did, and I used those questions to build up my quick lightning round Q&A below.
For this capability, just tell the AI to listen, then talk to it. When you’re done, or even part of the way through, ask it to organize, clarify, or prompt you for additional thoughts.
3. An on-the-go thought partner
The previous technique talks about using ChatGPT’s Voice Mode to structure information, basically to turn ideas into a list, a plan, or concise notes.
But you can take ChatGPT in the opposite direction to help you brainstorm ideas, what the buzzword fans call “ideation.” In this example, you’re not just organizing your thoughts, you’re exploring to find something new. You’re using ChatGPT to trigger thinking, analysis, and new ideas.
Use prompts like “Can you talk me through this,” or “Can you guide me into finding new approaches?” to fully trigger this technique.
4. Prep for an interview or big meeting with live practice
This was fun. I asked ChatGPT to ask me five interview questions, as if I were auditioning for a product manager gig. Back when I was a mere young pup, I was a product manager and then product marketing director for some computer and software companies.
ChatGPT asked me a series of questions. What I liked about this was that it was interactive. It’s much different to answer a live practice question verbally than it is to go back to your computer, think about an answer, and write it up as paragraphs.
I also valued the fact that ChatGPT could evaluate the responses and give me feedback. You can also tell it to point out when you’re rambling, confirm how you’re doing in terms of staying on topic, and help you practice answering questions concisely.
5. Hands-free assistant for everyday tasks
This works well once you open and unlock your phone. If you’re commuting, cooking, or doing something with your hands like packing for travel or cleaning up a workbench, you can talk to ChatGPT, and it can both record and respond.
The example that comes to mind for my lifestyle is making a shopping list. My wife and I eat pretty much the same food every week, with a few minor exceptions. So we could just feed the full list of all our food into ChatGPT. Then, with the phone in my pocket and my AirPods in my ears, I can have it walk me through the fridge, the big freezer, and our cabinets, determining whether we still have enough of a given item or need more.
For example, I could say, “I’m at the freezer, what should I check?” It might say, “Let me know how many bags of frozen broccoli you still have.” I’d reply, “We have 3.” And it could say, “I’ll add seven bags to the list.”
Once the process is done, I could have it assemble a list of what we need, which would become our very accurate shopping list.
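The arithmetic behind that freezer exchange is simple: a target quantity for each item, minus what is on hand. Here is a minimal Python sketch of that idea, not a description of how ChatGPT works internally. The function name, the item names, and the target of ten bags of broccoli are all assumptions for illustration. The target is inferred from the example of having 3 bags and adding seven.

```python
def build_shopping_list(targets, on_hand):
    """Return {item: quantity_to_buy} for every item below its target.

    targets: hypothetical weekly target quantity per item
    on_hand: counts reported while walking through the kitchen
    """
    shopping = {}
    for item, target in targets.items():
        have = on_hand.get(item, 0)  # an unreported item counts as zero
        shortfall = target - have
        if shortfall > 0:
            shopping[item] = shortfall
    return shopping


# Hypothetical targets; the broccoli target of 10 is an assumption
# inferred from "we have 3" followed by "add seven bags".
targets = {"frozen broccoli (bags)": 10, "eggs (dozen)": 2}
on_hand = {"frozen broccoli (bags)": 3, "eggs (dozen)": 2}

print(build_shopping_list(targets, on_hand))
# → {'frozen broccoli (bags)': 7}
```

Items already at or above their target are left off the list, which is why the eggs do not appear.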
What hands-free tasks could ChatGPT help you with? Let me know below.
6. Real-time copy editor for messages you’re about to send
I tend to like to compose and send my own email messages and texts. But the hands-free dictation feature of ChatGPT Voice Mode has its appeal.
For example, you can dictate a message in more of a brainstorming mode, and then ask ChatGPT to clean it up so you can cut and paste it into an email. Or you could give it a prompt like, “I’m going to dictate a text to my friend. Rewrite it to sound warm and clear. I don’t want to sound dramatic.”
That last prompt was provided by OpenAI. Could there be some juicy backstory about drama? Nah, I don’t want to know. Just use this feature to clean up rambling for some quick cuts and pastes.
7. Ask questions about images in real time
This is a feature only available to paid tiers of ChatGPT. It also differs in flavor depending on whether you’re using it on your desktop in your browser, or using the mobile app.
When in desktop mode, it’s possible to feed in a screenshot or a photo, and have the AI tell you about it. I often feed screenshots into ChatGPT. This was my first time trying it in Voice Mode.
I found it particularly helpful with forms because I could screenshot the form, feed that into ChatGPT, and let Voice Mode run in a background tab. Then, with the form in the foreground tab, I could talk to the AI and have it help me with what needs to be put into each field.
In mobile mode, you can use existing photos that are already on your phone or take photos of locations, signs, plates of food, contents of your fridge, or anything else you might want to know more about, and have Voice Mode help you out with the contents. One great example takes us back to translation: you can snap a picture of a sign that’s in another language and get a translation back from ChatGPT.
Bonus: Exercise buddy
OpenAI didn’t suggest this, but I just tried it out, and it’s pretty cool. You can use ChatGPT Voice Mode as an exercise buddy to help you keep track of reps, remind you what to do next, and encourage you to continue.
I tried it with weights. I prompted it, “Okay, I’m at ten reps. What’s next?” to which I got a response back, “You need to do five more of this exercise, and then switch to side lateral raises. You’re doing great. Keep it up.”
I don’t usually like my AIs sucking up to me or encouraging me, but when it comes to exercising, I’ll take all the help I can get.
Voice Mode feels different
Using Voice Mode as a productivity helper is a bit of an odd experience. It’s exactly the same ChatGPT experience you have when typing in text. But because you’re talking, it really does kind of feel like a conversation.
I found that there was more depth (and more meandering) to the input I provided to the AI when using Voice Mode than there was when I was typing, so my prompts were somewhat more loquacious.
The tone of voice that the AI uses tends to lead to a feeling of colleague-like communication compared to just reading a chat response on the screen. The feeling of the conversation is much more connected. I’m not sure whether that’s good or bad, but it is an interesting experience.
As I said in my previous article, I feel like I’m still sacrificing precision for conversational fluidity. But if you use Voice Mode for work that benefits from conversational fluidity, it works surprisingly well.
Until it doesn’t.
Unclear limits
I’m using the $20-per-month ChatGPT Plus plan. About half an hour into my conversation exploring Voice Mode, I got this message.
There are two issues here: how much time is allowed, and what’s the difference between Advanced and Standard Voice Mode?
CNET has an explainer that describes both modes. According to the article, “In standard mode, the AI creates its response in text first and then a separate voice tool reads the text aloud. While the voice may sound respectable, it can feel unnatural and sometimes delayed. A unified model like advanced voice doesn’t separate writing and speaking.”
The more advanced mode adjusts its tone so it sounds more natural. You can tell you’re using Advanced Mode because the voice indicator is a blue circle. Standard mode shows a black circle.
Unfortunately, when it comes to limits you might encounter, OpenAI’s Voice Mode FAQ isn’t particularly helpful. I don’t think it’s up to date. It talks about limits to GPT-4o, but GPT-4o was retired earlier this month.
In any case, here’s what OpenAI says about limits:
- Free (logged-in) users: Voice runs on GPT-4o mini and is limited to a set number of hours per day.
- Plus subscribers: Voice starts with GPT-4o and is nearly unlimited daily; after GPT-4o minutes are used, it continues with GPT-4o mini.
- Pro subscribers: Unlimited GPT-4o voice use, subject to abuse guardrails.
- Enterprise: Unlimited GPT-4o voice use, billed according to credit consumption.
With my Plus plan, I hit a wall about half an hour in. Since I’m testing for quality, I decided to skip downgrading to the Standard mode.
I encourage OpenAI to update its FAQ documents with the right model names and clearer, more concrete limit information.
What about you?
Have you tried ChatGPT’s Voice Mode as a productivity tool, or do you mostly stick with typing? Do you see yourself using it more for organizing your thoughts, brainstorming ideas, live interview prep, translation, or hands-free tasks?
Have you run into usage limits? Did usage limits change how you use it? Does Voice Mode feel more conversational and helpful to you, or less precise? What other productivity tasks do you recommend using Voice Mode for? Let us know in the comments below.
You can follow my day-to-day project updates on social media. Be sure to subscribe to my weekly update newsletter, and follow me on Twitter/X at @DavidGewirtz, on Facebook at Facebook.com/DavidGewirtz, on Instagram at Instagram.com/DavidGewirtz, on Bluesky at @DavidGewirtz.com, and on YouTube at YouTube.com/DavidGewirtzTV.





