OpenAI’s ChatGPT now supports voice chats and image-based queries

OswardNa
Last updated: September 26, 2023 11:16 am
monitor screen with openai logo on black background
Photo by Andrew Neel on Pexels.com

OpenAI is rolling out significant upgrades to ChatGPT that let the chatbot respond to voice commands and image-based queries. Users will be able to feed photos into ChatGPT on all platforms and hold voice conversations with it on Android and iOS. The image-based capabilities will initially be available only to Plus and Enterprise users; other users will get access later.

ChatGPT can now see, hear, and speak. Rolling out over next two weeks, Plus users will be able to have voice conversations with ChatGPT (iOS & Android) and to include images in conversations (all platforms). https://t.co/uNZjgbR5Bm pic.twitter.com/paG0hMshXb

— OpenAI (@OpenAI) September 25, 2023

To try voice conversations, you must first opt in through the ChatGPT app’s settings (choose New Features from the Settings menu). You can then tap the headphone button and select one of five voices.

According to OpenAI, a new text-to-speech model powers the back-and-forth voice conversations and can produce “human-like audio from just text and a few seconds of sample speech.” OpenAI worked with professional voice actors to create the five voices. The company’s Whisper speech recognition system, meanwhile, transcribes the user’s spoken words into text.

The image-based features are also intriguing. According to OpenAI, you could take a picture of a math problem and ask the chatbot to solve it, show it a picture of your grill and ask why it won’t start, or have it help you plan a dinner based on a photograph of what’s in your fridge. Microsoft highlighted its Copilot AI’s aptitude for math problems during last week’s Surface event.

OpenAI uses GPT-3.5 and GPT-4 to power its image recognition capabilities. To use ChatGPT’s image-based features, tap the photo button (on iOS or Android, tap the + button first) to take a picture or select an existing image on your device. You can ask ChatGPT about multiple images and use a drawing tool to direct it to a particular part of an image.

OpenAI acknowledged the potential for harm in a blog post introducing the updates. Bad actors could imitate the voices of well-known people (as well as ordinary people) and use them to commit fraud. For this reason, OpenAI is limiting the voice technology to ChatGPT voice chats and working with a few select partners on other restricted use cases (more on that below).

On the image side, OpenAI collaborated with Be My Eyes, a free app that connects blind and low-vision users with volunteers over video calls to help them interpret their surroundings. According to OpenAI, “Users have told us they find it valuable to have general conversations about images that just so happen to have people in the background, like if someone pops up on TV while you’re figuring out your remote control settings.” The company said that because ChatGPT “is not always accurate and these systems should respect individuals’ privacy,” it has also restricted ChatGPT’s ability to analyze and make direct statements about people who appear in photographs.

OpenAI has published a paper on the image-based functionality, which it calls GPT-4 with vision, and its safety properties.

ChatGPT interprets English text in images better than text in other languages. For now, according to OpenAI, the chatbot “performs poorly” in other languages, particularly those written in non-Roman scripts. As a result, it advises non-English speakers not to use ChatGPT for text in images for the time being.

Meanwhile, Spotify has partnered with OpenAI to put the voice technology to use. Spotify has announced a pilot of a Voice Translation tool for podcasters, which can translate podcasts into other languages using the voices of the people who appear on the show. According to Spotify, the tool renders the original speaker’s voice in the new language while preserving their speech patterns.

Do you dream of a world where some of the top podcasts would be spoken in your native language? Well, that’s now possible. We’re excited to pilot Voice Translation, a groundbreaking feature powered by AI that translates podcasts into additional languages—all in the podcaster’s… pic.twitter.com/7ebVwF99hD

— Spotify News (@SpotifyNews) September 25, 2023

Spotify is starting by translating a few English-language shows. Spanish translations of some episodes of Armchair Expert and The Diary of a CEO with Steven Bartlett are available now, with French and German versions coming soon.
