Today, OpenAI unveiled its latest multimodal model, GPT-4o, which offers improved speed and enhanced capabilities across text, voice, and vision. The model is now rolling out in ChatGPT, bringing advanced features to both free and paid users.
One of the standout features of GPT-4o is its ability to understand and discuss images shared by users. For example, you can take a picture of a menu in a foreign language, and ChatGPT will translate it, provide insights into the food's history, and offer recommendations. OpenAI says that GPT-4o will eventually support more natural, real-time voice conversations as well as conversations over live video.
To complement the new model, OpenAI is also rolling out new tools and updated experiences. Firstly, ChatGPT is getting a new, friendlier look and feel with an updated home screen, message layout, and more. These changes aim to make the user experience more conversational and intuitive.
Secondly, OpenAI has announced a new ChatGPT desktop app for macOS. You can now open ChatGPT from anywhere on your Mac with a universal keyboard shortcut (Option + Space), then ask questions, discuss screenshots, and even have voice conversations directly from your desktop.
To start a voice conversation in the app, tap the headphone icon. You can then discuss ideas, prepare for interviews, or brainstorm with ChatGPT. For now, this feature uses the older Voice Mode; OpenAI plans to incorporate GPT-4o's new audio and video capabilities in the future.
The macOS app is being rolled out to Plus users first, with broader availability in the coming weeks. A Windows version is planned for later this year.