Monday, May 13th 2024
ChatGPT Comes to Desktop with OpenAI's Latest GPT-4o Model That Talks With Users
At OpenAI's spring update, a lot of eyes were fixed on the company that spurred the AI boom with its ChatGPT application. Now almost a must-have app for consumers and prosumers alike, ChatGPT is the de facto showcase for the latest AI innovation, backed by researchers and scientists from OpenAI. Today, OpenAI announced a new model called GPT-4o (Omni), which aims to bring advanced intelligence, improved overall capabilities, and real-time voice interaction with users. With it, the ChatGPT application is meant to become more like a personal assistant that actively communicates with users and provides much broader capabilities. OpenAI claims that it can respond to audio inputs in as little as 232 milliseconds, with an average of 320 milliseconds, similar to human response time in conversations.
OpenAI states that it wants ChatGPT's latest GPT-4o model to be available to free users as well as Plus and Team paid subscribers, with paid subscribers getting 5x higher usage and early access to the model. The GPT-4o model is much improved across a variety of standard benchmarks like MMLU, MATH, HumanEval, GPQA, and others, where it now surpasses almost all models, the exception being Claude 3 Opus in MGSM. It now understands more than 50 languages and can do real-time translation. In addition to the new model, OpenAI announced that it is launching a desktop ChatGPT app, which can act as a personal assistant and see what is happening on the screen, but only when the user explicitly allows it. This is supposed to bring a much more refined user experience and let users treat the AI as a third party that helps them understand the screen's content. The app is initially available only on macOS; we are waiting for OpenAI to launch the Windows ChatGPT application so everyone can experience the new technology.
Source:
OpenAI
35 Comments on ChatGPT Comes to Desktop with OpenAI's Latest GPT-4o Model That Talks With Users
There are rumors that Apple and OpenAI have inked a deal to bring better AI integration to Apple's upcoming operating systems (to be previewed next month at Apple WWDC). Perhaps that's why they are offering on Macs first.
Maybe if government was competent and offered free degrees in high demand areas only, for anyone, then we might be able to pull ourselves out of the future collapse, but only free for degrees in demand - with strict requirements like keeping a B average the entire time, etc.
give people economic mobility - and the world would be much better off.
www.techspot.com/news/102975-sam-altman-envisions-future-where-universal-basic-income.html
or this could be a thing, I don't know. I wonder if UBI would make it so plumbers and electricians would just quit their jobs and society would deteriorate, or would they be greedy enough to keep wanting more and more money on top of the UBI? let the experimenting begin!
The baseline isn't the introduction of ChatGPT itself nor the technology underpinning it, as I see it. The baseline is the first step towards AGI (which has been clearly stated by many companies pursuing AGI, including OpenAI). And the end result will be complete integration of multiple systems specifically designed to facilitate specific tasks with reasoning. Defining the reasoning depends on a lot more than just the code or trained LLM. It is coming though. Sooner than we probably care to believe.
The problem as always is bringing the technology to a place in the market(s) with known and unknown application, vying for relevance in every way possible (broad scope, not focused), and essentially dumping it into people's hands and asking them to integrate it how they see fit. Which is to say, nothing out of the ordinary for technology circa 1980 (really picked up steam in the 90's) and onward.
So for the short term, we get this displacement in areas unintended (or perhaps intended by some), we tend to forego the focused vision (it may help some folks but isn't being used nor interacted with by all) and instead deploy broadly, and then down the road patch up the framework that we broke along the way. You can find this application of new technology into society very easily in recent history.
We seem to want to move away from classical forms of living, accustomed to life as it is or was, and are instead looking for ways to make it happen sooner. The problem is that this affects everyone directly but doesn't benefit everyone directly. Due to economic forces and this particular kind of technological transition as a whole, we end up with apps for everything. It's appalling in the short term, and it'll be a tumultuous road until we 'patch up the framework' and the technology matures enough to directly benefit everyone.
As an example of what I mean - digital currencies via blockchain technology aren't the bee's knees - it's the underlying blockchain that presents the real technological breakthrough, but it takes eons (in relative implementation) to be institutionalized.
And AI still doesn't really know how to do anything artistic, like paint, write music, etc. Maybe someday AI will write something as good as Mozart's overture to "Don Giovanni" but I'm not going to hold my breath. It might be able to write filler music for some videogame cutscene.
But for sure, the world isn't going to change in two years. Some things will change more rapidly than others. AI deployment will be ahead of the curve in enterprise situations but AI isn't going to serve you a better grilled chicken sandwich any faster anytime soon.
Could AI pick my fantasy sports team roster? My guess is a handful of clever people are already doing this to their advantage. No reason for them to mention it at this time when they have a competitive advantage. Heh heh.
once the middle class jobs fall the economy falls.
Coin-size 50+ year batteries
Extremely small scale nuclear reactors
Propellantless Thrust
Robotics breakthroughs (near endless IIRC, since roughly 2000)
And on and on and on...
Pair them all together and.. voila ... AGI (once reasoning is figured more maturely) will be a real thing.
You can't just say "do my job for me and text me when you're done so I can stop watching cat videos and go home."
Right now AI is mostly good at speeding up really repetitive online or computer-based tasks.
IBM won't care; they have already stated as much, that AI will be taking over the workforce. It's definitely coming for most middle class jobs.
www.techradar.com/pro/nvidia-ceo-predicts-the-death-of-coding-jensen-huang-says-ai-will-do-the-work-so-kids-dont-need-to-learn
Of course he gains more by selling AI accelerators to companies who want to reduce the number of programmers on their payroll.
:)
see this video here:
www.techpowerup.com/forums/threads/how-can-we-utilize-artificial-intelligence-to-help-us-be-more-prescient-about-quality-of-life-and-environment.320713/post-5252673
AI, as cool as it is doesn't consume anything - and in order to sell things you need consumers.
you know and I know not every single tech person felt that way, feels that way, or will feel that way. and vice versa.
the world is complicated, the ones who make the most noise get the most attention, sadly, and those people are usually asshats, again, sadly.
we need to figure out a way to move forward, to be resilient; that is, in-demand jobs need to be free to train and study for.