OpenAI just announced o3 and o3 mini, its next-gen reasoning models.
In the livestream, SVP of Research Mark Chen showed how o3 performs on certain benchmarks compared to o1, including competition math (96.7 percent) and PhD-level science (87.7 percent). OpenAI and the ARC Prize competition also shared that o3 scored 76 percent on the ARC-AGI benchmark, which includes novel unpublished datasets. The ARC-AGI benchmark is designed to test a model's ability to learn new and distinct skills on the fly with every new task.
The announcement caps the 12 Days of OpenAI marathon, which debuted something new every day. Over the past 12 business days, OpenAI has launched its AI video generator Sora and vision with Advanced Voice Mode, in addition to a slew of products and features designed to make ChatGPT more seamless to use in work and daily life.
The o3 mini model is designed to be a cost-efficient model that balances cost and performance. It has three different effort levels and can adapt its amount of reasoning time based on the difficulty of the problem. "An incredible cost-to-performance gain," said CEO Sam Altman.
According to OpenAI, o3 and o3 mini have achieved amazing intelligence breakthroughs, but they're not ready to be released to the public yet. OpenAI is, however, granting early access to both models for safety testing starting today. Applications to join the model testing program are accepted on a rolling basis and close on Jan. 10.