OpenAI teases new reasoning model—but don’t expect to try it soon

by Investor News Today
December 20, 2024
in Technology


For the last day of ship-mas, OpenAI previewed a new set of frontier “reasoning” models dubbed o3 and o3-mini. The Verge first reported that a new reasoning model would be coming during this event.

The company isn’t releasing these models today (and admits final results may evolve with more post-training). However, OpenAI is accepting applications from the research community to test these systems ahead of public release (which it has yet to set a date for). OpenAI launched o1 (codenamed Strawberry) in September and is jumping straight to o3, skipping o2 to avoid confusion (or trademark conflicts) with the British telecom company called O2.

The term reasoning has become a common buzzword in the AI industry lately, but it basically means the machine breaks down instructions into smaller tasks that can produce stronger outcomes. These models often show the work for how they got to an answer, rather than just giving a final answer without explanation.

According to the company, o3 surpasses previous performance records across the board. It beats its predecessor in coding tests (called SWE-Bench Verified) by 22.8 percentage points and outscores OpenAI’s Chief Scientist in competitive programming. The model nearly aced one of the hardest math competitions (called AIME 2024), missing one question, and achieved 87.7 percent on a benchmark for expert-level science problems (called GPQA Diamond). On the hardest math and reasoning challenges that usually stump AI, o3 solved 25.2 percent of problems (where no other model exceeds 2 percent).

OpenAI claims o3 performs better than its other reasoning models in coding benchmarks.
OpenAI

The company also announced new research on deliberative alignment, which requires the AI model to process safety decisions step-by-step. So, instead of just giving yes/no rules to the AI model, this paradigm requires it to actively reason about whether a user’s request fits OpenAI’s safety policies. The company claims that when it tested this on o1, the model was much better at following safety guidelines than previous models, including GPT-4.





© 2024 Investor News Today
