o3 - Search News

OpenAI’s o3 AI model scores lower on a benchmark than the company initially implied

A discrepancy between first- and third-party benchmark results for OpenAI's o3 AI model is raising questions about the ...

OpenAI's o3 and o4-mini hallucinate way higher than previous models

By OpenAI's own testing, its newest reasoning models, o3 and o4-mini, hallucinate significantly more often than o1.

14h

OpenAI's newest o3 and o4-mini models excel at coding and math – but hallucinate more often

Historically, each new generation of OpenAI's models has delivered incremental improvements in factual accuracy, with ...

51m

AI took a huge leap in IQ, and now a quarter of Gen Z thinks AI is conscious

The jump is so steep that it may be causing some to think that AI has become Skynet. According to a new EduBirdie survey, 25% ...

Cryptopolitan on MSN · 23h

OpenAI’s o3 model falls short of its own benchmark claims

OpenAI’s newest LLM, o3, is facing scrutiny after independent tests found it solved far fewer tough math problems ...

Digital Information World · 15h

Concerns Raised as OpenAI’s o3 AI Model Scores Major Discrepancy Between First and Third-Party Benchmark Results

OpenAI’s o3 model shows inflated benchmark results; real-world tests reflect performance far below initial FrontierMath ...

The Tech Portal · 19h

Third-party tests show OpenAI’s o3 under-delivers

OpenAI’s o3 model is under scrutiny after third-party tests revealed far lower performance than previously claimed.

Futurism on MSN · 6h

OpenAI's Hot New AI Has an Embarrassing Problem

OpenAI launched its latest AI reasoning models, dubbed o3 and o4-mini, last week. According to the Sam Altman-led company, ...

18h

OpenAI’s o3 AI Model Falls Short of Benchmark Claims in FrontierMath Test

In December 2024, OpenAI held a livestream on YouTube and other social media platforms, announcing the o3 AI model. At the time, the company highlighted the improved set of capabilities in the large ...

12h

Is OpenAI’s o3 AI Model Weaker Than They Claimed? Independent Tests Reveal Shocking Gap

OpenAI’s newest AI model, o3, is at the center of a growing controversy after third-party tests revealed performance significantly lower than the ...

13h · on MSN

There's no need to overshare on social media now that OpenAI's new chatbots can pinpoint your location from the tiniest details in images

Word to the wise, be careful about the images you post on social media. OpenAI's latest AI models, released last week, have ...

13h

ChatGPT gets scarily good at guessing photo locations, sparking doxxing concerns

OpenAI released its latest o3 and o4-mini models last week, which can "reason" through uploaded images. This means it can ...
