News
Through my experience working with the world's leading hedge funds and quants, I’ve seen the limitations of black-box models and the enduring value of rigorous, explainable and mathematically ...
We’re seeing some new developments in AI models that are shedding light on one of the technology’s most prominent gaps – its relative inability to do math well. Some experts note that AI is ...
OpenAI has achieved "gold medal-level performance" at the International Math Olympiad, notching another important milestone for AI's fast-paced growth. Alexander Wei, a research scientist at ...
OpenAI researchers reveal how their experimental model, devoid of any external aids, powered through hours-long proofs to ...
That’s the “cold maths era”: statistical fluency without the ability to show the work. Financial institutions, operating ...
OpenAI’s latest model has achieved a gold-level score at the 2025 International Mathematical Olympiad. It answered five out of the six questions under exam conditions, scoring 35 out of a ...
Hosted on MSN3mon
OpenAI models sabotage shutdown order to solve math problems
The research firm ran a test where AI models were instructed to solve basic math problems, and then asked for the next problem after solving one. The models were told that at some point their ...
Phi-4 and an rStar-Math paper suggest that compact, specialized models can provide powerful alternatives to the industry’s largest systems.
In a new paper, researchers show that even the most sophisticated general-purpose AI language models struggle to solve math problems.
DeepSeek models match or beat some of Silicon Valley's top offerings. BI put the Chinese contender through its paces with a challenging math problem.
Results that may be inaccessible to you are currently showing.
Hide inaccessible results