Benchmark Results - Search News

Google Releases Gemini 3.5 Flash, Beats Gemini 3.1 Pro On Many Benchmarks

AI is accelerating so fast that Flash models are beating Pro models from just months ago. Google DeepMind announced Gemini 3.5 Flash ...

5don MSN

Microsoft’s multi-agent AI system tops Anthropic’s Mythos on cybersecurity benchmark

Microsoft's new vulnerability-scanning system, codenamed MDASH, scored 88.45% on the CyberGym benchmark, surpassing ...

Forza Horizon 6 Benchmark: 47 GPUs Tested

Forza Horizon 6 is one of the best-looking racing games yet, but max settings and RT can crush modern GPUs. We tested 47 ...

MacRumors

M3 Ultra Chip is Only 10% Faster Than M4 Max in Benchmark Results

The first alleged benchmark result for Apple's new M3 Ultra chip has surfaced in the Geekbench 6 database tonight, allowing for more performance comparisons. The high-end chip is available in the new ...

MIT Technology Review

How to build a better AI benchmark

To fix the way we test and measure models, AI is learning tricks from social science. It’s not easy being one of Silicon Valley’s favorite benchmarks. SWE-Bench (pronounced “swee bench”) launched in ...

MIT Technology Review

The way we measure progress in AI is terrible

Many of the most popular benchmarks for AI models are outdated or poorly designed. Every time a new AI model is released, it’s typically touted as acing its performance against a series of benchmarks.

NextBigFuture

XAI Grok 4 Has Leading Benchmarks

XAI Grok 4 Benchmarks are showing it is the leading model. Humanity Last Exam at 35 and 45 for reasoning is a big improvement from about 21 for other top models. If these leaked Grok 4 benchmarks are ...

Accounting TodayOpinion

"AI can't do accounting" benchmarks are asking the wrong question

The industry framing of "what can AI do?" is mismatched with how AI is deployed today, and the conclusions can leave firms a ...

Hosted on MSN

Benchmark’s (NYSE:BHE) Q1 CY2026 earnings results: Revenue in line with expectations, stock jumps 13.9%

Electronics manufacturing services provider Benchmark (NYSE:BHE) in Q1 CY2026, with sales up 7.2% year on year to $677.3 million. The company expects next quarter’s revenue to be around $720 million, ...

The Information

Cerebras IPO Winners Include Foundation, Benchmark—and OpenAI

Over the last decade, there were times when Cerebras investors could be forgiven for losing hope. The company, which was attempting the difficult and expensive task of developing an alternative to ...

Some results have been hidden because they may be inaccessible to you

Show inaccessible results