Benchmark Full - Search News

OpenAI Life Science Benchmark Reveals AI Passes Only 1 in 3 Scientific Research Tasks

AI life science benchmark LifeSciBench, published June 17 by OpenAI with 173 PhD scientists, shows frontier models clear only ...

TechRadar

Best benchmarks software of 2025

We list the best benchmarks software, to make it simple and easy to improve your PC's performance and test it against other hardware set-ups. This is especially useful if looking to buy a new PC, or ...

Seeking Alpha

Benchmark forecasts 9% to 10% full-year revenue growth as Semi-Cap and AC&C ramp

"In the first quarter, we delivered revenue of $677 million and EPS of $0.58, both coming in towards the higher end of our expectations," said (President, CEO & Director David Moezidis). "We now ...

Nature

Small variant benchmark from a complete assembly of X and Y chromosomes

The sex chromosomes contain complex, important genes impacting medical phenotypes, but differ from the autosomes in their ploidy and large repetitive regions. To enable technology developers along ...

MIT Technology Review

How to build a better AI benchmark

To fix the way we test and measure models, AI is learning tricks from social science. It’s not easy being one of Silicon Valley’s favorite benchmarks. SWE-Bench (pronounced “swee bench”) launched in ...

22d

DeepSWE blows up the AI coding leaderboard, crowns GPT-5.5, and finds Claude Opus exploiting a benchmark loophole

DeepSWE puts GPT-5.5 atop the AI coding leaderboard while raising new questions about Claude Opus, SWE-Bench Pro, and benchmark leakage.

VentureBeat

Scale AI launches Voice Showdown, the first real-world benchmark for voice AI — and the results are humbling for some top models

Voice AI is moving faster than the tools we use to measure it. Every major AI lab — OpenAI, Google DeepMind, Anthropic, xAI — is racing to ship voice models capable of natural, real-time conversation.

Some results have been hidden because they may be inaccessible to you

Show inaccessible results