Close Menu
Luminari | Learn Docker, Kubernetes, AI, Tech & Interview PrepLuminari | Learn Docker, Kubernetes, AI, Tech & Interview Prep
  • Home
  • Technology
    • Docker
    • Kubernetes
    • AI
    • Cybersecurity
    • Blockchain
    • Linux
    • Python
    • Tech Update
    • Interview Preparation
    • Internet
  • Entertainment
    • Movies
    • TV Shows
    • Anime
    • Cricket
What's Hot

Stats: PBKS mount Target 200-plus again

June 8, 2025

‘The Lost Bus’ Teaser With Matthew McConaughey, America Ferrera

June 8, 2025

Pioneering Apple engineer Bill Atkinson dies at 74

June 8, 2025
Facebook X (Twitter) Instagram
Facebook X (Twitter) Instagram
Luminari | Learn Docker, Kubernetes, AI, Tech & Interview Prep
  • Home
  • Technology
    • Docker
    • Kubernetes
    • AI
    • Cybersecurity
    • Blockchain
    • Linux
    • Python
    • Tech Update
    • Interview Preparation
    • Internet
  • Entertainment
    • Movies
    • TV Shows
    • Anime
    • Cricket
Luminari | Learn Docker, Kubernetes, AI, Tech & Interview PrepLuminari | Learn Docker, Kubernetes, AI, Tech & Interview Prep
Home » DeepMind claims its newest AI tool is a whiz at math and science problems
AI

DeepMind claims its newest AI tool is a whiz at math and science problems

HarishBy HarishMay 14, 2025No Comments3 Mins Read
Facebook Twitter Pinterest LinkedIn Reddit WhatsApp Email
Share
Facebook Twitter Pinterest Reddit WhatsApp Email


Google’s AI R&D lab DeepMind says it has developed a new AI system to tackle problems with “machine-gradable” solutions.

In experiments, the system, called AlphaEvolve, could help optimize some of the infrastructure Google uses to train its AI models, DeepMind said. The company says it’s building a user interface for interacting with AlphaEvolve, and plans to launch an early access program for selected academics ahead of a possible broader rollout.

Most AI models hallucinate. Owing to their probabilistic architectures, they confidently make things up sometimes. In fact, newer AI models like OpenAI’s o3 hallucinate more than their predecessors, illustrating the challenging nature of the issue.

AlphaEvolve introduces a clever mechanism to cut down on hallucinations: an automatic evaluation system. The system uses models to generate, critique, and arrive at a pool of possible answers to a question, and automatically evaluates and scores the answers for accuracy.

DeepMind AlphaEvolve
DeepMind’s AlphaEvolve system is designed to be used by domain experts, the lab saysImage Credits:DeepMind

AlphaEvolve isn’t the first system to take this tack. Researchers, including a team at DeepMind several years ago, have applied similar techniques in various math domains. But DeepMind claims AlphaEvolve’s use of “state-of-the-art” models — specifically Gemini models — makes it significantly more capable than earlier instances of AI.

To use AlphaEvolve, users must prompt the system with a problem, optionally including details like instructions, equations, code snippets, and relevant literature. They must also provide a mechanism for automatically assessing the system’s answers in the form of a formula.

Because AlphaEvolve can only solve problems that it can self-evaluate, the system can only work with certain types of problems — specifically those in fields like computer science and system optimization. In another major limitation, AlphaEvolve can only describe solutions as algorithms, making it a poor fit for problems that aren’t numerical.

To benchmark AlphaEvolve, DeepMind had the system attempt a curated set of around 50 math problems spanning branches from geometry to combinatorics. AlphaEvolve managed to “rediscover” the best-known answers to the problems 75% of the time and uncover improved solutions in 20% of cases, claims DeepMind.

DeepMind also evaluated AlphaEvolve on practical problems, like boosting the efficiency of Google’s data centers, and speeding up model training runs. According to the lab, AlphaEvolve generated an algorithm that continuously recovers 0.7% of Google’s worldwide compute resources on average. The system also suggested an optimization that reduced the overall time it takes Google to train its Gemini models by 1%.

To be clear, AlphaEvolve isn’t making breakthrough discoveries. In one experiment, the system was able to find an improvement for Google’s TPU AI accelerator chip design that had been flagged by other tools earlier.

DeepMind, however, is making the same case that many AI labs do for their systems: that AlphaEvolve can save time while freeing up experts to focus on other, more important work.



Source link

Share. Facebook Twitter Pinterest LinkedIn WhatsApp Reddit Email
Previous ArticleGoogle is Gatekeeping Nextcloud by Limiting Core Functionality
Next Article OpenAI may build data centers in the UAE
Harish
  • Website
  • X (Twitter)

Related Posts

Lawyers could face ‘severe’ penalties for fake AI-generated citations, UK court warns

June 7, 2025

Trump administration takes aim at Biden and Obama cybersecurity rules

June 7, 2025

Week in Review: Why Anthropic cut access to Windsurf

June 7, 2025

Will Musk vs. Trump affect xAI’s $5 billion debt deal?

June 7, 2025

Building More Scalable GenAI Applications for Startups and Developers

June 7, 2025

2025 will be a ‘pivotal year’ for Meta’s augmented and virtual reality, says CTO

June 6, 2025
Add A Comment
Leave A Reply Cancel Reply

Our Picks

Stats: PBKS mount Target 200-plus again

June 8, 2025

‘The Lost Bus’ Teaser With Matthew McConaughey, America Ferrera

June 8, 2025

Pioneering Apple engineer Bill Atkinson dies at 74

June 8, 2025

Watch Hollywood Reporter’s TV Comedy Actress Roundtable Full Episode

June 8, 2025
Don't Miss
Blockchain

The battle for gaming data is on.

June 8, 20254 Mins Read

Opinion by: T-RO, co-founder of GamerBoomForget the old pitch about “interactive media.” Every dungeon crawl,…

Bitcoin Family Splits Seed Phrase Across Four Continents After Crypto Attacks

June 8, 2025

Dubai Real Estate Hits $18.2B in Sales Amid Tokenization Push

June 8, 2025

Bitcoin market of 2025 driven by stablecoin regulation: Finance Redefined

June 6, 2025

Subscribe to Updates

Subscribe to our newsletter and never miss our latest news

Subscribe my Newsletter for New Posts & tips Let's stay updated!

About Us
About Us

Welcome to Luminari, your go-to hub for mastering modern tech and staying ahead in the digital world.

At Luminari, we’re passionate about breaking down complex technologies and delivering insights that matter. Whether you’re a developer, tech enthusiast, job seeker, or lifelong learner, our mission is to equip you with the tools and knowledge you need to thrive in today’s fast-moving tech landscape.

Our Picks

Lawyers could face ‘severe’ penalties for fake AI-generated citations, UK court warns

June 7, 2025

Trump administration takes aim at Biden and Obama cybersecurity rules

June 7, 2025

Week in Review: Why Anthropic cut access to Windsurf

June 7, 2025

Subscribe to Updates

Subscribe to our newsletter and never miss our latest news

Subscribe my Newsletter for New Posts & tips Let's stay updated!

Facebook X (Twitter) Instagram Pinterest
  • Home
  • About Us
  • Advertise With Us
  • Contact Us
  • DMCA Policy
  • Privacy Policy
  • Terms & Conditions
© 2025 luminari. Designed by luminari.

Type above and press Enter to search. Press Esc to cancel.