
    OpenAI and Google DeepMind Achieve Gold-Level in IMO Performance

    By EchoCraft AI | July 22, 2025 | 5 Mins Read

    In a notable milestone for artificial intelligence, both OpenAI and Google DeepMind have reported that their latest models reached gold-medal-level performance in the 2025 International Mathematical Olympiad (IMO), the prestigious mathematics competition for the world’s top-performing high school students.

    Highlights

    • Breakthrough in AI Reasoning: OpenAI and Google DeepMind report solving 5 out of 6 problems from the 2025 International Mathematical Olympiad (IMO), matching gold medalist performance.
    • Informal, Natural Language Reasoning: Unlike past symbolic methods, both models reasoned in plain English—signaling a leap in AI’s ability to handle abstract, creative problems.
    • Two Labs, One Goal: OpenAI describes its system as pure LLM reasoning, while DeepMind used an advanced version of Gemini Deep Think that, by DeepMind’s account, also worked end-to-end in natural language rather than through the formal proof tools of earlier years.
    • Evaluation Controversy: OpenAI’s results were graded by former IMO medalists it enlisted rather than through the official IMO process; DeepMind went through the formal review, sparking debate about procedure and credibility.
    • What Gold Means: Only ~10% of the 630+ human contestants earned gold medals. These AI models matched the top-tier performance in one of the world’s most rigorous academic competitions.
    • No Public Release Yet: Neither OpenAI nor DeepMind has released its model, citing experimental status. OpenAI suggests public access could take “many months.”
    • Strategy Over Speed: DeepMind’s emphasis on rigor and transparency reframes the AI race not just as a competition in capability, but also in scientific trust and governance.

    Both companies announced that their systems correctly solved five out of six IMO problems, surpassing the performance of most student participants.

    Unlike previous efforts that relied on formal symbolic systems requiring human pre-processing, this year’s models tackled the problems using “informal” reasoning. These models interpreted natural language questions directly and generated proof-based answers in plain English.

    Informal Reasoning

    This shift to informal systems marks a turning point in AI’s reasoning capabilities. Machines have historically struggled with the kind of ambiguity, multi-step logic, and creative thinking required in math olympiads.

    Researchers from both companies describe this achievement as a substantial advance in general reasoning, particularly on hard-to-verify problems, where a solution cannot be checked automatically the way a numeric answer or a program can.

    Google DeepMind used an advanced version of Gemini Deep Think, which by the company’s account worked end-to-end in natural language rather than through the formal proof systems of its earlier efforts. OpenAI’s approach likewise relied entirely on LLM-generated reasoning, referred to internally as “pure LLM chains.”

    Both methods underscore evolving philosophies in AI model design and hint at future directions for advanced problem-solving AI.

    Differing Approaches and Disputes Over Recognition

    While both results are technically impressive, their release sparked debate—not about performance, but about procedure.

    OpenAI publicized its achievement shortly after the IMO student awards ceremony, but before undergoing any official grading process sanctioned by the IMO. Instead, the company enlisted three former IMO medalists to independently evaluate its model’s output.

    Google DeepMind, in contrast, participated in the official IMO evaluation. The company collaborated directly with the competition’s organizers and waited until the formal grading was completed before sharing its results publicly.

    Thang Luong, who leads DeepMind’s math reasoning research, emphasized the importance of adhering to the IMO’s established evaluation standards. According to Luong, “Any evaluation not based on that guideline cannot claim gold-level performance.”

    OpenAI has since clarified that it did not initially enter the formal process but chose to contact IMO organizers only after reaching what it believed to be a gold-worthy performance.

    While OpenAI states it waited until after the student awards to make its announcement, some within the AI research community expressed concern over the timing and process transparency.

    What Gold Means in the IMO

    Out of over 630 student participants this year, only around 10% received gold medals. That AI systems could match this level underscores a rapid acceleration in machine reasoning capabilities.
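The figures above can be turned into rough numbers with a short sketch. It assumes the standard IMO format of six problems worth 7 points each, which the article does not state, and it treats every "solved" problem as earning full marks. The function names are illustrative only.

```python
# Assumption (not stated in the article): each IMO problem is worth 7 points,
# and a "solved" problem earns full marks.
POINTS_PER_PROBLEM = 7
NUM_PROBLEMS = 6


def imo_score(problems_solved: int) -> int:
    """Score for a contestant who fully solves `problems_solved` problems."""
    if not 0 <= problems_solved <= NUM_PROBLEMS:
        raise ValueError("problems_solved must be between 0 and 6")
    return problems_solved * POINTS_PER_PROBLEM


def approx_gold_count(contestants: int, gold_share: float) -> int:
    """Rough number of gold medals implied by a share of the field."""
    return round(contestants * gold_share)


ai_score = imo_score(5)                 # five of six problems solved
max_score = imo_score(NUM_PROBLEMS)     # a perfect paper
golds = approx_gold_count(630, 0.10)    # figures quoted in the article
print(f"{ai_score}/{max_score} points; roughly {golds} golds among 630 contestants")
```

Under these assumptions, five fully solved problems correspond to 35 of a possible 42 points, and a 10% gold share of a 630-person field is roughly 63 medals.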

    IMO problems demand creativity, deep abstraction, and long-form logical deduction—skills long thought to be uniquely human.

    Unlike structured coding challenges or basic logic puzzles, these problems often require sustained reasoning across multiple conceptual domains, a hallmark of elite human cognition.

    Performance and Methodology

    • Google DeepMind’s Model – An advanced version of Gemini Deep Think that, by DeepMind’s account, produces natural-language proofs directly from the problem statements.
    • OpenAI’s Model – Pure LLM-based reasoning, without external symbolic formalization. All logic is derived from generative model chaining.

    Both models executed their reasoning processes at test time using substantial computational resources.

    OpenAI has not disclosed the compute cost, but its approach appears to have relied heavily on deep inference-time reasoning, pushing the bounds of what’s possible with large-scale models.

    Implications for AI Research and Education

    This achievement is being viewed as a precursor to broader applications of AI in mathematics and science.

    By demonstrating informal proof generation at a gold-medal level, both labs signal a future where AI could contribute meaningfully to open-ended scientific problems—not just replicate known patterns.

    Neither company plans to release these exact models in the near future. OpenAI has suggested it will be “many months” before a public rollout, underscoring the models’ current experimental status.
