Thursday, June 19, 2025
ECNETNews
  • Home
  • World
  • Politics
  • Business
  • Science
  • Tech

    Get Lifetime Access to Sterling Stock Picker AI for Just £51!

    AI Startup’s Chatbots Disguised as Human Employees Uncovered

    After Pornhub Departed France, This VPN Experienced a 1,000% Surge in Usage Minutes

    23andMe’s DNA Data Set to Be Sold Again

    Where to Purchase the Nintendo Switch 2 In-Store on Launch Day

    Master 50 Languages for a Lifetime for Only £26!

    Trending Tags

    • Silicon Valley
    • Climate Change
    • Election Results
  • Entertainment
    • All
    • Gaming
    • Movie
    • Music
    • Sports

    Rugby League Legend Appointed Head Coach of Perth Bears NRL Expansion Club

    Matt Dufty: The Pressure is on ‘Favourites’ Hull KR in the Challenge Cup Final

    Apple Arcade Expands Library with Nine New Games and a Bluey Crossover This Summer

    Exploring Sara Waisglass’ Boyfriend and Her Relationship History

    Listen to Mariah Carey’s Latest Song “Type Dangerous”

    Jack Grealish Responds to Critics as His Future with Man City Remains Uncertain | Football News

    Lenovo Legion Go Handheld PC Hits Best Price of the Year

    Blake Lively Drops Lawsuit, Trends Despite Ongoing Justin Baldoni Case

    Sabrina Carpenter Releases Video for Her Song “Manchild”: Watch Now

  • Lifestyle
    • All
    • Fashion
    • food
    • Health
    • Travel

    Redefining Indulgence: Healthy Comfort Foods Set to Trend in 2025

    The Next Generation of Food Delivery: Innovations for a Post-Pandemic World

    Cultural Flavors Unleashed: How Global Tastes are Transforming Local Menus

    Functional Foods: Nourishment Beyond Nutrition in 2025

    AI in the Kitchen: Predicting the Future of Cooking and Recipe Development

    Gut Health and Gourmet: The Intersection of Wellness and Fine Dining

    Zero Waste Kitchens: Trends in Food Sustainability for 2025

    From Farm to Fork: The Future of Urban Agriculture and Local Sourcing

    Snacking Reimagined: The Shift Towards Healthy, Convenience Foods

    Tech on the Table: How Innovations are Shaping Food Production in 2025

    Trending Tags

    • Climate Change
  • USA

Top AI Models Fall Short in Latest Assessment of Artificial General Intelligence

by ECNetNews
March 26, 2025
in Science

The latest ARC-AGI-2 benchmark presents a challenging new test for artificial intelligence models, revealing that even the most advanced systems currently available are struggling to meet the criteria for artificial general intelligence (AGI). This benchmark assesses not only the capabilities of AI models but also the efficiency and cost associated with their operation.

AGI is generally defined as AI that can perform any cognitive task that humans can. The ARC Prize Foundation previously introduced ARC-AGI-1 to evaluate AI reasoning abilities, and in December 2024 a high score from one of OpenAI’s models on that test sparked discussion about the company’s progress toward AGI.

However, ARC-AGI-2 has significantly raised the bar. Current AI systems cannot achieve more than a single-digit score out of 100, even though every question has been solved by at least two humans within two attempts.

Greg Kamradt, president of the ARC Prize Foundation, said the new benchmark differs from previous evaluations in what it demands. “To beat it, you must demonstrate both a high level of adaptability and high efficiency,” he remarked.

Unlike benchmarks that assess complex tasks, ARC-AGI-2 focuses on basic ones, such as changing an image based on prior examples. Current models do well on the deep learning tasks measured by ARC-AGI-1, but they fall short on these seemingly simpler challenges, which demand more intricate reasoning and interaction. For instance, OpenAI’s o3-low model scores 75.7% on ARC-AGI-1 but only 4% on ARC-AGI-2.
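To give a rough sense of what "changing an image based on prior examples" can mean, here is a toy sketch in Python. It is not an actual ARC-AGI-2 task and not how any real model works: the grids, the three candidate rules, and the function names are all made up for illustration. The idea is simply to find a rule that explains every example pair and then apply it to a new input. Real benchmark tasks are far less constrained than picking from a fixed menu of rules.

```python
from typing import Callable, Optional

# A grid is a small image: rows of integer "colour" codes (hypothetical encoding).
Grid = list[list[int]]


def flip_horizontal(grid: Grid) -> Grid:
    """Mirror each row left to right."""
    return [row[::-1] for row in grid]


def flip_vertical(grid: Grid) -> Grid:
    """Reverse the order of the rows."""
    return [row[:] for row in grid[::-1]]


def transpose(grid: Grid) -> Grid:
    """Swap rows and columns."""
    return [list(col) for col in zip(*grid)]


# Hypothetical menu of candidate rules; a real task has no such fixed list.
CANDIDATES: dict[str, Callable[[Grid], Grid]] = {
    "flip_horizontal": flip_horizontal,
    "flip_vertical": flip_vertical,
    "transpose": transpose,
}


def infer_rule(examples: list[tuple[Grid, Grid]]) -> Optional[str]:
    """Return the name of the first rule consistent with every example pair."""
    for name, rule in CANDIDATES.items():
        if all(rule(before) == after for before, after in examples):
            return name
    return None


# Two made-up (input, output) example pairs that share one hidden rule.
examples = [
    ([[1, 0], [2, 0]], [[0, 1], [0, 2]]),
    ([[3, 3, 0], [0, 4, 0]], [[0, 3, 3], [0, 4, 0]]),
]

rule_name = infer_rule(examples)
test_input = [[5, 0, 0], [0, 6, 0]]
prediction = CANDIDATES[rule_name](test_input) if rule_name else None
print(rule_name, prediction)
```

A person can usually spot a rule like this at a glance. The benchmark's tasks are built so that the rule has to be worked out fresh each time rather than looked up.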

The new benchmark also evaluates how efficiently an AI solves problems by factoring in operating costs. For instance, human testers were paid $17 per task, while the estimated cost for OpenAI’s o3-low to complete similar tasks is approximately $200.
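The size of that cost gap can be worked out with simple arithmetic, sketched here in Python. The figures come from the article, but two things are assumptions made for illustration: that the $200 estimate applies per task, and that dividing cost by score is a fair way to express "cost per solved task". The function names are hypothetical, and neither calculation is the ARC Prize Foundation's official scoring method.

```python
# Figures quoted in the article.
HUMAN_COST_PER_TASK = 17.0    # dollars paid to human testers per task
O3_LOW_COST_PER_TASK = 200.0  # estimated dollars for o3-low (assumed to be per task)
O3_LOW_ARC_AGI_2_SCORE = 0.04  # 4% of ARC-AGI-2 tasks solved


def cost_ratio(model_cost: float, human_cost: float) -> float:
    """How many times more the model costs than a human, per task."""
    return model_cost / human_cost


def cost_per_solved_task(cost_per_task: float, solve_rate: float) -> float:
    """Average spend needed to obtain one correct answer (illustrative metric)."""
    if solve_rate <= 0:
        raise ValueError("solve_rate must be positive")
    return cost_per_task / solve_rate


ratio = cost_ratio(O3_LOW_COST_PER_TASK, HUMAN_COST_PER_TASK)
per_solved = cost_per_solved_task(O3_LOW_COST_PER_TASK, O3_LOW_ARC_AGI_2_SCORE)

print(f"o3-low costs about {ratio:.1f} times as much as a human per task")
print(f"about ${per_solved:,.0f} spent per task actually solved")
```

Under those assumptions the model costs roughly 12 times as much as a human per task attempted, and the gap widens further once its low solve rate is taken into account.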

Joseph Imperial from the University of Bath highlights that this focus on balancing performance with efficiency is a notable advancement in evaluating AI. He notes that this shift may lead to more sustainable AI development, addressing concerns about energy consumption in pursuit of performance.

Nevertheless, not all experts agree with the implications of ARC-AGI-2. Catherine Flick from the University of Staffordshire argues that framing it as a measure of intelligence may be misleading, as the benchmarks primarily evaluate the ability to accomplish specific tasks rather than general intelligence. She cautions against overinterpreting these scores as evidence of human-level intelligence, stating, “What they are doing is really just responding to a particular prompt accurately.”

The future of AGI benchmarks remains an open question. If a model were to succeed in passing ARC-AGI-2, discussions around the need for continued evolution of benchmarks, such as a potential ARC-AGI-3, would likely intensify. This ongoing dialogue indicates that the pursuit of true artificial general intelligence is far from reaching a conclusion.


UNESCO Support Strengthens ECNETNews.com’s Mission

ECNETNews.com proudly acknowledges support from UNESCO’s International Programme for the Development of Communication, bolstering our mission to deliver accurate, unbiased news and foster informed communities across the world.

About Us

ECNETNews.com is a long-running news website that has served as a neutral news source for over 20 years, since 2004.

  • About
  • RSS Feed
  • International News
  • Privacy Policy

© 2025 ECNETNEWS - International News site for open minded news ECNETNews.com.
