{"id":2548,"date":"2025-06-09T15:11:49","date_gmt":"2025-06-09T15:11:49","guid":{"rendered":"https:\/\/musictechohio.online\/site\/apple-damning-paper-ai-reasoning\/"},"modified":"2025-06-09T15:11:49","modified_gmt":"2025-06-09T15:11:49","slug":"apple-damning-paper-ai-reasoning","status":"publish","type":"post","link":"https:\/\/musictechohio.online\/site\/apple-damning-paper-ai-reasoning\/","title":{"rendered":"Apple Researchers Just Released a Damning Paper That Pours Water on the Entire AI Industry"},"content":{"rendered":"<div>\n<div><img loading=\"lazy\" width=\"1200\" height=\"630\" src=\"https:\/\/wordpress-assets.futurism.com\/2025\/06\/apple-damning-paper-ai-reasoning-1.jpg\" class=\"attachment-full size-full wp-post-image\" alt='Researchers at Apple have released a damning paper that throws cold water on the \"reasoning\" capabilities of modern AIs.' style=\"margin-bottom: 15px;\" decoding=\"async\"><\/div>\n<p>Researchers at Apple have released an <a href=\"https:\/\/ml-site.cdn-apple.com\/papers\/the-illusion-of-thinking.pdf\">eyebrow-raising paper<\/a> that throws cold water on the &#8220;reasoning&#8221; capabilities of the latest, most powerful large language models.<\/p>\n<p>In the paper, a team of machine learning experts makes the case that the AI industry is grossly overstating the ability of its top AI models, including OpenAI&#8217;s o3, Anthropic&#8217;s Claude 3.7, and Google&#8217;s Gemini.<\/p>\n<p>In particular, the researchers assail the claims of companies like OpenAI that their most advanced models can now &#8220;reason&#8221; \u2014 a supposed capability that the Sam Altman-led company has <a href=\"https:\/\/openai.com\/index\/learning-to-reason-with-llms\/\">increasingly leaned on<\/a> over the past year for marketing purposes \u2014 which the Apple team characterizes as merely an &#8220;illusion of thinking.&#8221;<\/p>\n<p>It&#8217;s a particularly noteworthy finding, considering Apple has been <a href=\"https:\/\/www.unite.ai\/how-apple-lost-the-ai-race-ahead-of-wwdc-2025\/\">accused<\/a> of falling far behind the competition in the AI space. The company has chosen a far more careful path to integrating the tech in its consumer-facing products \u2014 with some seriously <a href=\"https:\/\/futurism.com\/apple-ai-disaster\">mixed results<\/a> so far.<\/p>\n<p>In theory, reasoning models break down user prompts into pieces and use sequential &#8220;chain of thought&#8221; steps to arrive at their answers. But now, Apple&#8217;s own top minds are questioning whether frontier AI models simply aren&#8217;t as good at &#8220;thinking&#8221; as they&#8217;re being made out to be.<\/p>\n<p>&#8220;While these models demonstrate improved performance on reasoning benchmarks, their fundamental capabilities, scaling properties, and limitations remain insufficiently understood,&#8221; the team wrote in its paper.<\/p>\n<p>The authors \u2014 who include Samy Bengio, the director of Artificial Intelligence and Machine Learning Research at the software and hardware giant \u2014 argue that the existing approach to benchmarking &#8220;often suffers from data contamination and does not provide insights into the reasoning traces\u2019 structure and quality.&#8221;<\/p>\n<p>By using &#8220;controllable puzzle environments,&#8221; the team estimated the AI models&#8217; ability to &#8220;think&#8221; \u2014 and made a seemingly damning discovery.<\/p>\n<p>&#8220;Through extensive experimentation across diverse puzzles, we show that frontier [large reasoning models] face a complete accuracy collapse beyond certain complexities,&#8221; they wrote.<\/p>\n<p>Thanks to a &#8220;counter-intuitive scaling limit,&#8221; the AIs&#8217; reasoning abilities &#8220;declines despite having an adequate token budget.&#8221;<\/p>\n<p>Put simply, even with sufficient training, the models are struggling with problem beyond a certain threshold of complexity \u2014 the result of &#8220;an &#8216;overthinking&#8217; phenomenon,&#8221;\u00a0in the paper&#8217;s phrasing.<\/p>\n<p>The finding is reminiscent of a broader trend. Benchmarks have shown that the latest generation of reasoning models is <a href=\"https:\/\/futurism.com\/ai-industry-problem-smarter-hallucinating\"><em>more\u00a0<\/em>prone to hallucinating, not less<\/a>, indicating the tech may now be heading in the wrong direction in a key way.<\/p>\n<p>Exactly how reasoning models choose which path to take remains surprisingly murky, the Apple researchers found.<\/p>\n<p>&#8220;We found that LRMs have limitations in exact computation,&#8221; the team concluded in its paper. &#8220;They fail to use explicit algorithms and reason inconsistently across puzzles.&#8221;<\/p>\n<p>The researchers claim their findings raise &#8220;crucial questions&#8221; about the current crop of AI models&#8217; &#8220;true reasoning capabilities,&#8221; undercutting a much-hyped new avenue in the burgeoning industry.<\/p>\n<p>That&#8217;s despite tens of billions of dollars being poured into the tech&#8217;s development, with the likes of OpenAI, Google, and Meta, constructing enormous data centers to run increasingly power-hungry AI models.<\/p>\n<p>Could the Apple researchers&#8217; finding be <a href=\"https:\/\/futurism.com\/the-byte\/ai-expert-crash-imminent\">yet another<\/a> canary in the coalmine, suggesting the tech has &#8220;hit a wall&#8221;?<\/p>\n<p>Or is the company trying to hedge its bets, calling out its outperforming competition as it lags behind, as <a href=\"https:\/\/x.com\/sanderssays\/status\/1932078564536463764\">some have suggested<\/a>?<\/p>\n<p>It&#8217;s certainly a surprising conclusion, considering Apple&#8217;s precarious positioning in the AI industry:\u00a0at the same time that its researchers are trashing the tech&#8217;s current trajectory, it&#8217;s promised a suite of Apple Intelligence tools for its devices like the iPhone and MacBook.<\/p>\n<p>&#8220;These insights challenge prevailing assumptions about LRM capabilities and suggest that current approaches may be encountering fundamental barriers to generalizable reasoning,&#8221; the paper reads.<\/p>\n<p><strong>More on AI models:<\/strong> <em><a href=\"https:\/\/futurism.com\/car-dealerships-ai-voice\">Car Dealerships Are Replacing Phone Staff With AI Voice Agents<\/a><\/em><\/p>\n<p>The post <a href=\"https:\/\/futurism.com\/apple-damning-paper-ai-reasoning\">Apple Researchers Just Released a Damning Paper That Pours Water on the Entire AI Industry<\/a> appeared first on <a href=\"https:\/\/futurism.com\/\">Futurism<\/a>.<\/p>\n<\/div>\n<div style=\"margin-top: 0px; margin-bottom: 0px;\" class=\"sharethis-inline-share-buttons\" ><\/div>","protected":false},"excerpt":{"rendered":"<p>Researchers at Apple have released an eyebrow-raising paper that throws cold water on the &#8220;reasoning&#8221; capabilities of the latest, most powerful large language models. In the paper, a team of&hellip;<\/p>\n","protected":false},"author":1,"featured_media":0,"comment_status":"closed","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[1513,177,1453,1514],"tags":[],"class_list":["post-2548","post","type-post","status-publish","format-standard","hentry","category-apple","category-artificial-intelligence","category-large-language-models","category-reasoning-models"],"_links":{"self":[{"href":"https:\/\/musictechohio.online\/site\/wp-json\/wp\/v2\/posts\/2548","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/musictechohio.online\/site\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/musictechohio.online\/site\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/musictechohio.online\/site\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/musictechohio.online\/site\/wp-json\/wp\/v2\/comments?post=2548"}],"version-history":[{"count":0,"href":"https:\/\/musictechohio.online\/site\/wp-json\/wp\/v2\/posts\/2548\/revisions"}],"wp:attachment":[{"href":"https:\/\/musictechohio.online\/site\/wp-json\/wp\/v2\/media?parent=2548"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/musictechohio.online\/site\/wp-json\/wp\/v2\/categories?post=2548"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/musictechohio.online\/site\/wp-json\/wp\/v2\/tags?post=2548"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}