{"id":1254,"date":"2025-05-18T09:00:29","date_gmt":"2025-05-18T09:00:29","guid":{"rendered":"https:\/\/musictechohio.online\/site\/ai-chatbots-summarizing-research\/"},"modified":"2025-05-18T09:00:29","modified_gmt":"2025-05-18T09:00:29","slug":"ai-chatbots-summarizing-research","status":"publish","type":"post","link":"https:\/\/musictechohio.online\/site\/ai-chatbots-summarizing-research\/","title":{"rendered":"AI Chatbots Are Becoming Even Worse At Summarizing Data"},"content":{"rendered":"<div>\n<div><img width=\"1200\" height=\"630\" src=\"https:\/\/wordpress-assets.futurism.com\/2025\/05\/ai-chatbots-summarizing-research.jpg\" class=\"attachment-full size-full wp-post-image\" alt=\"Researchers have found that newer AI models can omit key details from text summaries as much as 73 percent of the time.\" style=\"margin-bottom: 15px;\" decoding=\"async\" fetchpriority=\"high\"><\/div>\n<p>Ask the CEO of any AI startup, and you&#8217;ll probably get an earful about the tech&#8217;s potential to &#8220;transform work,&#8221; or &#8220;revolutionize the way we access knowledge.&#8221;<\/p>\n<p>Really, there&#8217;s no shortage of promises that AI is only getting smarter \u2014 which we&#8217;re told will speed up the rate of <a href=\"https:\/\/www.nytimes.com\/2025\/03\/10\/technology\/ai-science-lab-lila.html\">scientific breakthroughs<\/a>, <a href=\"https:\/\/allofus.nih.gov\/article\/all-of-us-artificial-intelligence-help-speed-up-search-for-promising-medicines\">streamline medical testing<\/a>, and breed a <a href=\"https:\/\/www.sssp-research.org\/ai-in-scholarship-what-is-it-and-how-can-it-help-me\/\">new kind of scholarship<\/a>.<\/p>\n<p>But according to a <a href=\"https:\/\/royalsocietypublishing.org\/doi\/10.1098\/rsos.241776\">new study<\/a> published in\u00a0<em>Royal Society Open Science<\/em>, as many as 73 percent of seemingly reliable\u00a0answers from AI chatbots could actually be inaccurate.<\/p>\n<p>The collaborative research paper looked at nearly 5,000 
large language model (LLM) summaries of scientific studies by ten widely used chatbots, including ChatGPT-4o, ChatGPT-4.5, DeepSeek, and LLaMA 3.3 70B. It found that, even when explicitly goaded into providing the right facts, AI answers lacked key details at five times the rate of human-written scientific summaries.<\/p>\n<p>&#8220;When summarizing scientific texts, LLMs may omit details that limit the scope of research conclusions, leading to generalizations of results broader than warranted by the original study,&#8221; the researchers wrote.<\/p>\n<p>Alarmingly, the LLMs&#8217; rate of error was found to increase the newer the chatbot was \u2014 the exact opposite of what AI industry leaders <a href=\"https:\/\/x.com\/8teAPi\/status\/1908688178804121724\">have been promising us<\/a>. This is in addition to a correlation between an LLM&#8217;s tendency to overgeneralize and how widely used it is, &#8220;posing a significant risk of large-scale misinterpretations of research findings,&#8221; according to the study&#8217;s authors.<\/p>\n<p>For example, use of the two ChatGPT models listed in the study doubled from 13 to 26 percent <a href=\"https:\/\/royalsocietypublishing.org\/doi\/10.1098\/rsos.241776#B23\">among US teens<\/a> between 2023 and 2025. Though the older ChatGPT-4 Turbo was roughly 2.6 times more likely to omit key details compared to the original texts, the newer ChatGPT-4o models were nine times as likely. This tendency was also found in Meta&#8217;s LLaMA 3.3 70B, which was 36.4 times more likely to overgeneralize compared to older versions.<\/p>\n<p>The job of synthesizing huge swaths of data into just a few sentences is a tricky one. 
Though it comes pretty easily to fully grown humans, it&#8217;s a <a href=\"https:\/\/www.abstractivehealth.com\/article\/why-is-summarizing-so-difficult-in-nlp\">really complicated process<\/a> to program into a chatbot.<\/p>\n<p>While the human brain can instinctively learn broad lessons from specific experiences \u2014 like touching a hot stove \u2014 complex nuances make it difficult for chatbots to know which facts to focus on. A human quickly understands that stoves can burn while refrigerators do not, but an LLM might reason that <em>all<\/em>\u00a0kitchen appliances get hot, unless otherwise told. Expand that metaphor out a bit to the scientific world, and it gets complicated fast.<\/p>\n<p>But summarizing is also time-consuming for humans; the researchers list clinical medical settings as one area where LLM summaries could have a huge impact on work. It goes the other way, too, though: in clinical work, details are extremely\u00a0important, and even the tiniest omission can compound into a life-changing disaster.<\/p>\n<p>This makes it all the more troubling that LLMs are being shoehorned into every possible workspace, from <a href=\"https:\/\/nymag.com\/intelligencer\/article\/openai-chatgpt-ai-cheating-education-college-students-school.html\">high school homework<\/a> to <a href=\"https:\/\/futurism.com\/neoscope\/new-law-ai-replace-doctor-prescribe-drugs\">pharmacies<\/a> to <a href=\"https:\/\/www.imeche.org\/news\/news-article\/feature-how-ai-is-already-changing-engineering-and-the-role-of-the-engineer\">mechanical engineering<\/a> \u2014\u00a0despite a <a href=\"https:\/\/technijian.com\/chatgpt\/ai-in-tech\/chatgpt-is-getting-smarter-but-its-hallucinations-are-spiraling-out-of-control\/\">growing body of work<\/a> showing widespread accuracy problems inherent to AI.<\/p>\n<p>However, the scientists pointed out some important limitations to their findings. 
For one, the prompts fed to LLMs can have a significant impact on the answers they spit out. Whether this affects LLM summaries of scientific papers is unknown, suggesting a future avenue for research.<\/p>\n<p>Regardless, the <a href=\"https:\/\/futurism.com\/ai-industry-problem-smarter-hallucinating\">trendlines<\/a> are clear. Unless AI developers can set their new LLMs on the right path, you&#8217;ll just have to keep relying on humble human bloggers to summarize scientific reports for you (wink).<\/p>\n<p><strong>More on AI: <\/strong><a href=\"https:\/\/futurism.com\/senators-safety-records-ai-chatbot\"><em>Senators Demand Safety Records from AI Chatbot Apps as Controversy Grows<\/em><\/a><\/p>\n<p>The post <a href=\"https:\/\/futurism.com\/ai-chatbots-summarizing-research\">AI Chatbots Are Becoming Even Worse At Summarizing Data<\/a> appeared first on <a href=\"https:\/\/futurism.com\/\">Futurism<\/a>.<\/p>\n<\/div>\n","protected":false},"excerpt":{"rendered":"<p>Ask the CEO of any AI startup, and you&#8217;ll probably get an earful about the tech&#8217;s potential to &#8220;transform work,&#8221; or &#8220;revolutionize the way we access knowledge.&#8221; Really, there&#8217;s 
no&hellip;<\/p>\n","protected":false},"author":1,"featured_media":0,"comment_status":"closed","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[316,177,476,196,189],"tags":[],"class_list":["post-1254","post","type-post","status-publish","format-standard","hentry","category-ai","category-artificial-intelligence","category-chatbots","category-chatgpt","category-meta"],"_links":{"self":[{"href":"https:\/\/musictechohio.online\/site\/wp-json\/wp\/v2\/posts\/1254","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/musictechohio.online\/site\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/musictechohio.online\/site\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/musictechohio.online\/site\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/musictechohio.online\/site\/wp-json\/wp\/v2\/comments?post=1254"}],"version-history":[{"count":0,"href":"https:\/\/musictechohio.online\/site\/wp-json\/wp\/v2\/posts\/1254\/revisions"}],"wp:attachment":[{"href":"https:\/\/musictechohio.online\/site\/wp-json\/wp\/v2\/media?parent=1254"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/musictechohio.online\/site\/wp-json\/wp\/v2\/categories?post=1254"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/musictechohio.online\/site\/wp-json\/wp\/v2\/tags?post=1254"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}