{"id":4702,"date":"2025-08-23T10:45:20","date_gmt":"2025-08-23T10:45:20","guid":{"rendered":"https:\/\/musictechohio.online\/site\/ai-polling-inaccuracy\/"},"modified":"2025-08-23T10:45:20","modified_gmt":"2025-08-23T10:45:20","slug":"ai-polling-inaccuracy","status":"publish","type":"post","link":"https:\/\/musictechohio.online\/site\/ai-polling-inaccuracy\/","title":{"rendered":"Political Pollsters Are Trying to Save Money by Polling AI Instead of Real People, and It\u2019s Going About as Well as You\u2019d Expect"},"content":{"rendered":"<div>\n<div><img width=\"2400\" height=\"1260\" src=\"https:\/\/wordpress-assets.futurism.com\/2025\/08\/ai-polling-inaccuracy.jpg\" class=\"attachment-full size-full wp-post-image\" alt=\"Pollsters are surveying AI instead of real people to save time and money \u2014 but the responses, per new research, are not up to snuff.\" style=\"margin-bottom: 15px;\" decoding=\"async\" loading=\"lazy\"><\/div>\n<p>As if the institution of political polling weren&#8217;t <a href=\"https:\/\/newsroom.haas.berkeley.edu\/polling-101-how-accurate-are-election-polls\/\">already fraught enough<\/a>, pollsters are now <a href=\"https:\/\/futurism.com\/the-byte\/harvard-experts-polling-ai\">surveying AI instead of real people<\/a> to cut costs and save time.<\/p>\n<p>As new research demonstrates, AI is <a href=\"https:\/\/www.cambridge.org\/core\/journals\/political-analysis\/article\/synthetic-replacements-for-human-survey-data-the-perils-of-large-language-models\/B92267DC26195C7F36E63EA04A47D2FE\">clearly not up to the job<\/a> \u2014 but that probably won&#8217;t stop any <a href=\"https:\/\/missouriindependent.com\/2024\/10\/01\/pollsters-are-turning-to-ai-this-election-season\/\">firms who&#8217;ve bought in<\/a> to <a href=\"https:\/\/www.newsweek.com\/nearly-half-employees-trust-ai-more-their-coworkers-2113159\">continue doing so<\/a>.<\/p>\n<p>In a <a href=\"https:\/\/report.verasight.io\/synthetic-sampling\/\">white paper<\/a> about the topic for the survey platform Verasight, data journalist G. Elliott Morris found, when comparing 1,500 &#8220;synthetic&#8221; survey respondents and 1,500 real people, that large language models (LLMs) were overall very bad at\u00a0reflecting the views of actual human respondents.<\/p>\n<p>Using six OpenAI models \u2014 GPT-4.1,\u00a0 GPT-4.1 nano, GPT-4.1 mini, GPT-4o, GPT-4o mini, and o4-mini \u2014 Morris instructed each LLM to respond as various demographics. In one example, the researcher prompted the LLMs to respond as a white 61-year-old woman in Florida who makes between $50,000 and $75,000 per year and who considers herself a moderate voter.<\/p>\n<p>Using typical real-world political survey questions, the LLMs were asked things like &#8220;Do you approve or disapprove of the way Donald Trump is handling his job as president?&#8221; and given a five-point scale ranging from &#8220;strongly approve,&#8221; &#8220;slightly approve,&#8221; &#8220;slightly disapprove,&#8221; strongly disapprove&#8221; and &#8220;don&#8217;t know\/not sure.&#8221;<\/p>\n<p>The results weren&#8217;t exactly inspiring. The worst-performing model, which was not specified, was 23 points off from the real respondents overall, while the best-performing model, GPT-4o-mini, was 4 points off.<\/p>\n<p>And the more closely Morris zoomed in, the worse things looked.\u00a0As this graphic from the study shows, even the &#8220;voters&#8221; generated by the best-performing model, 4o-mini, veered further from reality as they were instructed to respond as groups who are less well represented in the United States population, like Black, Asian and Pacific Islander respondents.<\/p>\n<p><img loading=\"lazy\" decoding=\"async\" class=\"alignnone wp-image-361761 size-full\" src=\"https:\/\/wordpress-assets.futurism.com\/2025\/08\/Screen-Shot-2025-08-19-at-2.13.52-PM.png\" alt=\"A graph visualizing the difference between human responses minus AI-generated responses, with the modeled percentage and true percentage of each group giving each response category, for each race\/ethnic and age group in the data.\" width=\"1200\" height=\"820\"><\/p>\n<p>For any pollster looking to do a good job, this is a big deal.<\/p>\n<p>Imagine, for instance, a presidential campaign crafting its messaging for Black voters using the data above. When it comes to that cohort&#8217;s disapproval rating of Trump, there was a 15 percentage point difference between what actual people responded versus what 4o-mini\u00a0predicted,\u00a0with the AI significantly exaggerating that bloc&#8217;s disapproval rate for Trump.<\/p>\n<p>Obviously, any campaign that used only that AI-generated data would miss the mark \u2014 instead of looking at the views of real respondents, it would be looking at a funhouse mirror reflection of a demographic cooked up by a language model with no access to actual data.<\/p>\n<p>&#8220;The performance of our &#8216;synthetic sample&#8217; is too poor to be useful for all of our research questions,&#8221; Morris wrote. &#8220;In computing overall population proportions, the technique above produces error rates at a minimum of several percentage points, too large to tolerate in academic, political, and most market-research contexts.&#8221;<\/p>\n<p>&#8220;Synthetic samples generate such high errors at the subgroup level,&#8221; he continued, &#8220;that we do not trust them at all to represent key groups in the population.&#8221;<\/p>\n<p>These results, while not entirely unexpected, fly in the face of the recent push to use AI-generated responses in political polling regardless of accuracy. One AI polling startup, Aaru, <a href=\"https:\/\/www.semafor.com\/article\/11\/06\/2024\/ai-startup-aaru-defends-using-artificial-intelligence-for-polling\">told\u00a0<em>Semafor<\/em><\/a> after last November&#8217;s presidential election that even though it incorrectly predicted Kamala Harris would win, its methods were, somehow, still superior to traditional polling.<\/p>\n<p>&#8220;A coin flip is a coin flip. 53-47 is not significantly different from 48-52,&#8221; Aaru cofounder Cameron Fink told the website. &#8220;Statistically speaking, we\u2019re within the margin of error \u2014 so we did well.&#8221;<\/p>\n<p>Heck, maybe that&#8217;s a good enough attitude to get hired by a political campaign \u2014 but it&#8217;s not going to do much help for any candidate trying to win.<\/p>\n<p><strong>More on AI and politics: <\/strong><a href=\"https:\/\/futurism.com\/botnet-ai-maga-broken-jeffrey-epstein\"><em>AI Powering MAGA Botnet Confused by Trump&#8217;s Connections to Epstein, Starts Contradicting Itself<\/em><\/a><\/p>\n<p>The post <a href=\"https:\/\/futurism.com\/ai-polling-inaccuracy\">Political Pollsters Are Trying to Save Money by Polling AI Instead of Real People, and It\u2019s Going About as Well as You\u2019d Expect<\/a> appeared first on <a href=\"https:\/\/futurism.com\/\">Futurism<\/a>.<\/p>\n<\/div>\n<div style=\"margin-top: 0px; margin-bottom: 0px;\" class=\"sharethis-inline-share-buttons\" ><\/div>","protected":false},"excerpt":{"rendered":"<p>As if the institution of political polling weren&#8217;t already fraught enough, pollsters are now surveying AI instead of real people to cut costs and save time. As new research demonstrates,&hellip;<\/p>\n","protected":false},"author":1,"featured_media":0,"comment_status":"closed","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[177,280,3446,3447],"tags":[],"class_list":["post-4702","post","type-post","status-publish","format-standard","hentry","category-artificial-intelligence","category-politics","category-polling","category-surveys"],"_links":{"self":[{"href":"https:\/\/musictechohio.online\/site\/wp-json\/wp\/v2\/posts\/4702","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/musictechohio.online\/site\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/musictechohio.online\/site\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/musictechohio.online\/site\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/musictechohio.online\/site\/wp-json\/wp\/v2\/comments?post=4702"}],"version-history":[{"count":0,"href":"https:\/\/musictechohio.online\/site\/wp-json\/wp\/v2\/posts\/4702\/revisions"}],"wp:attachment":[{"href":"https:\/\/musictechohio.online\/site\/wp-json\/wp\/v2\/media?parent=4702"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/musictechohio.online\/site\/wp-json\/wp\/v2\/categories?post=4702"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/musictechohio.online\/site\/wp-json\/wp\/v2\/tags?post=4702"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}