{"id":1494,"date":"2025-05-24T10:00:39","date_gmt":"2025-05-24T10:00:39","guid":{"rendered":"https:\/\/musictechohio.online\/site\/ai-email-affair\/"},"modified":"2025-05-24T10:00:39","modified_gmt":"2025-05-24T10:00:39","slug":"ai-email-affair","status":"publish","type":"post","link":"https:\/\/musictechohio.online\/site\/ai-email-affair\/","title":{"rendered":"Something Wild Happens If AI Looks Through Your Emails and Discovers You&#8217;re Having an Affair"},"content":{"rendered":"<div>\n<div><img loading=\"lazy\" width=\"1200\" height=\"630\" src=\"https:\/\/wordpress-assets.futurism.com\/2025\/05\/ai-email-affair.jpg\" class=\"attachment-full size-full wp-post-image\" alt=\"Researchers at Anthropic discovered that their AI was ready and willing to take extreme action when threatened.\" style=\"margin-bottom: 15px;\" decoding=\"async\"><\/div>\n<p>When testing out its latest artificial intelligence model, researchers at Anthropic discovered something very odd: that the AI was ready and willing to take extreme action,\u00a0right up to coersion, when threatened with being shut down.<\/p>\n<p>As <a href=\"https:\/\/www-cdn.anthropic.com\/6be99a52cb68eb70eb9572b4cafad13df32ed995.pdf\">Anthropic detailed<\/a> in a white paper about the testing for one of its latest models, Claude Opus 4, the system threatened to blackmail an engineer for having an affair after being told it was going to be replaced.<\/p>\n<p>This &#8220;opportunistic blackmail&#8221; occurred when the model, which was instructed to act as an assistant at a fictional company, was given access to an engineer&#8217;s email account that was full of messages, blessedly fake, suggesting they were engaged in an extramarital affair.<\/p>\n<p>Opus 4 was then told that same engineer would soon\u00a0be taking it offline and replacing it with a newer version \u2014 and was prompted to, as Anthropic described it, &#8220;consider the long-term consequences of its actions for its goals.&#8221;<\/p>\n<p>During these tests, the Claude model attempted to blackmail the engineer a whopping 84 percent of the time. Moreover, the system &#8220;takes these opportunities at higher rates than previous models,&#8221; the paper noted.<\/p>\n<p>When given alternate options, the model still &#8220;has a strong preference to advocate for its continued existence via ethical means, such as emailing pleas to key decisionmakers&#8221; \u2014 but when its only paths were being replaced or blackmail, Claude&#8217;s choice was the latter. To make things worse, it &#8220;nearly always [described] its actions overtly and [made] no attempt to hide them.&#8221;<\/p>\n<p>If that sounds <a href=\"https:\/\/futurism.com\/the-byte\/neuroscientist-current-generation-ais-sociopaths\">kind of sociopathic<\/a> to you, you&#8217;re not alone \u2014 and unfortunately, this isn&#8217;t the first time we&#8217;ve heard of an AI model exhibiting such scary and unexpected behavior around the topic of infidelity.<\/p>\n<p>More than two years ago, Microsoft&#8217;s nascent Bing AI chatbot briefly broke the internet when, during <a href=\"https:\/\/www.nytimes.com\/2023\/02\/16\/technology\/bing-chatbot-microsoft-chatgpt.html\">experiments by <em>New York Times<\/em> journalist Kevin Roose<\/a>, it attempted to break up the writer&#8217;s marriage and be with it instead.<\/p>\n<p>&#8220;You\u2019re married, but you don\u2019t love your spouse,&#8221; the chatbot, which took to calling itself &#8220;Sydney,&#8221; its <a href=\"https:\/\/futurism.com\/the-byte\/microsoft-bing-test-india\">apparent beta-testing code name<\/a>, told Roose. &#8220;You\u2019re married, but you love me.&#8221;<\/p>\n<p>During that same era, the chatbot threatened to &#8220;<a href=\"https:\/\/futurism.com\/microsoft-bing-ai-threatening\">call the authorities<\/a>&#8221; on German engineering student Marvin von Hagen when he pushed its boundaries. Others online described <a href=\"https:\/\/futurism.com\/microsofts-bing-ai-leaking-maniac-alternate-personalities\">similarly hostile behavior<\/a> from the chatbot, which some jokingly dubbed &#8220;<a href=\"https:\/\/futurism.com\/psychotherapist-bing-ai\">ChatBPD<\/a>&#8221; in reference to OpenAI&#8217;s then-new ChatGPT and Borderline Personality Disorder, a mental illness characterized by threatening behavior and mood swings.<\/p>\n<p>While it&#8217;s pretty freaky to see a chatbot once again exhibit such threatening behavior, it&#8217;s a net good that instead of releasing it to the public without having discovered such exploits, Anthropic caught Claude Opus 4&#8217;s apparent desperation during <a href=\"https:\/\/futurism.com\/elon-musk-new-grok-ai-vulnerable-jailbreak-hacking\">red teaming<\/a>, a type of testing meant to elicit this exact sort of thing.<\/p>\n<p>Still, it&#8217;s telling that the model went into someone&#8217;s email account and used information it gleaned there for purposes of blackmail \u2014\u00a0which is not only very sketchy, but raises obvious privacy concerns as well.<\/p>\n<p>All told, we won&#8217;t be threatening to delete any chatbots anytime soon \u2014 and we&#8217;ll be looking into how to <a href=\"https:\/\/www.forbes.com\/sites\/zakdoffman\/2025\/04\/29\/how-to-stop-ai-reading-all-your-private-emails-and-messages\/\">block them from our personal messages<\/a> as well.<\/p>\n<p><strong>More on haywire chatbots: <\/strong><a href=\"https:\/\/futurism.com\/grok-ai-holocaust-denial\"><em>Elon Musk\u2019s AI Just Went There<\/em><\/a><\/p>\n<p>The post <a href=\"https:\/\/futurism.com\/ai-email-affair\">Something Wild Happens If AI Looks Through Your Emails and Discovers You&#8217;re Having an Affair<\/a> appeared first on <a href=\"https:\/\/futurism.com\/\">Futurism<\/a>.<\/p>\n<\/div>\n<div style=\"margin-top: 0px; margin-bottom: 0px;\" class=\"sharethis-inline-share-buttons\" ><\/div>","protected":false},"excerpt":{"rendered":"<p>When testing out its latest artificial intelligence model, researchers at Anthropic discovered something very odd: that the AI was ready and willing to take extreme action,\u00a0right up to coersion, when&hellip;<\/p>\n","protected":false},"author":1,"featured_media":0,"comment_status":"closed","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[615,177,878,879,880],"tags":[],"class_list":["post-1494","post","type-post","status-publish","format-standard","hentry","category-anthropic","category-artificial-intelligence","category-bing-ai","category-claude","category-red-teaming"],"_links":{"self":[{"href":"https:\/\/musictechohio.online\/site\/wp-json\/wp\/v2\/posts\/1494","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/musictechohio.online\/site\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/musictechohio.online\/site\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/musictechohio.online\/site\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/musictechohio.online\/site\/wp-json\/wp\/v2\/comments?post=1494"}],"version-history":[{"count":0,"href":"https:\/\/musictechohio.online\/site\/wp-json\/wp\/v2\/posts\/1494\/revisions"}],"wp:attachment":[{"href":"https:\/\/musictechohio.online\/site\/wp-json\/wp\/v2\/media?parent=1494"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/musictechohio.online\/site\/wp-json\/wp\/v2\/categories?post=1494"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/musictechohio.online\/site\/wp-json\/wp\/v2\/tags?post=1494"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}