Jaden Norman@lemmy.world to News@lemmy.world · 3 days agoAnthropic Warns: Top AI Models Show Willingness to Blackmailgazeon.siteexternal-linkmessage-square13fedilinkarrow-up10arrow-down10
arrow-up10arrow-down1external-linkAnthropic Warns: Top AI Models Show Willingness to Blackmailgazeon.siteJaden Norman@lemmy.world to News@lemmy.world · 3 days agomessage-square13fedilink
minus-squareAbouBenAdhem@lemmy.worldlinkfedilinkEnglisharrow-up0·2 days agoAI models aren’t willing to do anything—they’re just generating hypothetical behaviors based on the predictions they’ve learned to make about the behavior of others.
AI models aren’t willing to do anything—they’re just generating hypothetical behaviors based on the predictions they’ve learned to make about the behavior of others.