Zerush@lemmy.ml to Technology@lemmy.ml · 1 day ago
Agentic Misalignment: How LLMs could be insider threats (www.anthropic.com)
3 comments · cross-posted to: technology@lemmy.world
captainastronaut@seattlelunarsociety.org · 22 hours ago
I love how these people think “we told it not to break the rules” and think somehow the stochastic parrot has understood them and will obey.