Jaden Norman@lemmy.world to Technology@lemmy.worldEnglish · 4 days agoAI agents wrong ~70% of time: Carnegie Mellon studywww.theregister.comexternal-linkmessage-square284fedilinkarrow-up1983arrow-down121cross-posted to: technology@lemmy.mltechnology@beehaw.org
arrow-up1962arrow-down1external-linkAI agents wrong ~70% of time: Carnegie Mellon studywww.theregister.comJaden Norman@lemmy.world to Technology@lemmy.worldEnglish · 4 days agomessage-square284fedilinkcross-posted to: technology@lemmy.mltechnology@beehaw.org
minus-squaredylanmorgan@slrpnk.netlinkfedilinkEnglisharrow-up3·4 days agoWhat are you checking against? Part of my job is looking for events in cities that are upcoming and may impact traffic, and ChatGPT has frequently missed events that were obviously going to have an impact.
minus-squarelepinkainen@lemmy.worldlinkfedilinkEnglisharrow-up5·4 days agoLLMs are shit at current events Perplexity is kinda ok, but it’s just a search engine with fancy AI speak on top
What are you checking against? Part of my job is looking for events in cities that are upcoming and may impact traffic, and ChatGPT has frequently missed events that were obviously going to have an impact.
LLMs are shit at current events
Perplexity is kinda ok, but it’s just a search engine with fancy AI speak on top