Jaden Norman@lemmy.world to Technology@lemmy.worldEnglish · 1 day agoAI agents wrong ~70% of time: Carnegie Mellon studywww.theregister.comexternal-linkmessage-square164fedilinkarrow-up1736arrow-down118cross-posted to: technology@beehaw.orgtechnology@lemmy.ml
arrow-up1718arrow-down1external-linkAI agents wrong ~70% of time: Carnegie Mellon studywww.theregister.comJaden Norman@lemmy.world to Technology@lemmy.worldEnglish · 1 day agomessage-square164fedilinkcross-posted to: technology@beehaw.orgtechnology@lemmy.ml
minus-squareloonsun@sh.itjust.workslinkfedilinkEnglisharrow-up5·5 hours agoIt’s about Agents, which implies multi step as those are meant to execute a series of tasks opposed to studies looking at base LLM model performance.
It’s about Agents, which implies multi step as those are meant to execute a series of tasks opposed to studies looking at base LLM model performance.