2 Comments
User's avatar
Owen Skriloff's avatar

ScaleAI’s analysis shows that with a 20% error rate per action—a generous estimate for current LLMs—an agent attempting a five-step task has only a 32% chance of getting everything right.

Chris, may I have the link to ScaleAI? I could not find it at https://scale.com/leaderboard/tool_use_enterprise

Expand full comment
Chris Tyson's avatar

The original source is here on LinkedIn, albeit a bit old now, it's still very relevant depending what you're building and which tech you're using. https://www.linkedin.com/posts/quintinau_what-people-arent-talking-about-ai-agents-activity-7257746387809755136-xOz8

Expand full comment