Is it true that only 130 of thousands of "agentic AI" vendors are genuine?
ScaleAI’s analysis shows that with a 20% error rate per action—a generous estimate for current LLMs—an agent attempting a five-step task has only a 32% chance of getting everything right.
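For anyone who wants to check the arithmetic behind that figure, here is a minimal sketch. It assumes every step fails independently with the same error rate, which the quoted post does not state explicitly; the function name `chain_success_probability` is made up for illustration.

```python
def chain_success_probability(per_step_error: float, steps: int) -> float:
    """Chance that every step of a multi-step task succeeds.

    Assumption (not from the quoted post): steps fail independently
    and all share the same per-step error rate.
    """
    if not 0.0 <= per_step_error <= 1.0:
        raise ValueError("per_step_error must be between 0 and 1")
    if steps < 0:
        raise ValueError("steps must be non-negative")
    # Each step succeeds with probability (1 - error); multiply across steps.
    return (1.0 - per_step_error) ** steps


if __name__ == "__main__":
    # 20% error per action, as in the quoted claim, for a few task lengths.
    for n in (1, 5, 10):
        p = chain_success_probability(0.20, n)
        print(f"{n:>2} steps: {p:.1%} chance of an error-free run")
```

With a 20% error rate and five steps this gives 0.8^5 = 0.32768, in line with the roughly 32% quoted above. Under the same assumptions, a ten-step task drops to about 11%.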
Chris, may I have the link to ScaleAI? I could not find it at https://scale.com/leaderboard/tool_use_enterprise
The original source is here on LinkedIn. It's a bit old now, but it's still very relevant depending on what you're building and which tech you're using. https://www.linkedin.com/posts/quintinau_what-people-arent-talking-about-ai-agents-activity-7257746387809755136-xOz8