Articles
Arbor: Tree Search as a Cognition Layer for Autonomous Agents
Arbor is a multi-agent framework using structured tree search as a cognition layer for autonomous agents, enabling full-stack LLM inference optimization with up...
ToolSense: A Diagnostic Framework for Auditing Parametric Tool Knowledge in LLMs
ToolSense diagnoses whether LLMs truly understand tools in agent systems, revealing a 50–64% performance gap between benchmark and real-world retrieval.
xAI Launches Grok Build Plugin Marketplace with MongoDB, Vercel, Sentry, Chrome DevTools, Cloudflare, and Superpowers Plugins
The new marketplace brings plugins from all six providers to terminal-based coding.
