Articles
ToolSense: A Diagnostic Framework for Auditing Parametric Tool Knowledge in LLMs
ToolSense diagnoses whether LLMs truly understand tools in agent systems, revealing a 50-64% performance gap between benchmark and real-world retrieval.
Position: Hippocampal Explicit Memory Is the Cornerstone for AGI
A position paper argues hippocampal explicit memory is essential for AGI, as LLMs rely on implicit memory and lack higher-order reasoning like planning.
