TabClaw: An Interactive and Self-Evolving Agent for Spreadsheet Manipulation and Table Reasoning
Quick Answer
TabClaw is an open-source interactive AI agent designed for spreadsheet manipulation and table reasoning, enhancing task completion and reasoning performance.
Quick Take
TabClaw is an open-source interactive AI agent designed for spreadsheet manipulation and table reasoning, enhancing task completion and reasoning performance. It clarifies user intent, manages multi-table comparisons, and personalizes workflows based on user feedback, significantly improving efficiency in data analysis tasks.
Key Points
- TabClaw supports CSV and Excel file uploads for natural-language requests.
- It features an editable execution plan and a ReAct-style analysis loop.
- The agent records workflows and extracts user memory for personalized assistance.
- Experiments show improved task completion and reasoning performance.
- TabClaw adapts to user preferences over time through feedback.
Paper Resources
Article Content
From source RSS / original summaryarXiv:2606. 10316v1 Announce Type: new Abstract: Spreadsheets and tables are widely used representations for structured data analysis, but effective analysis still requires substantial manual effort and domain expertise. Recent large language model (LLM) agents can automate parts of this process, but they often provide limited transparency into intermediate decisions, rely on implicit assumptions, struggle with multi-table comparison, and repeat similar workflows without adapting to a user's preferences.
This paper presents TabClaw, an open-source interactive AI agent for spreadsheet manipulation and table reasoning. Users upload CSV or Excel files and issue natural-language requests; TabClaw clarifies ambiguous intent, exposes an editable execution plan, streams a ReAct-style tool-using analysis loop, dispatches specialist agents for parallel multi-table reasoning, and synthesizes findings with explicit consensus and uncertainty markers.
Beyond one-off analysis, TabClaw records completed workflows, extracts persistent user memory, distills reusable skills from repeated patterns, supports package-style skill import, and upgrades skills from negative feedback. Experiments on spreadsheet manipulation and table reasoning benchmarks show that TabClaw improves executable task completion and reasoning performance while preserving an inspectable user workflow.
This paper shows how TabClaw turns spreadsheets and tables into inspectable analytical workflows while gradually personalizing itself to recurring data-analysis tasks. Our code is available.
Reader Mode unavailable (could not extract clean content).
Want this in your inbox every morning?
Daily brief at your local 8am — bilingual EN/中文, free.
More from arXiv cs.CL
See more →Time to REFLECT: Can We Trust LLM Judges for Evidence-based Research Agents?
The REFLECT benchmark reveals that current LLM judges are unreliable, achieving below 55% accuracy in evaluating reasoning and evidence use, highlighting the need for improved evaluation methods for deep research agents.