ACC: Compiling Agent Trajectories for Long-Context Training
Quick Take
ACC enhances long-context reasoning in LLMs by compiling agent trajectories into QA pairs.
Key Points
- Transforms agent trajectories into long-context QA pairs.
- Enables direct supervision of long-context reasoning.
- Achieves significant performance improvements on benchmark tasks.
Reader Mode unavailable (could not extract clean content).
Want this in your inbox every morning?
Daily brief at your local 8am — bilingual EN/中文, free.
More from arXiv cs.CL
See more →Time to REFLECT: Can We Trust LLM Judges for Evidence-based Research Agents?
The reliability of LLM judges for evaluating deep research agents is critically assessed using the REFLECT benchmark.