ScarfBench: Benchmarking AI Agents for… | AI Deep Signal