Diagnosing Multi-step Reasoning Failures in… | AI Deep Signal