The Evaluation Trap: Benchmark Design as Theoretical Commitment · DeepSignal