Auto-Rubric as Reward: From Implicit Preferences to Explicit Multimodal Generative Criteria
