Configurable Reward Model for Balanced Safety… · DeepSignal