Mid-Training with Self-Generated Data Improves Reinforcement Learning in Language Models · DeepSignal