ConTrans: Learning Text-enhanced Local-global… · DeepSignal