Annotations Are Not All You Need: A Cross-modal… · DeepSignal