What We are Missing in Multimodal LLM… | AI Deep Signal