Fine-tuning a multimodal large language model… | AI Deep Signal