affiliation not provided to SSRN
Skin lesion classification, medical vision-language model, multimodal learning, visual prior consistency, parameter-efficient fine-tuning.