Abstract: Pre-trained vision-language models have achieved strong zero-shot performance on a variety of downstream tasks. With the rapid development of vision-language models, many task-specific Contrastive ...