Fine-Tuning Prototypes for Cross-Domain Few-Shot Image Classification Using Contrastive Objective - MSc Thesis Proposal by: Abhishek Mahajan

Monday, May 26, 2025 - 09:00

The School of Computer Science is pleased to present…

Fine-Tuning Prototypes for Cross-Domain Few-Shot Image Classification Using Contrastive Objective

MSc Thesis Proposal by: Abhishek Mahajan

 

Date: Monday, May 26th, 2025

Time: 9:00 AM  

Location: Essex Hall, Room 122

 

Abstract:

Cross-domain few-shot learning (CFC) seeks to enable accurate classification in novel visual domains using only a few labelled samples per class. However, it remains a challenging task due to prototype misalignment and domain-induced embedding distortions, in addition to a lack of purified data. While the state-of-the-art transformation network effectively realigns prototypes in one-shot settings, it is not designed for broader domain generalization or transformer-based architectures in cross-domain settings. We introduce CosRestViT, a prototype-aware few-shot learning framework designed to tackle the problems of scarce data with a lot of outliers. Our method integrates a custom transformation network with a Vision Transformer (ViT) backbone to recalibrate noisy embeddings and align them with class-level prototypes. To enhance semantic consistency in the embedding space, we fine-tune the model using a hybrid contrastive objective that combines cross-entropy loss with CoSENT’s ranking loss. We pretrain CosRestViT on the large-scale base dataset using a prototype-aware contrastive loss to enhance the representational power of ViT and ensure effective clustering of semantically similar samples. The model is then evaluated on the Meta-Dataset benchmark under two training regimes: (i) training on all seen datasets, and (ii) training exclusively on ImageNet for transfer learning. Experimental results demonstrate that CosRestViT outperforms existing baselines across both seen and unseen domains, achieving superior generalization in low-data scenarios.

Thesis Committee:

Internal Reader: Dr. Dan Wu

External Reader: Dr. Abdul A. Hussein    

Advisor: Dr. Ziad Kobti

Vector Logo

Registration Link (only MAC students need to pre-register)