Model Parity Aligner
Label-free framework enabling small VLMs like SmolVLM-500M to learn from large VLMs like Qwen2VL-7B efficiently.
I am currently pursuing an MSc in Computing (Artificial Intelligence and Machine Learning) at Imperial College London, having previously completed my B.Tech in Computer Science from the Indian Institute of Technology Jodhpur. My research interests lie in multimodal artificial intelligence, encompassing work across mutiple modalities. I have contributed to publications at prominent academic venues through collaborative research. Prior to my graduate studies, I served as an AI Research Engineer at MetaFusion, where I developed vision-language models for attribute recognition and scene captioning applications.
MSc in Computing (AI & ML)
Deployed and trained VLMs for traffic analytics across Indian cities
Research internship on interpretability in imbalanced learning
B.Tech in Computer Science and Engineering
Label-free framework enabling small VLMs like SmolVLM-500M to learn from large VLMs like Qwen2VL-7B efficiently.
Train unified vision-language model for attribute classification and captioning.
Novel debiasing approach generating bias-conflicting samples without explicit annotations.