Model Parity Aligner
Label-free framework that lets small VLMs such as SmolVLM-500M learn efficiently from large VLMs such as Qwen2VL-7B.
I am pursuing an MSc in Computing (AI & ML) at Imperial College London, after earning my B.Tech in Computer Science from IIT Jodhpur. My research focuses on generative and multimodal AI, especially vision-language and diffusion models, with work published at leading venues through international collaborations. Previously, I was an AI Research Engineer at MetaFusion, where I built vision-language models for attribute recognition and scene captioning, and deployed unified traffic detection systems now used across Indian cities. I aim to advance AI research that bridges theory with real-world impact.
MSc in Computing (AI & ML)
Trained and deployed VLMs for traffic analytics across Indian cities
Research internship on interpretability in imbalanced learning
B.Tech in Computer Science and Engineering
Unified vision-language model trained for attribute classification and captioning.
Novel debiasing approach that generates bias-conflicting samples without explicit annotations.