I am a final-year Ph.D. candidate in Artificial Intelligence Engineering at the AImageLab research group under the supervision of Professor Rita Cucchiara.
My research focuses on advancing the field of Artificial Intelligence through the development of Large Vision Language Models, Vision-and-Language Foundation Models, and Retrieval-Augmented Generation. I've published at top conferences like CVPR, ICLR, ACL, and BMVC, and recently joined Amazon Science in Cambridge, UK, as a Research Intern.
I'm thrilled to share that I will be joining Amazon Science in Cambridge, UK, for a 6-month internship as an Applied Research Intern.
I attended The Thirteenth International Conference on Learning Representations 2025 in Singapore.
My work on "Augmenting Multimodal LLMs with Self-Reflective Tokens for Knowledge-based Visual Question Answering" has been accepted at The IEEE/CVF Conference on Computer Vision and Pattern Recognition 2025!
My work on "Causal Graphical Models for Vision-Language Compositional Understanding" has been accepted at The Thirteenth International Conference on Learning Representations 2025!
I attended The international Conference on Pattern Recognition 2024 in Kolkata, India.
I attended The British Machine Vision Conference 2024 in Glasgow, United Kingdom.
I attended The European Conference on Computer Vision 2024 in Milan, Italy.
I participated in The 2024 IEEE-EURASIP Summer School on Signal Processing in Capri, Italy.
My work on "Personalizing Multimodal Large Language Models for Image Captioning: An Experimental Analysis" has been accepted at The European Conference on Computer Vision Workshops 2024!
In Conference on Computer Vision and Pattern Recognition, 2025
In International Conference on Learning Representations 2025
In British Machine Vision Conference 2024
In Conference on Computer Vision and Pattern Recognition Workshops, 2024
In Findings of the Association for Computational Linguistics, 2024
In International Conference on Pattern Recognition, 2024
Under review at a top tier journal
In European Conference on Computer Vision and Pattern Recognition, 2024
In Sensors MDPI, 2023
In IEEE Intelligent Systems, 2024
A Comparative Study of LLMs and Visual Backbones for Enhanced Visual Instruction Tuning.
I'm always open to discussing research ideas, potential collaborations, or opportunities to apply AI in innovative ways.
Email Me