I'm on the job market, looking for industry Research Scientist position! Feel free to connect with me via email!
I am a final-year Ph.D. student in the Department of Computer Science at UNC Chapel Hill, advised by Professor Gedas Bertasius. My research focuses on computer vision, video understanding, and multimodal deep learning, with a particular emphasis on efficient vision-language models, multimodal large language models (MLLMs), and long-range video analysis. My work has been published in top-tier conferences, including ECCV 2022, CVPR 2023, ECCV 2024, EMNLP 2024, and CVPR 2025.
I have completed two research internships at FAIR, Meta AI and one at Comcast AI, where I worked on multimodal large language models, video agents, and efficient models for long-range video understanding. Prior to my Ph.D., I gained valuable industry experience as a Software Engineer at Samsung R&D Institute.
Download my resumé.
PhD in Computer Science, 2021-Present
UNC Chapel Hill
MSc in Computer Science, 2021-2023
UNC Chapel Hill
BSc in Computer Science and Engineering, 2014-2018
Bangladesh University of Engineering and Technology