Md Mohaiminul Islam

Md Mohaiminul Islam

PhD Student UNC Chapel Hill Research Scientist Intern Meta AI

Biography

I am a fourth-year PhD student in the Department of Computer Science at UNC Chapel Hill, where I have the privilege of working with Professor Gedas Bertasius. My primary research focuses on computer vision, video understanding, and multi-modal deep learning.

During the summer of 2023, I had the incredible opportunity to join FAIR, Meta AI as a Research Scientist Intern. This followed my previous experience as a Machine Learning Intern at Comcast AI during Summer 2022. Before starting my PhD program at UNC, I gained valuable industry experience as a Software Engineer at Samsung R&D Institute. Additionally, I had the honor of serving as a lecturer at the Computer Science Department of the University of Asia Pacific.

Download my resumé.

Interests
  • Computer Vision
  • Video Understanding
  • Multi-modal deep learning
  • Machine Learning
  • Natural Language Processing
Education
  • PhD in Computer Science, 2021-Present

    UNC Chapel Hill

  • MSc in Computer Science, 2021-2023

    UNC Chapel Hill

  • BSc in Computer Science and Engineering, 2014-2018

    Bangladesh University of Engineering and Technology

Recent News

Experience

 
 
 
 
 
FAIR Accel, Meta AI
Research Scientist Intern
May 2023 – Aug 2023 Menlo Park, California
 
 
 
 
 
Comcast AI
Machine Learning Intern
May 2022 – Aug 2022 Virtual
 
 
 
 
 
Lecturer
Apr 2019 – Dec 2020 Bangladesh
 
 
 
 
 
Software Engineer
Nov 2018 – Mar 2019 Bangladesh

Recent Publications

Quickly discover relevant content by filtering publications.
(2024). Video ReCap: Recursive Captioning of Hour-Long Videos. In CVPR 2024.

Cite ArXiv Website Code Dataset HuggingFace

(2024). A Simple LLM Framework for Long-Range Video Question-Answering. In ArXiv 2024.

Cite ArXiv Code

(2024). RGNet: A Unified Clip Retrieval and Grounding Network for Long Videos. In ArXiv 2024.

Cite ArXiv Code

(2023). Efficient Movie Scene Detection using State-Space Transformers. In CVPR 2023.

Cite ArXiv Code

(2022). Long Movie Clip Classification with State-Space Video Models. In ECCV 2022.

Cite ArXiv Code

(2022). COVID-DenseNet: A Deep Learning Architecture to Detect COVID-19 from Chest Radiology Images. In ICDSA 2022.

Cite ArXiv Code

Contact

  • mmiemon@cs.unc.edu
  • Raleigh, North Carolina