Mohaiminul
Mohaiminul
Home
News
Experience
Publications
Contact
CV
Light
Dark
Automatic
Publications
Type
Conference paper
Preprint
Report
Date
2024
2023
2022
Md Mohaiminul Islam
,
Ngan Ho
,
Xitong Yang
,
Tushar Nagarajan
,
Lorenzo Torresani
,
Gedas Bertasius
(2024).
Video ReCap: Recursive Captioning of Hour-Long Videos
. In
CVPR 2024
.
Cite
ArXiv
Website
Code
Dataset
HuggingFace
Kristen Grauman
,
Md Mohaiminul Islam
,
et al
(2024).
Ego-Exo4D: Understanding Skilled Human Activity from First- and Third-Person Perspectives
. In
CVPR 2024
.
Cite
ArXiv
Website
Blog
Video
Ce Zhang
,
Taixi Lu
,
Md Mohaiminul Islam
,
Ziyang Wang
,
Shoubin Yu
,
Mohit Bansal
,
Gedas Bertasius
(2024).
A Simple LLM Framework for Long-Range Video Question-Answering
. In
ArXiv 2024
.
Cite
ArXiv
Code
Tanveer Hannan
,
Md Mohaiminul Islam
,
Thomas Seidl
,
Gedas Bertasius
(2024).
RGNet: A Unified Clip Retrieval and Grounding Network for Long Videos
. In
ArXiv 2024
.
Cite
ArXiv
Code
Md Mohaiminul Islam
,
Mahmudul Hasan
,
Kishan Shamsundar Athrey
,
Tony Braskich
,
Gedas Bertasius
(2023).
Efficient Movie Scene Detection using State-Space Transformers
. In
CVPR 2023
.
Cite
ArXiv
Code
Md Mohaiminul Islam
,
Gedas Bertasius
(2022).
Long Movie Clip Classification with State-Space Video Models
. In
ECCV 2022
.
Cite
ArXiv
Code
Md Mohaiminul Islam
,
Gedas Bertasius
(2022).
Object State Change Classification in Egocentric Videos using the Divided Space-Time Attention Mechanism
. In
Ego4D Workshop, CVPR 2022
.
Cite
ArXiv
Code
Md Mohaiminul Islam
,
Tanveer Hannan
,
Laboni Sarker
,
Zakaria Ahmed
(2022).
COVID-DenseNet: A Deep Learning Architecture to Detect COVID-19 from Chest Radiology Images
. In
ICDSA 2022
.
Cite
ArXiv
Code
Cite
×