Education
-
2022.03 - Present |
Geelong, Australia |
Applied Artificial Intelligence Institute, Deakin University
Video Understanding
- Action Recognition in Videos
- Instance Segmentation
- Human-Centric Video Analysis
-
2015.08 - 2019.06 |
Hanoi, Vietnam |
University of Engineering and Technology, Vietnam National University
Signal Processing
- Digital Signal Processing
- Image Processing
- Pattern Recognition
-
2012.08 - 2015.06 |
Hanoi, Vietnam |
Hanoi - Amsterdam High School for the Gifted
Specialized in Mathematics
Awards
-
2024.08.15
6th Large-scale Video Object Segmentation Challenge (LSVOS), ECCV2024
LSVOS is a prestigious competition held regularly in conjunction with top-tier computer vision conferences. Focused on advancing the state of the art in video object segmentation, the challenge encourages innovative solutions that work on large-scale datasets. Previous challenge results can be found here.
-
2023.06.18
OmniLabel Challenge, CVPR2023
The OmniLabel Challenge aims to detect objects in images using free-form query input. The total prize in this competition is $10,000 USD, with $3,300 USD awarded to the second-place winner.
-
2015 - 2019
University of Engineering and Technology, Vietnam National University
Scholarships awarded each semester based on academic performance, covering 100% of tuition fees with additional support for living expenses.
-
2019
President of Vietnam National University
For achievement as the Valedictorian of the Electronics and Communications Faculty and one of the five most outstanding students in the entire 2015-2019 class at the VNU University of Engineering and Technology.
-
2019
LSI Design Contest in Okinawa
Participated in the 22nd LSI Design Contest in Okinawa, Japan, where our team advanced to the final round and was awarded the Fighting Spirit Prize.
-
2014
Hanoi Department of Education and Training
Work
-
Vin Big Data Institute (Vingroup)
Developed an AI-integrated controller module for autonomous vehicle navigation, as part of Vingroup's Autopilot project.
- Computer Vision
- Deep Learning
-
University of Engineering and Technology, Vietnam National University
Conducted fundamental research on matrix computation to develop efficient algorithms for segmenting crack-like objects in images collected by UAVs.
- Computer Vision
- Signal Processing
Publications
-
European Conference on Artificial Intelligence 2025
We propose Planner-Refiner, a novel framework that dynamically refines the alignment between visual and textual modalities in videos. Our approach leverages a two-stage refinement process, enhancing the robustness of vision-language models in video understanding tasks.
-
4th Workshop on What is Next in Multimodal Foundation Models? at ICCV 2025
We propose an agentic AI system for multimodal-guided video object segmentation.
-
Instance-Level Recognition Workshop at ECCV 2024
This report introduces the 6th Large-scale Video Object Segmentation (LSVOS) challenge, held in conjunction with the ECCV 2024 workshop.
-
Large-scale Video Object Segmentation Workshop ECCV 2024
This report introduces the 6th Large-scale Video Object Segmentation (LSVOS) challenge, held in conjunction with the ECCV 2024 workshop.
-
British Machine Vision Conference 2024
In this work, we propose a comprehensive multimodal framework for robust video-based human activity recognition. Our key contribution is a novel compositional query machine, called COMPUTER (COMPositional hUman–cenTric quERy machine), a generic neural architecture that models the interactions between a human of interest and its surroundings in both space and time.
-
Large-scale Video Object Segmentation Workshop ECCV 2024
We present our solution for the RVOS track of the 6th Large-scale Video Object Segmentation (LSVOS) challenge, held in conjunction with the ECCV 2024 workshop.
-
4th International Workshop on Deep Learning for Human Activity Recognition, IJCAI 2024
Languages
| Vietnamese |
| Native speaker |