CV | Tuyen Tran

Basics

Name	Tuyen Tran
Position	PhD Student
Email	t.tran@deakin.edu.au
Url	https://tranxuantuyen.github.io/

Education

2022.03 - Present

Geelong, Australia
PhD Student

Applied Artificial Intelligence Institute, Deakin University

Video Understanding
- Action Recognition in Videos
- Instance Segmentation
- Human-Centric Video Analysis
2015.08 - 2019.06

Hanoi, Vietnam
Bachelor of Electronics and Communications

University of Engineering and Technology, Vietnam National University

Signal Processing
- Digital Signal Processing
- Image Processing
- Pattern Recognition
2012.08 - 2015.06

Hanoi, Vietnam
Hanoi - Amsterdam High School for The Gifted

Specialized in Mathematics

Awards

2024.08.15

Second Place Award in LSVOS Challenge

6th Large-scale Video Object Segmentation Challenge (LSVOS), ECCV2024

LSVOS is the prestigious competition held regularly in conjunction with top-tier computer vision conferences. Focused on advancing the state of the art in video object segmentation, the challenge encourages innovative solutions to address the problem in large scale dataset. Previous challenge results can be found here.
2023.06.18

Second Place Award in the OmniLabel challenge

OmniLabel Challenge, CVPR2023

The OmniLabel Challenge aim to detect object in image using a free-form query input. The total prize in this competition is $10,000 USD, with $3,300 USD awarded to the second-place winner.
2015.2019

Merit based Scholarship for Top 5% Excellent Academic Students

University of Engineering and Technology, Vietnam National University

Scholarships are provided for students each semester based on the academic performance, with 100% tuition fee and additional support for living expenses
2019

Prestigious Certificate of Merit

President of Vietnam National University

For achievement as the Valedictorian of the Electronics and Communications Faculty and one of the five most outstanding students among entire 2015-2019 class at the VNU University of Engineering Technology
2019

Fighting Spirit Award in LSI Design

LSI Design Contest in Okinawa

Participated in the 22nd LSI Design Contest in Okinawa, Japan, where our team advanced to the final round and was awarded the Fighting Spirit Prize.
2014

Second prize in Ha Noi Mathematics Competition

Hanoi Department of Education and Training

Work

2020.10 - 2021.12
AI Engineering

Vin Big Data Institute (Vingroup)

Developed an AI-integrated controller module for autonomous vehicle navigation, as part of Vingroup's Autopilot project.
- Computer Vision
- Deep Learning
2019.06 - 2020.10
Research Assistant

University of Engineering and Technology, Vietnam National University

Conduct fundamental researches on matrix computation to developed efficient algorithms to segment crack-like objects from the images collected by UAV.
- Computer Vision
- Signal Processing

Publications

2025.12.07

Planner-Refiner: Dynamic Space-Time Refinement for Vision-Language Alignment in Videos

European Conference on Artificial Intelligence 2025

We propose Planner-Refiner, a novel framework that dynamically refines the alignment between visual and textual modalities in videos. Our approach leverages a two-stage refinement process, enhancing the robustness of vision-language models in video understanding tasks.
2025.10.07

Towards Agentic AI for Multimodal-Guided Video Object Segmentation

4th Workshop on What is Next in Multimodal Foundation Models? at ICCV 2025

We propose an agentic AI system for multimodal-guided video object segmentation
2024.09.09

Promptable Iterative Visual Refinement for Video Instance Segmentation

Instance-Level Recognition Workshop at ECCV 24

This report introduces the 6th Large-scale Video Object Segmentation (LSVOS) challenge in conjunction with ECCV 2024 workshop.
2024.09.09

LSVOS Challenge Report: Large-scale Complex and Long Video Object Segmentation

Large-scale Video Object Segmentation Workshop ECCV 2024

This report introduces the 6th Large-scale Video Object Segmentation (LSVOS) challenge in conjunction with ECCV 2024 workshop.
2024.09.04

Unified Compositional Query Machine with Multimodal Consistency for Video-based Human Activity Recognition

British Machine Vision Conference 2024

We propose in this work a comprehensive multimodal framework for robust video-based human activity recognition. Our key contribution is the introduction of a novel compositional query machine, called COMPUTER (COMPositional hUman–cenTric quERy machine) , a generic neural architecture that models the interactions between a human of interest and its surroundings in both space and time.
2024.08.22

The 2nd Solution for LSVOS Challenge RVOS Track: Spatial-temporal Refinement for Consistent Semantic Segmentation

Large-scale Video Object Segmentation Workshop ECCV 2024

We present our solution for the RVOS track of the 6th Large-scale Video Object Segmentation (LSVOS) challenge in conjunction with ECCV 2024 workshop.
2024.06.04

Unified Framework with Consistency across Modalities for Human Activity Recognition

4th International Workshop on Deep Learning for Human Activity Recognition, IJCAI 2024

Languages

	Vietnamese
	Native speaker

	English
	Fluent

Basics

Education

Applied Artificial Intelligence Institute, Deakin University

Video Understanding

University of Engineering and Technology, Vietnam National University

Signal Processing

Hanoi - Amsterdam High School for The Gifted

Specialized in Mathematics

Awards

6th Large-scale Video Object Segmentation Challenge (LSVOS), ECCV2024

OmniLabel Challenge, CVPR2023

The OmniLabel Challenge aim to detect object in image using a free-form query input. The total prize in this competition is $10,000 USD, with $3,300 USD awarded to the second-place winner.

University of Engineering and Technology, Vietnam National University

Scholarships are provided for students each semester based on the academic performance, with 100% tuition fee and additional support for living expenses

President of Vietnam National University

For achievement as the Valedictorian of the Electronics and Communications Faculty and one of the five most outstanding students among entire 2015-2019 class at the VNU University of Engineering Technology

LSI Design Contest in Okinawa

Participated in the 22nd LSI Design Contest in Okinawa, Japan, where our team advanced to the final round and was awarded the Fighting Spirit Prize.

Hanoi Department of Education and Training

Work

Vin Big Data Institute (Vingroup)

Developed an AI-integrated controller module for autonomous vehicle navigation, as part of Vingroup's Autopilot project.

University of Engineering and Technology, Vietnam National University

Conduct fundamental researches on matrix computation to developed efficient algorithms to segment crack-like objects from the images collected by UAV.

Publications

European Conference on Artificial Intelligence 2025

We propose Planner-Refiner, a novel framework that dynamically refines the alignment between visual and textual modalities in videos. Our approach leverages a two-stage refinement process, enhancing the robustness of vision-language models in video understanding tasks.

4th Workshop on What is Next in Multimodal Foundation Models? at ICCV 2025

We propose an agentic AI system for multimodal-guided video object segmentation

Instance-Level Recognition Workshop at ECCV 24

This report introduces the 6th Large-scale Video Object Segmentation (LSVOS) challenge in conjunction with ECCV 2024 workshop.

Large-scale Video Object Segmentation Workshop ECCV 2024

This report introduces the 6th Large-scale Video Object Segmentation (LSVOS) challenge in conjunction with ECCV 2024 workshop.

British Machine Vision Conference 2024

Large-scale Video Object Segmentation Workshop ECCV 2024

We present our solution for the RVOS track of the 6th Large-scale Video Object Segmentation (LSVOS) challenge in conjunction with ECCV 2024 workshop.

4th International Workshop on Deep Learning for Human Activity Recognition, IJCAI 2024

Languages