Huy Anh Nguyen
Education
- Member of Computer Vision Lab
- CGPA: 3.80/4.0
- Thesis: Hand-held Object Segmentation and Tracking
- Advisor (M.Sc. and Ph.D.): Associate Prof. Minh Hoai Nguyen
- CGPA: 3.24/4.0 | Ranking 9/47 (Department)
- Thesis: New classes of operators and De Morgan triples in Picture Fuzzy Set
- Advisor: Prof. Dr. Sc. Bui Cong Cuong
Research Interests
| Computer Vision | Video Object/Instance Segmentation (VOS/VIS), Hand-Object Detection, Open Vocabulary Segmentation, Extended Reality (AR/VR/MR) |
| Machine Learning | Representation Learning, Self-Supervised Learning |
Research Experiences
- Hand-held Object Identification, Segmentation, and Tracking: Developed algorithms and datasets for identifying objects held by hands, segmenting, and tracking them across videos of varying lengths.
- Developed Unity and ROS plugins for data streaming from Hololens 2, with a rospy client for data retrieval, achieving 12 fps for RGB and 5 fps for Depth streams with 200-400ms latency. The system feeds data into action, hand, and active object recognition modules for interactive applications.
- Developed hand-object segmentation, tracking and distance estimation models using synchronized RGB and Depth frames.
- Developed a multi-device synchronized data collection system integrating a webcam, screen recording, and eye-gaze video from a GP3 tracker, achieving sub-120ms latency across all streams for accurate gaze-action alignment in cognitive science research.
- Read and experimented hand-detection techniques on egocentric video datasets.
- Conducted literature review and proved De Morgan triples from supervisor's papers, leading to the construction of new classes of picture fuzzy negation and triples.
- Presented findings at National Mathematical Conference (2018).
Publication & Presentation
[1] Supreeth Narasimhaswamy, Huy Anh Nguyen, Lihan Huang, Minh Hoai, "HOIST-Former: Hand-held Objects Identification, Segmentation, and Tracking in the Wild", IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2024
[2] Nguyen Thanh Binh, Huy Anh Nguyen, Pham Ngoc Linh, Nguyen Linh Giang, Tran Ngoc Thang, "Attentive RNN for HS Code Hierarchy Classification on Vietnamese Goods Declaration", International Conference on Intelligent System and Network (ICISN), 2021 (Springer Paper).
[3] Nguyen Huy Anh, Roan Thi Ngan, Bui Cong Cuong, "Some new classes of Picture Fuzzy negation and De Morgan triples, 9th Vietnam Mathematical Congress", Oral presentation in Discrete mathematics section, Nha Trang, Vietnam, 2018.
Work Experiences
- Researched and developed models to generate captions in Japanese from stock images.
- Developed a model (BertCap with ResNeST backbone) which was later chosen to integrate into production with less than 0.65s per image inference time using ONNX.
- Collaborated with Customs Department experts to form the problem, process over 2TB of data, and create datasets emphasizing high-risk HTS Codes.
- Developed a hierarchical classifier to predict HS Codes with 68.9% accuracy on full codes and 95% on chapter/heading, with its application aiding in fraudulent activity identification, resulting in a publication in ICISN 2021.
- Researched and developed algorithmic models, referred to as 'alphas,' to predict stock market movements, utilizing 25 datasets.
- Gold medal in WorldQuant Challenge (global - 2018)
Teaching Experiences
- CSE353 Machine Learning (Fall 2021)
- CSE519 Data Science Fundamentals (Fall 2023)
Skills
| Languages | Python, SQL, C++, C#, Java |
| Tools/ Frameworks | PyTorch, Numpy, Pandas, TensorFlow/Keras, Scikit-learn, Git, ROS, FreeFEM, Linux, Hadoop/Spark, FastAPI, ONNX, Docker |
