I'm a Year 3 PhD student at Nanyang Technological University (School of Electrical & Electronic Engineering) and a CIS Scholar at A*STAR's Institute for Infocomm Research (IยฒR). My advisor is Prof. Xudong Jiang.
My research focuses on efficient video understanding using state space models (Mamba), achieving significant spatiotemporal compression while maintaining segmentation accuracy. I'm particularly interested in making large foundation models computationally tractable for real-world deployment.
Originally from Tamil Nadu, India, I graduated with First-Class Honours while working full-time as a Research Assistant, where I led R&D projects that secured >$200K in government funding and supervised 40+ students.
School of Electrical and Electronic Engineering
Topic: Effective and Label-Efficient Visual Perception with Large Foundation Models
Advisor: Prof. Xudong Jiang
Institute for Infocomm Research (IยฒR), Fusionopolis, Singapore
Computing & Intelligence Systems Programme
Electrical and Electronic Engineering
FYP: Deep Features based Real-Time SLAM
Concurrent with Bachelor's studies (full-time work + part-time degree)
Final Year Project: Improving Recognition Performance for Low-Resolution Images Using DBPN
GPA: >3.9/4.0 Director's Roll of Honor 4ร Module Prize 15/19 DistinctionsIEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2025 CVPR 2025 22.1% Acc. Rate
Efficient video segmentation using shared temporal state spaces in Mamba architecture for reduced computational cost.
IEEE International Conference on Multimedia & Expo Workshops (ICMEW), 2021 ICME Workshop
Pixel-to-prototype mapping framework for improving transfer learning in few-shot classification tasks.
Machine Intelligence Research, 2025 Journal
Benchmark and analysis of Segment Anything Model 2 for video-level semantic segmentation tasks.
Journal of Automation and Intelligence, 2023 Journal
Few-shot learning approach for tire pattern identification in forensic investigations with limited training samples.
CVPR 2026 Conference Submission, December 2025 Under Review
Efficient state-space compression framework that achieves 85% token reduction while maintaining competitive accuracy with 1.8ร speedup in both training and inference.
arXiv:2506.13552, 2025 Under Review
Holistic review of video scene parsing covering semantic, instance, panoptic segmentation and open-vocabulary methods.
IEEE 19th Conference on Industrial Electronics and Applications (ICIEA), 2024 ICIEA 2024
Visual SLAM system for real-time tracking and monitoring of underwater objects in challenging aquatic environments.
IEEE 18th Conference on Industrial Electronics and Applications (ICIEA), 2023 ICIEA 2023
Efficient mobile-optimized deep learning model for automated tire pattern recognition and classification.
IEEE 17th Conference on Industrial Electronics and Applications (ICIEA), 2022 ICIEA 2022
Integrated framework combining visual SLAM with real-time object detection for versatile robotic applications.
IEEE 17th Conference on Industrial Electronics and Applications (ICIEA), 2022 ICIEA 2022
YOLO-based AI system for automated thermal screening and facial detection in public health monitoring.
IEEE 17th Conference on Industrial Electronics and Applications (ICIEA), 2022 ICIEA 2022
Visual SLAM-based borescope navigation system for 3D reconstruction and inspection of internal aircraft structures.
IEEE 16th Conference on Industrial Electronics and Applications (ICIEA), 2021 ICIEA 2021
Deep Back-Projection Network for super-resolution enhancement of low-resolution images to improve recognition accuracy.
*denotes corresponding/equal contribution
โ View all publications on Google Scholar
Singapore
A*STAR IยฒR, Fusionopolis
_________________________
Rapid-Rich Object Search Lab (ROSE) | NTU Singapore