Monash University Research Group

Vision & Learning for
Autonomous AI

Advancing the theoretical foundations and practical applications of computer vision and machine learning for embodied agents.

About the lab

Perceiving, Predicting,
& Interacting.

Our research addresses the theoretical foundations and practical applications of computer vision and machine learning for an embodied AI to perceive, predict and interact with the dynamic environment around it.

Our interest lies in discovering and proposing the fundamental principles, algorithms and practical implementations for solving high-level visual perception such as:

  • Object and scene understanding and reconstruction.
  • Prediction and reasoning about human motion, activity and behaviour in the presence of physical and social interactions.

Our overarching aim is to develop an end-to-end perception system for an embodied agent to learn, perceive and act simultaneously through interaction with the dynamic world.

Lab Research

Latest Project

JRMOT: Robot Visual Perception

Impact

Seminal Works

TPAMI 2023

Unifying Flow, Stereo and Depth Estimation

CVPR 2022

GMFlow: Learning Optical Flow via Global Matching

TPAMI 2021

JRDB: A Dataset and Benchmark of Egocentric Robot Visual Perception

Top 7 Highly Cited

Generalized Intersection over Union (GIoU): A Metric and A Loss

CVPR 2019

View all publications

Updates

Latest News

Explore Archive
News Image
Feb 2026

Our work MATA published in ICLR 2026

Congratulations to Zhixi and the team for this outstanding achievement.

News Image
Jan 2026

Aeroseg++ published in TOMM 2026

Congratulations to Saikat for his work on scale-aware segmentation.

News Image
Nov 2025

Work MGIoU published in AAAI 2025

Tho Le's work on generalized intersection over union for parametric shapes.