Domain-Consistent and Uncertainty-Aware Network for Generalizable Gaze Estimation

Link
Abstract

Unsupervised domain adaptive (UDA) gaze estimation aims to predict gaze directions of unlabeled target face or eye images given a set of annotated source images, which has been widely applied in practical applications. However, existing methods still perform poorly due to two major challenges. 1) There exists large personalized differences and style discrepancies between source and target samples, which leads the learned source model easily collapsing to biased results; 2) Data uncertainties inherent in reference samples will affect the generalization ability of their models. To tackle the above challenges, in this paper, we propose a novel Domain-Consistent and Uncertainty-Aware (DCUA) network for generalizable gaze estimation. Our DCUA network employs a two-phase framework where a primary training sub-network (PTNet) and a refined adaptation sub-network (RANet) are trained on the source and target domain, respectively. Firstly, to obtain robust and pure gaze-related features, we propose twain domain consistent constraints, that is, the intra-domain consistent constraint and the inter-domain consistent constraint. These two constraints could eliminate the impact of gaze-irrelevant factors by maintaining consistency between label and feature space. Secondly, to further improve the adaptability of our model, we propose dual uncertainty perception modules, which include an intrinsic uncertainty module and an extrinsic uncertainty module. These modules help DCUA network distinguish inferior reference samples and avoid overfitting to them. Experiments on four cross-domain gaze estimation tasks demonstrate the effectiveness of our method.

Synth

Problem:: UDA Gaze Estimation의 Domain 간 개인/스타일 차이로 인한 성능 저하 / 데이터(Source Noise/Target Pseudo Label) 불확실성으로 일반화 한계

Solution:: Twain Domain Consistent Constraints (Intra-Domain / Inter-Domain)로 Gaze 무관 요소 제거 / Dual Uncertainty Perception Modules (Intrinsic / Extrinsic)로 데이터 불확실성 관리

Novelty:: UDA Gaze Estimation에 Uncertainty Learning 최초 도입 및 Source/Pseudo Label 불확실성 동시 정량화 및 완화

Note:: Uncertainty에 관한 부분이 흥미로웠지만 이걸 정량화 및 완화 하는 방식이 좀 애매한 듯

Summary

Motivation

file-20250509090927460.png|625

Method

file-20250509091013501.png

Method 검증