Interact with me: Joint Egocentric Forecasting of Intent to Interact, Attitude and Social Actions

Bian, Tongfei; Ma, Yiming; Chollet, Mathieu; Sanchez, Victor; Guha, Tanaya

Computer Science > Computer Vision and Pattern Recognition

arXiv:2412.16698 (cs)

[Submitted on 21 Dec 2024]

Title:Interact with me: Joint Egocentric Forecasting of Intent to Interact, Attitude and Social Actions

Authors:Tongfei Bian, Yiming Ma, Mathieu Chollet, Victor Sanchez, Tanaya Guha

View PDF HTML (experimental)

Abstract:For efficient human-agent interaction, an agent should proactively recognize their target user and prepare for upcoming interactions. We formulate this challenging problem as the novel task of jointly forecasting a person's intent to interact with the agent, their attitude towards the agent and the action they will perform, from the agent's (egocentric) perspective. So we propose \emph{SocialEgoNet} - a graph-based spatiotemporal framework that exploits task dependencies through a hierarchical multitask learning approach. SocialEgoNet uses whole-body skeletons (keypoints from face, hands and body) extracted from only 1 second of video input for high inference speed. For evaluation, we augment an existing egocentric human-agent interaction dataset with new class labels and bounding box annotations. Extensive experiments on this augmented dataset, named JPL-Social, demonstrate \emph{real-time} inference and superior performance (average accuracy across all tasks: 83.15\%) of our model outperforming several competitive baselines. The additional annotations and code will be available upon acceptance.

Subjects:	Computer Vision and Pattern Recognition (cs.CV); Human-Computer Interaction (cs.HC)
Cite as:	arXiv:2412.16698 [cs.CV]
	(or arXiv:2412.16698v1 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2412.16698

Submission history

From: Tongfei Bian [view email]
[v1] Sat, 21 Dec 2024 16:54:28 UTC (5,510 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:Interact with me: Joint Egocentric Forecasting of Intent to Interact, Attitude and Social Actions

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:Interact with me: Joint Egocentric Forecasting of Intent to Interact, Attitude and Social Actions

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators