Talk is Not Always Cheap: Promoting Wireless Sensing Models with Text Prompts

Yang, Zhenkui; Huang, Zeyi; Wang, Ge; Ding, Han; Han, Tony Xiao; Wang, Fei

Computer Science > Computer Vision and Pattern Recognition

arXiv:2504.14621 (cs)

[Submitted on 20 Apr 2025 (v1), last revised 22 Apr 2025 (this version, v2)]

Title:Talk is Not Always Cheap: Promoting Wireless Sensing Models with Text Prompts

Authors:Zhenkui Yang, Zeyi Huang, Ge Wang, Han Ding, Tony Xiao Han, Fei Wang

View PDF HTML (experimental)

Abstract:Wireless signal-based human sensing technologies, such as WiFi, millimeter-wave (mmWave) radar, and Radio Frequency Identification (RFID), enable the detection and interpretation of human presence, posture, and activities, thereby providing critical support for applications in public security, healthcare, and smart environments. These technologies exhibit notable advantages due to their non-contact operation and environmental adaptability; however, existing systems often fail to leverage the textual information inherent in datasets. To address this, we propose an innovative text-enhanced wireless sensing framework, WiTalk, that seamlessly integrates semantic knowledge through three hierarchical prompt strategies-label-only, brief description, and detailed action description-without requiring architectural modifications or incurring additional data costs. We rigorously validate this framework across three public benchmark datasets: XRF55 for human action recognition (HAR), and WiFiTAL and XRFV2 for WiFi temporal action localization (TAL). Experimental results demonstrate significant performance improvements: on XRF55, accuracy for WiFi, RFID, and mmWave increases by 3.9%, 2.59%, and 0.46%, respectively; on WiFiTAL, the average performance of WiFiTAD improves by 4.98%; and on XRFV2, the mean average precision gains across various methods range from 4.02% to 13.68%. Our codes have been included in this https URL.

Comments:	10 pages
Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2504.14621 [cs.CV]
	(or arXiv:2504.14621v2 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2504.14621

Submission history

From: Zhenkui Yang [view email]
[v1] Sun, 20 Apr 2025 13:58:35 UTC (1,776 KB)
[v2] Tue, 22 Apr 2025 14:48:39 UTC (1,775 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:Talk is Not Always Cheap: Promoting Wireless Sensing Models with Text Prompts

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:Talk is Not Always Cheap: Promoting Wireless Sensing Models with Text Prompts

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators