/HYHO /HYHO 7RWDO 9*31>@         (&5$3>@         0,(/ RXU         +XPDQ         +XPDQ ZR,QWHUDFWLRQ          x 1.2 x 0.54 • 0,(/ VKRZHGWLPHVKLJKHUSHUIRUPDQFHWKDQ(&5$3DERXW65 7RS • ,Q/HYHOTXHULHV0,(/FRXOGVXSSOHPHQWLQIRUPDWLRQODFNLQJLQODQJXDJHTXHULHVXVLQJ+5, • &RPSDUHGWR+XPDQDQG+XPDQ ZR,QWHUDFWLRQ 0,(/DFKLHYHGDSSUR[LPDWHO\65 7RS  [11] J. Hu et al “VGPN: Voice-Guided Pointing Robot Navigation for Humans,” IEEE ROBIO, pp.1107–1112, 2018. [6] A. Oyama et al. ECRAP: Exophora Resolution and Classifying User Commands for Robot Action Planning by Large Language Models. IEEE IRC, pp.1–8, 2024. R   = 1 7ULDOV 6XFFHVVRU)DOVH RU 5HVXOWV&DVHRIXVHULVYLVLEOHIURPWKHURERW VSRVLWLRQ