Discuss before moving: Visual language navigation via multi-expert discussions Y Long, X Li, W Cai, H Dong 2024 IEEE International Conference on Robotics and Automation (ICRA), 17380 …, 2024 | 22 | 2024 |
Bridging zero-shot object navigation and foundation models through pixel-guided navigation skill W Cai, S Huang, G Cheng, Y Long, P Gao, C Sun, H Dong 2024 IEEE International Conference on Robotics and Automation (ICRA), 5228-5234, 2024 | 21 | 2024 |
Manipllm: Embodied multimodal large language model for object-centric robotic manipulation X Li, M Zhang, Y Geng, H Geng, Y Long, Y Shen, R Zhang, J Liu, H Dong Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2024 | 21 | 2024 |
InstructNav: Zero-shot System for Generic Instruction Navigation in Unexplored Environment Y Long, W Cai, H Wang, G Zhan, H Dong arXiv preprint arXiv:2406.04882, 2024 | 6 | 2024 |
Spring: Situated conversation agent pretrained with multimodal questions from incremental layout graph Y Long, B Hui, F Ye, Y Li, Z Han, C Yuan, Y Li, X Wang Proceedings of the AAAI Conference on Artificial Intelligence 37 (11), 13309 …, 2023 | 6 | 2023 |
Improving Situated Conversational Agents with Step-by-Step Multi-modal Logic Reasoning Y Long, H Zhang, B Hui, Z Yang, C Yuan, X Wang, F Huang, Y Li Proceedings of The Eleventh Dialog System Technology Challenge, 15-24, 2023 | 4 | 2023 |
Multimodal recommendation dialog with subjective preference: A new challenge and benchmark Y Long, B Hui, C Yuan, F Huang, Y Li, X Wang arXiv preprint arXiv:2305.18212, 2023 | 3 | 2023 |
Agricultural internet of things system based on cloud computing and machine learning Y Long 2019 12th International Conference on Intelligent Computation Technology and …, 2019 | 3 | 2019 |
Whether you can locate or not? Interactive Referring Expression Generation F Ye, Y Long, F Feng, X Wang Proceedings of the 31st ACM International Conference on Multimedia, 4697-4706, 2023 | 2 | 2023 |
LLM-Driven “Coach-Athlete” Pretraining Framework for Complex Text-To-Motion Generation J Fu, Y Long, X Wang, J Yin 2024 International Joint Conference on Neural Networks (IJCNN), 1-7, 2024 | | 2024 |
VDialogUE: A Unified Evaluation Benchmark for Visually-grounded Dialogue Y Li, B Hui, Z Yin, W He, R Luo, Y Long, M Yang, F Huang, Y Li arXiv preprint arXiv:2309.07387, 2023 | | 2023 |