Look, listen, and act: Towards audio-visual embodied navigation C Gan, Y Zhang, J Wu, B Gong, JB Tenenbaum 2020 IEEE International Conference on Robotics and Automation (ICRA), 9701-9707, 2020 | 143 | 2020 |
Watch, Reason and Code: Learning to Represent Videos Using Program X Duan, Q Wu, C Gan, Y Zhang, W Huang, A van den Hengel, W Zhu Proceedings of the 27th ACM International Conference on Multimedia, 1543-1551, 2019 | 6 | 2019 |