Learn to navigate: cooperative path planning for unmanned surface vehicles using deep reinforcement learning X Zhou, P Wu, H Zhang, W Guo, Y Liu Ieee Access 7, 165262-165278, 2019 | 141 | 2019 |
Improving knowledge tracing via pre-training question embeddings Y Liu, Y Yang, X Chen, J Shen, H Zhang, Y Yu arXiv preprint arXiv:2012.05031, 2020 | 134 | 2020 |
Bi-level actor-critic for multi-agent coordination H Zhang, W Chen, Z Huang, M Li, Y Yang, W Zhang, J Wang Proceedings of the AAAI Conference on Artificial Intelligence 34 (05), 7325-7332, 2020 | 100 | 2020 |
Offline pre-trained multi-agent decision transformer L Meng, M Wen, C Le, X Li, D Xing, W Zhang, Y Wen, H Zhang, J Wang, ... Machine Intelligence Research 20 (2), 233-248, 2023 | 73 | 2023 |
Learning correlated communication topology in multi-agent reinforcement learning Y Du, B Liu, V Moens, Z Liu, Z Ren, J Wang, X Chen, H Zhang Proceedings of the 20th International Conference on Autonomous Agents and …, 2021 | 69 | 2021 |
Settling the variance of multi-agent policy gradients JG Kuba, M Wen, L Meng, H Zhang, D Mguni, J Wang, Y Yang Advances in Neural Information Processing Systems 34, 13458-13470, 2021 | 62 | 2021 |
User response learning for directly optimizing campaign performance in display advertising K Ren, W Zhang, Y Rong, H Zhang, Y Yu, J Wang Proceedings of the 25th acm international on conference on information and …, 2016 | 50 | 2016 |
GCS: Graph-based coordination strategy for multi-agent reinforcement learning J Ruan, Y Du, X Xiong, D Xing, X Li, L Meng, H Zhang, J Wang, B Xu arXiv preprint arXiv:2201.06257, 2022 | 41 | 2022 |
Token-level Direct Preference Optimization Y Zeng, G Liu, W Ma, N Yang, H Zhang, J Wang arXiv preprint arXiv:2404.11999, 2024 | 32 | 2024 |
Large language models play starcraft ii: Benchmarks and a chain of summarization approach W Ma, Q Mi, Y Zeng, X Yan, Y Wu, R Lin, H Zhang, J Wang arXiv preprint arXiv:2312.11865, 2023 | 30 | 2023 |
Large sequence models for sequential decision-making: a survey M Wen, R Lin, H Wang, Y Yang, Y Wen, L Mai, J Wang, H Zhang, ... Frontiers of Computer Science 17 (6), 176349, 2023 | 30 | 2023 |
Offline pre-trained multi-agent decision transformer: One big sequence model tackles all smac tasks L Meng, M Wen, Y Yang, C Le, X Li, W Zhang, Y Wen, H Zhang, J Wang, ... arXiv preprint arXiv:2112.02845, 2021 | 29 | 2021 |
Botzone: an online multi-agent competitive platform for ai education H Zhou, H Zhang, Y Zhou, X Wang, W Li Proceedings of the 23rd Annual ACM Conference on Innovation and Technology …, 2018 | 28 | 2018 |
A review: machine learning for combinatorial optimization problems in energy areas X Yang, Z Wang, H Zhang, N Ma, N Yang, H Liu, H Zhang, L Yang Algorithms 15 (6), 205, 2022 | 27 | 2022 |
Layout design for intelligent warehouse by evolution with fitness approximation H Zhang, Z Guo, W Zhang, H Cai, C Wang, Y Yu, W Li, J Wang IEEE Access 7, 166310-166317, 2019 | 22 | 2019 |
Learning to design games: Strategic environments in reinforcement learning H Zhang, J Wang, Z Zhou, W Zhang, Y Wen, Y Yu, W Li Proceedings of the 27th international joint conference on Artificial …, 2017 | 20 | 2017 |
A game-theoretic approach for improving generalization ability of TSP solvers C Wang, Y Yang, O Slumbers, C Han, T Guo, H Zhang, J Wang arXiv preprint arXiv:2110.15105, 2021 | 15 | 2021 |
Managing risk of bidding in display advertising H Zhang, W Zhang, Y Rong, K Ren, W Li, J Wang Proceedings of the Tenth ACM International Conference on Web Search and Data …, 2017 | 15 | 2017 |
Estimating -Rank from A Few Entries with Low Rank Matrix Completion Y Du, X Yan, X Chen, J Wang, H Zhang International Conference on Machine Learning, 2870-2879, 2021 | 12 | 2021 |
Contextual transformer for offline meta reinforcement learning R Lin, Y Li, X Feng, Z Zhang, XHW Fung, H Zhang, J Wang, Y Du, Y Yang arXiv preprint arXiv:2211.08016, 2022 | 10 | 2022 |