default search action
Shiyu Huang 0001
Person information
- unicode name: 黄世宇
- affiliation: Zhipu AI, Beijing, China
- affiliation (2022 - 2024): 4Paradigm Inc., Beijing, China
- affiliation (PhD 2022): Tsinghua University, Department of Computer Science and Technology, China
Other persons with the same name
- Shiyu Huang 0002 — University of Science and Technology of China, CAS Key Laboratory of Technology in Geo-spatial Information Processing and Application System, Hefei, China
- Shiyu Huang 0003 — Beijing University of Technology, Beijing Institute of Artificial Intelligence, Faculty of Information Technology, China
- Shiyu Huang 0004 — Southeast University, School of Civil Engineering, Nanjing, China
- Shiyu Huang 0005 — Dalian Polytechnic University, School of Information Science and Engineering, China
- Shiyu Huang 0006 — Fujian Normal University, College of Computer and Cyber Security, Fuzhou, China
Refine list
refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
2020 – today
- 2024
- [c12]Wentse Chen, Shiyu Huang, Yuan Chiang, Tim Pearce, Wei-Wei Tu, Ting Chen, Jun Zhu:
DGPO: Discovering Multiple Strategies with Diversity-Guided Policy Optimization. AAAI 2024: 11390-11398 - [c11]Junzhe Chen, Xuming Hu, Shuodi Liu, Shiyu Huang, Wei-Wei Tu, Zhaofeng He, Lijie Wen:
LLMArena: Assessing Capabilities of Large Language Models in Dynamic Multi-Agent Environments. ACL (1) 2024: 13055-13077 - [i18]Yiwen Sun, Furong Ye, Xianyin Zhang, Shiyu Huang, Bingzhen Zhang, Ke Wei, Shaowei Cai:
AutoSAT: Automatically Optimize SAT Solvers via Large Language Models. CoRR abs/2402.10705 (2024) - [i17]Junzhe Chen, Xuming Hu, Shuodi Liu, Shiyu Huang, Wei-Wei Tu, Zhaofeng He, Lijie Wen:
LLMArena: Assessing Capabilities of Large Language Models in Dynamic Multi-Agent Environments. CoRR abs/2402.16499 (2024) - [i16]Ziyan Xiong, Bo Chen, Shiyu Huang, Wei-Wei Tu, Zhaofeng He, Yang Gao:
MQE: Unleashing the Power of Interaction with Multi-agent Quadruped Environment. CoRR abs/2403.16015 (2024) - [i15]Weihan Wang, Zehai He, Wenyi Hong, Yean Cheng, Xiaohan Zhang, Ji Qi, Shiyu Huang, Bin Xu, Yuxiao Dong, Ming Ding, Jie Tang:
LVBench: An Extreme Long Video Understanding Benchmark. CoRR abs/2406.08035 (2024) - [i14]Wentse Chen, Shiyu Huang, Jeff Schneider:
Soft-QMIX: Integrating Maximum Entropy For Monotonic Value Function Factorization. CoRR abs/2406.13930 (2024) - [i13]Ruize Zhang, Zelai Xu, Chengdong Ma, Chao Yu, Wei-Wei Tu, Shiyu Huang, Deheng Ye, Wenbo Ding, Yaodong Yang, Yu Wang:
A Survey on Self-play Methods in Reinforcement Learning. CoRR abs/2408.01072 (2024) - [i12]Zhuoyi Yang, Jiayan Teng, Wendi Zheng, Ming Ding, Shiyu Huang, Jiazheng Xu, Yuanming Yang, Wenyi Hong, Xiaohan Zhang, Guanyu Feng, Da Yin, Xiaotao Gu, Yuxuan Zhang, Weihan Wang, Yean Cheng, Ting Liu, Bin Xu, Yuxiao Dong, Jie Tang:
CogVideoX: Text-to-Video Diffusion Models with An Expert Transformer. CoRR abs/2408.06072 (2024) - [i11]Wenyi Hong, Weihan Wang, Ming Ding, Wenmeng Yu, Qingsong Lv, Yan Wang, Yean Cheng, Shiyu Huang, Junhui Ji, Zhao Xue, Lei Zhao, Zhuoyi Yang, Xiaotao Gu, Xiaohan Zhang, Guanyu Feng, Da Yin, Zihan Wang, Ji Qi, Xixuan Song, Peng Zhang, Debing Liu, Bin Xu, Juanzi Li, Yuxiao Dong, Jie Tang:
CogVLM2: Visual Language Models for Image and Video Understanding. CoRR abs/2408.16500 (2024) - 2023
- [j2]Yudeng Lin, Qingtian Zhang, Bin Gao, Jianshi Tang, Peng Yao, Chongxuan Li, Shiyu Huang, Zhengwu Liu, Ying Zhou, Yuyi Liu, Wenqiang Zhang, Jun Zhu, He Qian, Huaqiang Wu:
Uncertainty quantification via a memristor Bayesian deep neural network for risk-sensitive reinforcement learning. Nat. Mac. Intell. 5(7): 714-723 (2023) - [c10]Fanqi Lin, Shiyu Huang, Tim Pearce, Wenze Chen, Wei-Wei Tu:
TiZero: Mastering Multi-Agent Football with Curriculum Learning and Self-Play. AAMAS 2023: 67-76 - [c9]Xinyi Yang, Shiyu Huang, Yiwen Sun, Yuxiang Yang, Chao Yu, Wei-Wei Tu, Huazhong Yang, Yu Wang:
Learning Graph-Enhanced Commander-Executor for Multi-Agent Navigation. AAMAS 2023: 1652-1660 - [c8]Wenze Chen, Shiyu Huang, Yuan Chiang, Ting Chen, Jun Zhu:
DGPO: Discovering Multiple Strategies with Diversity-Guided Policy Optimization. AAMAS 2023: 2634-2636 - [c7]Bill Yuchen Lin, Yicheng Fu, Karina Yang, Faeze Brahman, Shiyu Huang, Chandra Bhagavatula, Prithviraj Ammanabrolu, Yejin Choi, Xiang Ren:
SwiftSage: A Generative Agent with Fast and Slow Thinking for Complex Interactive Tasks. NeurIPS 2023 - [i10]Xinyi Yang, Shiyu Huang, Yiwen Sun, Yuxiang Yang, Chao Yu, Wei-Wei Tu, Huazhong Yang, Yu Wang:
Learning Graph-Enhanced Commander-Executor for Multi-Agent Navigation. CoRR abs/2302.04094 (2023) - [i9]Fanqi Lin, Shiyu Huang, Tim Pearce, Wenze Chen, Wei-Wei Tu:
TiZero: Mastering Multi-Agent Football with Curriculum Learning and Self-Play. CoRR abs/2302.07515 (2023) - [i8]Bill Yuchen Lin, Yicheng Fu, Karina Yang, Prithviraj Ammanabrolu, Faeze Brahman, Shiyu Huang, Chandra Bhagavatula, Yejin Choi, Xiang Ren:
SwiftSage: A Generative Agent with Fast and Slow Thinking for Complex Interactive Tasks. CoRR abs/2305.17390 (2023) - [i7]Fanqi Lin, Shiyu Huang, Weiwei Tu:
Diverse Policies Converge in Reward-free Markov Decision Processe. CoRR abs/2308.11924 (2023) - [i6]Haixu Song, Shiyu Huang, Yinpeng Dong, Wei-Wei Tu:
Robustness and Generalizability of Deepfake Detection: A Study with Diffusion Models. CoRR abs/2309.02218 (2023) - [i5]Shiyu Huang, Wentse Chen, Yiwen Sun, Fuqing Bie, Wei-Wei Tu:
OpenRL: A Unified Reinforcement Learning Framework. CoRR abs/2312.16189 (2023) - 2022
- [j1]Dong Yan, Jiayi Weng, Shiyu Huang, Chongxuan Li, Yichi Zhou, Hang Su, Jun Zhu:
Deep reinforcement learning with credit assignment for combinatorial optimization. Pattern Recognit. 124: 108466 (2022) - [c6]Shiyu Huang, Chao Yu, Bin Wang, Dong Li, Yu Wang, Ting Chen, Jun Zhu:
VMAPD: Generate Diverse Solutions for Multi-Agent Games with Recurrent Trajectory Discriminators. CoG 2022: 9-16 - [c5]Fanqi Lin, Shiyu Huang, Wei-Wei Tu:
Diverse Policies Converge in Reward-Free Markov Decision Processes. PRICAI (1) 2022: 125-136 - [i4]Wenze Chen, Shiyu Huang, Yuan Chiang, Ting Chen, Jun Zhu:
DGPO: Discovering Multiple Strategies with Diversity-Guided Policy Optimization. CoRR abs/2207.05631 (2022) - 2021
- [c4]Shiyu Huang, Bin Wang, Hang Su, Dong Li, Jianye Hao, Jun Zhu, Ting Chen:
Off-Policy Training for Truncated TD(λ) Boosted Soft Actor-Critic. PRICAI (3) 2021: 46-59 - [i3]Shiyu Huang, Bin Wang, Dong Li, Jianye Hao, Ting Chen, Jun Zhu:
Ranking Cost: Building An Efficient and Scalable Circuit Routing Planner with Evolution-Based Optimization. CoRR abs/2110.03939 (2021) - [i2]Shiyu Huang, Wenze Chen, Longfei Zhang, Ziyang Li, Fengming Zhu, Deheng Ye, Ting Chen, Jun Zhu:
TiKick: Towards Playing Multi-agent Football Full Games from Single-agent Demonstrations. CoRR abs/2110.04507 (2021) - 2020
- [c3]Shiyu Huang, Hang Su, Jun Zhu, Ting Chen:
SVQN: Sequential Variational Soft Q-Learning Networks. ICLR 2020
2010 – 2019
- 2019
- [c2]Shiyu Huang, Hang Su, Jun Zhu, Ting Chen:
Combo-Action: Training Agent For FPS Game with Auxiliary Tasks. AAAI 2019: 954-961 - 2017
- [c1]Shiyu Huang, Deva Ramanan:
Expecting the Unexpected: Training Detectors for Unusual Pedestrians with Adversarial Imposters. CVPR 2017: 4664-4673 - [i1]Shiyu Huang, Deva Ramanan:
Recognition in-the-Tail: Training Detectors for Unusual Pedestrians with Synthetic Imposters. CoRR abs/1703.06283 (2017)
Coauthor Index
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.
Unpaywalled article links
Add open access links from to the list of external document links (if available).
Privacy notice: By enabling the option above, your browser will contact the API of unpaywall.org to load hyperlinks to open access articles. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Unpaywall privacy policy.
Archived links via Wayback Machine
For web page which are no longer available, try to retrieve content from the of the Internet Archive (if available).
Privacy notice: By enabling the option above, your browser will contact the API of archive.org to check for archived content of web pages that are no longer available. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Internet Archive privacy policy.
Reference lists
Add a list of references from , , and to record detail pages.
load references from crossref.org and opencitations.net
Privacy notice: By enabling the option above, your browser will contact the APIs of crossref.org, opencitations.net, and semanticscholar.org to load article reference information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Crossref privacy policy and the OpenCitations privacy policy, as well as the AI2 Privacy Policy covering Semantic Scholar.
Citation data
Add a list of citing articles from and to record detail pages.
load citations from opencitations.net
Privacy notice: By enabling the option above, your browser will contact the API of opencitations.net and semanticscholar.org to load citation information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the OpenCitations privacy policy as well as the AI2 Privacy Policy covering Semantic Scholar.
OpenAlex data
Load additional information about publications from .
Privacy notice: By enabling the option above, your browser will contact the API of openalex.org to load additional information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the information given by OpenAlex.
last updated on 2024-11-20 21:01 CET by the dblp team
all metadata released as open data under CC0 1.0 license
see also: Terms of Use | Privacy Policy | Imprint