Shangding Gu
2020 – today

2024
- [j7] Jiayi Guan, Shangding Gu, Zhijun Li, Jing Hou, Yiqin Yang, Guang Chen, Changjun Jiang:
  UAC: Offline Reinforcement Learning With Uncertain Action Constraint. IEEE Trans. Cogn. Dev. Syst. 16(2): 671-680 (2024)
- [j6] Jing Hou, Guang Chen, Zhijun Li, Wei He, Shangding Gu, Alois Knoll, Changjun Jiang:
  Hybrid Residual Multiexpert Reinforcement Learning for Spatial Scheduling of High-Density Parking Lots. IEEE Trans. Cybern. 54(5): 2771-2783 (2024)
- [j5] Shangding Gu, Dianye Huang, Muning Wen, Guang Chen, Alois Knoll:
  Safe Multiagent Learning With Soft Constrained Policy Optimization in Real Robot Control. IEEE Trans. Ind. Informatics 20(9): 10706-10716 (2024)
- [c3] Shangding Gu, Bilgehan Sel, Yuhao Ding, Lu Wang, Qingwei Lin, Ming Jin, Alois Knoll:
  Balance Reward and Safety Optimization for Safe Reinforcement Learning: A Perspective of Gradient Manipulation. AAAI 2024: 21099-21106
- [i15] Shangding Gu:
  Mutual Enhancement of Large Language and Reinforcement Learning Models through Bi-Directional Feedback Mechanisms: A Case Study. CoRR abs/2401.06603 (2024)
- [i14] Shangding Gu, Alois Knoll, Ming Jin:
  TeaMs-RL: Teaching LLMs to Teach Themselves Better Instructions via Reinforcement Learning. CoRR abs/2403.08694 (2024)
- [i13] Shangding Gu, Bilgehan Sel, Yuhao Ding, Lu Wang, Qingwei Lin, Ming Jin, Alois Knoll:
  Balance Reward and Safety Optimization for Safe Reinforcement Learning: A Perspective of Gradient Manipulation. CoRR abs/2405.01677 (2024)
- [i12] Shangding Gu, Bilgehan Sel, Yuhao Ding, Lu Wang, Qingwei Lin, Alois Knoll, Ming Jin:
  Safe and Balanced: A Framework for Constrained Multi-Objective Reinforcement Learning. CoRR abs/2405.16390 (2024)
- [i11] Zhi Zheng, Shangding Gu:
  Safe Multi-Agent Reinforcement Learning with Bilevel Optimization in Autonomous Driving. CoRR abs/2405.18209 (2024)
- [i10] Shangding Gu, Laixi Shi, Yuhao Ding, Alois Knoll, Costas J. Spanos, Adam Wierman, Ming Jin:
  Enhancing Efficiency of Safe Reinforcement Learning via Sample Manipulation. CoRR abs/2405.20860 (2024)
- [i9] Ruiqi Zhang, Jing Hou, Florian Walter, Shangding Gu, Jiayi Guan, Florian Röhrbein, Yali Du, Panpan Cai, Guang Chen, Alois Knoll:
  Multi-Agent Reinforcement Learning for Autonomous Driving: A Survey. CoRR abs/2408.09675 (2024)

2023
- [j4] Shangding Gu, Jakub Grudzien Kuba, Yuanpei Chen, Yali Du, Long Yang, Alois C. Knoll, Yaodong Yang:
  Safe multi-agent reinforcement learning for multi-robot control. Artif. Intell. 319: 103905 (2023)
- [j3] Shangding Gu, Alap Kshirsagar, Yali Du, Guang Chen, Jan Peters, Alois Knoll:
  A human-centered safe robot reinforcement learning framework with interactive behaviors. Frontiers Neurorobotics 17 (2023)
- [i8] Shangding Gu, Alap Kshirsagar, Yali Du, Guang Chen, Yaodong Yang, Jan Peters, Alois C. Knoll:
  A Human-Centered Safe Robot Reinforcement Learning Framework with Interactive Behaviors. CoRR abs/2302.13137 (2023)
- [i7] Jaafar Mhamed, Shangding Gu:
  SCPO: Safe Reinforcement Learning with Safety Critic Policy Optimization. CoRR abs/2311.00880 (2023)
- [i6] Jing Hou, Guang Chen, Ruiqi Zhang, Zhijun Li, Shangding Gu, Changjun Jiang:
  Spreeze: High-Throughput Parallel Reinforcement Learning Framework. CoRR abs/2312.06126 (2023)

2022
- [j2] Shangding Gu, Guang Chen, Lijun Zhang, Jing Hou, Yingbai Hu, Alois C. Knoll:
  Constrained Reinforcement Learning for Vehicle Motion Planning with Topological Reachability Analysis. Robotics 11(4): 81 (2022)
- [j1] Tianpei Zou, Guang Chen, Zhijun Li, Wei He, Sanqing Qu, Shangding Gu, Alois C. Knoll:
  KAM-Net: Keypoint-Aware and Keypoint-Matching Network for Vehicle Detection From 2-D Point Cloud. IEEE Trans. Artif. Intell. 3(2): 207-217 (2022)
- [i5] Man Zhu, Changshi Xiao, Shangding Gu, Zhe Du, Yuanqiao Wen:
  A Circle Grid-based Approach for Obstacle Avoidance Motion Planning of Unmanned Surface Vehicles. CoRR abs/2202.04494 (2022)
- [i4] Shangding Gu, Long Yang, Yali Du, Guang Chen, Florian Walter, Jun Wang, Yaodong Yang, Alois C. Knoll:
  A Review of Safe Reinforcement Learning: Methods, Theory and Applications. CoRR abs/2205.10330 (2022)

2021
- [c2] Jakub Grudzien Kuba, Muning Wen, Linghui Meng, Shangding Gu, Haifeng Zhang, David Mguni, Jun Wang, Yaodong Yang:
  Settling the Variance of Multi-Agent Policy Gradients. NeurIPS 2021: 13458-13470
- [i3] Jakub Grudzien Kuba, Muning Wen, Yaodong Yang, Linghui Meng, Shangding Gu, Haifeng Zhang, David Henry Mguni, Jun Wang:
  Settling the Variance of Multi-Agent Policy Gradients. CoRR abs/2108.08612 (2021)
- [i2] Shangding Gu, Jakub Grudzien Kuba, Muning Wen, Ruiqing Chen, Ziyan Wang, Zheng Tian, Jun Wang, Alois C. Knoll, Yaodong Yang:
  Multi-Agent Constrained Policy Optimisation. CoRR abs/2110.02793 (2021)

2020
- [c1] Fan Lu, Guang Chen, Jinhu Dong, Xiaoding Yuan, Shangding Gu, Alois C. Knoll:
  Pole-based Localization for Autonomous Vehicles in Urban Scenarios Using Local Grid Map-based Method. ICARM 2020: 640-645
- [i1] Chunhui Zhou, Shangding Gu, Yuanqiao Wen, Zhe Du, Changshi Xiao, Liang Huang, Man Zhu:
  The Review Unmanned Surface Vehicle Path Planning: Based on Multi-modality Constraint. CoRR abs/2007.01691 (2020)
last updated on 2024-10-04 20:59 CEST by the dblp team
all metadata released as open data under CC0 1.0 license