Baoxiang Wang 0001
Person information
- affiliation: Chinese University of Hong Kong, Department of Computer Science and Engineering, Shenzhen, China
- affiliation: Shenzhen Institute of Artificial Intelligence and Robotics for Society, China
- affiliation (former): Borealis AI, Edmonton, AB, Canada
Other persons with the same name
- Baoxiang Wang 0002 — National Key Laboratory of Science and Technology on C4ISR, Nanjing, China
- Baoxiang Wang 0003 — North University of China, School of Mechanical and Power Engineering, Taiyuan, China
- Baoxiang Wang 0004 — Double Coin Group (Jiangsu) Tyre Co. Ltd, Rugao, China
- Baoxiang Wang 0005 — Xi'an Jiaotong University, School of Mechanical Engineering, State Key Laboratory for Manufacturing Systems Engineering, China
- Baoxiang Wang 0006 — China University of Petroleum (East China), College of Oceanography and Space Informatics, Qingdao, China
2020 – today
- 2024
- [j4] Fang Kong, Xiangcheng Zhang, Baoxiang Wang, Shuai Li: Improved Regret Bounds for Linear Adversarial MDPs via Linear Optimization. Trans. Mach. Learn. Res. 2024 (2024)
- [j3] Dandan Guo, Chaojie Wang, Baoxiang Wang, Hongyuan Zha: Learning Fair Representations via Distance Correlation Minimization. IEEE Trans. Neural Networks Learn. Syst. 35(2): 2139-2152 (2024)
- [c28] Jiawei Xu, Cheng Zhou, Yizheng Zhang, Baoxiang Wang, Lei Han: Relative Policy-Transition Optimization for Fast Policy Transfer. AAAI 2024: 16164-16172
- [c27] Jing Dong, Baoxiang Wang, Yaoliang Yu: Convergence to Nash Equilibrium and No-regret Guarantee in (Markov) Potential Games. AISTATS 2024: 2044-2052
- [c26] Ruinan Jin, Shuai Li, Baoxiang Wang: On Stationary Point Convergence of PPO-Clip. ICLR 2024
- [c25] Han Wang, Wenhao Li, Hongyuan Zha, Baoxiang Wang: Carbon Market Simulation with Adaptive Mechanism Design. IJCAI 2024: 8824-8828
- [i36] Jing Dong, Baoxiang Wang, Yaoliang Yu: Convergence to Nash Equilibrium and No-regret Guarantee in (Markov) Potential Games. CoRR abs/2404.06516 (2024)
- [i35] Han Wang, Wenhao Li, Hongyuan Zha, Baoxiang Wang: Carbon Market Simulation with Adaptive Mechanism Design. CoRR abs/2406.07875 (2024)
- [i34] Jiawei Xu, Rui Yang, Feng Luo, Meng Fang, Baoxiang Wang, Lei Han: Robust Decision Transformer: Tackling Data Corruption in Offline RL via Sequence Modeling. CoRR abs/2407.04285 (2024)
- [i33] Jing Dong, Baoxiang Wang, Yaoliang Yu: Uncoupled and Convergent Learning in Monotone Games under Bandit Feedback. CoRR abs/2408.08395 (2024)
- [i32] Huanjian Zhou, Baoxiang Wang, Masashi Sugiyama: Adaptive complexity of log-concave sampling. CoRR abs/2408.13045 (2024)
- [i31] Ruinan Jin, Xiaoyu Wang, Baoxiang Wang: Asymptotic and Non-Asymptotic Convergence Analysis of AdaGrad for Non-Convex Optimization via Novel Stopping Time-based Analysis. CoRR abs/2409.05023 (2024)
- [i30] Ruinan Jin, Xiao Li, Yaoliang Yu, Baoxiang Wang: A Comprehensive Framework for Analyzing the Convergence of Adam: Bridging the Gap with SGD. CoRR abs/2410.04458 (2024)
- [i29] Jing Dong, Baoxiang Wang, Yaoliang Yu: Last-iterate Convergence in Regularized Graphon Mean Field Game. CoRR abs/2410.08746 (2024)
- 2023
- [j2] Shanchao Yang, Kaili Ma, Baoxiang Wang, Tianshu Yu, Hongyuan Zha: Learning to Boost Resilience of Complex Networks via Neural Edge Rewiring. Trans. Mach. Learn. Res. 2023 (2023)
- [c24] Qi Tian, Kun Kuang, Furui Liu, Baoxiang Wang: Learning from Good Trajectories in Offline Multi-Agent Reinforcement Learning. AAAI 2023: 11672-11680
- [c23] Wenhao Li, Baoxiang Wang, Shanchao Yang, Hongyuan Zha: Diverse Policy Optimization for Structured Action Space. AAMAS 2023: 819-828
- [c22] Fang Kong, Jize Xie, Baoxiang Wang, Tao Yao, Shuai Li: Online Influence Maximization under Decreasing Cascade Model. AAMAS 2023: 2197-2204
- [c21] Jing Dong, Li Shen, Yinggan Xu, Baoxiang Wang: Provably Efficient Convergence of Primal-Dual Actor-Critic with Nonlinear Function Approximation. AAMAS 2023: 2640-2642
- [c20] Canzhe Zhao, Ruofeng Yang, Baoxiang Wang, Shuai Li: Learning Adversarial Linear Mixture Markov Decision Processes with Bandit Feedback and Unknown Transition. ICLR 2023
- [c19] Canzhe Zhao, Yanjie Ze, Jing Dong, Baoxiang Wang, Shuai Li: DPMAC: Differentially Private Communication for Cooperative Multi-Agent Reinforcement Learning. IJCAI 2023: 4638-4646
- [c18] Jiahui Li, Kun Kuang, Baoxiang Wang, Xingchen Li, Fei Wu, Jun Xiao, Long Chen: Two Heads are Better Than One: A Simple Exploration Framework for Efficient Multi-Agent Reinforcement Learning. NeurIPS 2023
- [c17] Yue Lin, Wenhao Li, Hongyuan Zha, Baoxiang Wang: Information Design in Multi-Agent Reinforcement Learning. NeurIPS 2023
- [c16] Canzhe Zhao, Ruofeng Yang, Baoxiang Wang, Xuezhou Zhang, Shuai Li: Learning Adversarial Low-rank Markov Decision Processes with Unknown Transition and Full-information Feedback. NeurIPS 2023
- [c15] Canzhe Zhao, Yanjie Ze, Jing Dong, Baoxiang Wang, Shuai Li: Differentially Private Temporal Difference Learning with Stochastic Nonconvex-Strongly-Concave Optimization. WSDM 2023: 985-993
- [i28] Fang Kong, Xiangcheng Zhang, Baoxiang Wang, Shuai Li: Improved Regret Bounds for Linear Adversarial MDPs via Linear Optimization. CoRR abs/2302.06834 (2023)
- [i27] Wenhao Li, Baoxiang Wang, Shanchao Yang, Hongyuan Zha: Diverse Policy Optimization for Structured Action Space. CoRR abs/2302.11917 (2023)
- [i26] Yue Lin, Wenhao Li, Hongyuan Zha, Baoxiang Wang: Information Design in Multi-Agent Reinforcement Learning. CoRR abs/2305.06807 (2023)
- [i25] Wenhao Li, Dan Qiao, Baoxiang Wang, Xiangfeng Wang, Bo Jin, Hongyuan Zha: Semantically Aligned Task Decomposition in Multi-Agent Reinforcement Learning. CoRR abs/2305.10865 (2023)
- [i24] Fang Kong, Jize Xie, Baoxiang Wang, Tao Yao, Shuai Li: Online Influence Maximization under Decreasing Cascade Model. CoRR abs/2305.15428 (2023)
- [i23] Jingwei Li, Jing Dong, Baoxiang Wang, Jingzhao Zhang: Online Control with Adversarial Disturbance for Continuous-time Linear Systems. CoRR abs/2306.01952 (2023)
- [i22] Jing Dong, Jingyu Wu, Siwei Wang, Baoxiang Wang, Wei Chen: Taming the Exponential Action Set: Sublinear Regret and Fast Convergence to Nash Equilibrium in Online Congestion Games. CoRR abs/2306.13673 (2023)
- [i21] Canzhe Zhao, Yanjie Ze, Jing Dong, Baoxiang Wang, Shuai Li: DPMAC: Differentially Private Communication for Cooperative Multi-Agent Reinforcement Learning. CoRR abs/2308.09902 (2023)
- [i20] Canzhe Zhao, Ruofeng Yang, Baoxiang Wang, Xuezhou Zhang, Shuai Li: Learning Adversarial Low-rank Markov Decision Processes with Unknown Transition and Full-information Feedback. CoRR abs/2311.07876 (2023)
- 2022
- [j1] Jing Dong, Shiji Zhou, Baoxiang Wang, Han Zhao: Algorithms and Theory for Supervised Gradual Domain Adaptation. Trans. Mach. Learn. Res. 2022 (2022)
- [c14] Kun Wang, Jing Dong, Baoxiang Wang, Shuai Li: Cascading Bandit Under Differential Privacy. ICASSP 2022: 4418-4422
- [c13] Jiahui Li, Kun Kuang, Baoxiang Wang, Furui Liu, Long Chen, Changjie Fan, Fei Wu, Jun Xiao: Deconfounded Value Decomposition for Multi-Agent Reinforcement Learning. ICML 2022: 12843-12856
- [c12] Jing Dong, Ke Li, Shuai Li, Baoxiang Wang: Combinatorial Bandits under Strategic Manipulations. WSDM 2022: 219-229
- [i19] Canzhe Zhao, Yanjie Ze, Jing Dong, Baoxiang Wang, Shuai Li: Differentially Private Temporal Difference Learning with Stochastic Nonconvex-Strongly-Concave Optimization. CoRR abs/2201.10447 (2022)
- [i18] Jing Dong, Li Shen, Yinggan Xu, Baoxiang Wang: Provably Efficient Convergence of Primal-Dual Actor-Critic with Nonlinear Function Approximation. CoRR abs/2202.13863 (2022)
- [i17] Jing Dong, Shiji Zhou, Baoxiang Wang, Han Zhao: Algorithms and Theory for Supervised Gradual Domain Adaptation. CoRR abs/2204.11644 (2022)
- [i16] Jing Dong, Jingwei Li, Baoxiang Wang, Jingzhao Zhang: Online Policy Optimization for Robust MDP. CoRR abs/2209.13841 (2022)
- [i15] Qi Tian, Kun Kuang, Furui Liu, Baoxiang Wang: Learning From Good Trajectories in Offline Multi-Agent Reinforcement Learning. CoRR abs/2211.15612 (2022)
- 2021
- [c11] Jiahui Li, Kun Kuang, Baoxiang Wang, Furui Liu, Long Chen, Fei Wu, Jun Xiao: Shapley Counterfactual Credits for Multi-Agent Reinforcement Learning. KDD 2021: 934-942
- [i14] Jing Dong, Ke Li, Shuai Li, Baoxiang Wang: Combinatorial Bandits under Strategic Manipulations. CoRR abs/2102.12722 (2021)
- [i13] Kun Wang, Jing Dong, Baoxiang Wang, Shuai Li, Shuo Shao: Cascading Bandit under Differential Privacy. CoRR abs/2105.11126 (2021)
- [i12] Jiahui Li, Kun Kuang, Baoxiang Wang, Furui Liu, Long Chen, Fei Wu, Jun Xiao: Shapley Counterfactual Credits for Multi-Agent Reinforcement Learning. CoRR abs/2106.00285 (2021)
- [i11] Baoxiang Wang, Huanjian Zhou: Multilinear extension of k-submodular functions. CoRR abs/2107.07103 (2021)
- [i10] Jing Dong, Shuai Li, Baoxiang Wang: Incentivizing an Unknown Crowd. CoRR abs/2109.04226 (2021)
- [i9] Shanchao Yang, Kaili Ma, Baoxiang Wang, Hongyuan Zha: Edge Rewiring Goes Neural: Boosting Network Resilience via Policy Gradient. CoRR abs/2110.09035 (2021)
- [i8] Qi Tian, Kun Kuang, Baoxiang Wang, Furui Liu, Fei Wu: Multi-agent Communication with Graph Information Bottleneck under Limited Bandwidth. CoRR abs/2112.10374 (2021)
- 2020
- [c10] Baoxiang Wang, Shuai Li, Jiajin Li, Siu On Chan: The Gambler's Problem and Beyond. ICLR 2020
- [c9] Andrej Bogdanov, Baoxiang Wang: Learning and Testing Variable Partitions. ITCS 2020: 37:1-37:22
- [i7] Baoxiang Wang, Shuai Li, Jiajin Li, Siu On Chan: The Gambler's Problem and Beyond. CoRR abs/2001.00102 (2020)
- [i6] Andrej Bogdanov, Baoxiang Wang: Learning and Testing Variable Partitions. CoRR abs/2003.12990 (2020)
2010 – 2019
- 2019
- [c8] Baoxiang Wang, Tongfang Sun, Xianjun Sam Zheng: Beyond Winning and Losing: Modeling Human Motivations and Behaviors with Vector-Valued Inverse Reinforcement Learning. AIIDE 2019: 195-201
- [c7] Baoxiang Wang: Recurrent Existence Determination Through Policy Optimization. IJCAI 2019: 3656-3662
- [c6] Kenny Young, Baoxiang Wang, Matthew E. Taylor: Metatrace Actor-Critic: Online Step-Size Tuning by Meta-gradient Descent for Reinforcement Learning Control. IJCAI 2019: 4185-4191
- [c5] Baoxiang Wang, Nidhi Hegde: Privacy-Preserving Q-Learning with Functional Noise in Continuous Spaces. NeurIPS 2019: 11323-11333
- [i5] Baoxiang Wang, Nidhi Hegde: Private Q-Learning with Functional Noise in Continuous Spaces. CoRR abs/1901.10634 (2019)
- [i4] Baoxiang Wang: Recurrent Existence Determination Through Policy Optimization. CoRR abs/1905.13551 (2019)
- 2018
- [c4] Jiajin Li, Baoxiang Wang: Policy Optimization with Second-Order Advantage Information. ICLR (Workshop) 2018
- [c3] Jiajin Li, Baoxiang Wang, Shengyu Zhang: Policy Optimization with Second-Order Advantage Information. IJCAI 2018: 5038-5044
- [i3] Jiajin Li, Baoxiang Wang: Policy Optimization with Second-Order Advantage Information. CoRR abs/1805.03586 (2018)
- [i2] Kenny Young, Baoxiang Wang, Matthew E. Taylor: Metatrace: Online Step-size Tuning by Meta-gradient Descent for Reinforcement Learning Control. CoRR abs/1805.04514 (2018)
- [i1] Baoxiang Wang, Tongfang Sun, Xianjun Sam Zheng: Beyond Winning and Losing: Modeling Human Motivations and Behaviors Using Inverse Reinforcement Learning. CoRR abs/1807.00366 (2018)
- 2016
- [c2] Shuai Li, Baoxiang Wang, Shengyu Zhang, Wei Chen: Contextual Combinatorial Cascading Bandits. ICML 2016: 1245-1253
- 2015
- [c1] Cuiyun Gao, Baoxiang Wang, Pinjia He, Jieming Zhu, Yangfan Zhou, Michael R. Lyu: PAID: Prioritizing app issues for developers by tracking user reviews over versions. ISSRE 2015: 35-45