


default search action
Yang Yu 0001
Person information
- affiliation (PhD 2011): Nanjing University, State Key Laboratory for Novel Software Technology, China
- affiliation: Pazhou Lab, Guangzhou, China
Other persons with the same name
- Yang Yu — disambiguation page
- Yang Yu 0002
— University of Technology Sydney, Faculty of Engineering and Information Technology, NSW, Australia (and 1 more)
- Yang Yu 0003
— North China Electric Power University, State Key Laboratory of Alternate Electrical Power System with Renewable Energy Sources, Baoding, China
- Yang Yu 0004
— Rochester Institute of Technology, Saunders College of Business, Rochester, NY, USA (and 1 more)
- Yang Yu 0005
— Jiangsu University of Technology, School of Electric Information Engineering, Changzhou, China
- Yang Yu 0006
— National University of Defense Technology, College of Electrical Science and Engineering, National Key Laboratory of Science and Technology on ATR, Changsha, China
- Yang Yu 0007
— National University of Defense Technology, College of Computer, Changsha, China
- Yang Yu 0008
— Tsinghua University, Department of Computer Science and Technology, Beijing, China (and 1 more)
- Yang Yu 0009 — Motorola Labs, Schaumburg, IL, USA (and 1 more)
- Yang Yu 0010
— Rutgers University, Department of Computer Science, Piscataway, NJ, USA
- Yang Yu 0011
— Tsinghua University, Institute for Interdisciplinary Information Sciences, Beijing, China (and 1 more)
- Yang Yu 0012 — University of Sheffield, UK
- Yang Yu 0013
— Nanjing University of Posts and Telecommunications, College of Automation / College of Artificial Intelligence, China (and 1 more)
- Yang Yu 0014
— National University of Defense Technology, College of Intelligence Science and Technology, Changsha, China (and 1 more)
- Yang Yu 0015
— Harbin Institute of Technology, Department of Automatic Test and Control, Harbin, China
- Yang Yu 0016
— Northeastern University, College of Information Science and Engineering, Shenyang, China
- Yang Yu 0017
— Changchun University of Technology, School of Mechatronic Engineering, Changchun, China
- Yang Yu 0018
— Harbin Jiancheng Group Company, Harbin, China
- Yang Yu 0019
— Shanghai Jiao Tong University, School of Mechanical Engineering, State Key Laboratory of Mechanical System and Vibration, Shanghai, China
- Yang Yu 0020
— Tongji University, State Key Laboratory of Marine Geology, Shanghai, China
- Yang Yu 0021
— University of Technology Sydney, School of Civil and Environmental Engineering, Sydney, Australia
- Yang Yu 0022
— Hebei University of Technology, School of Computer Science and Engineering, Tianjin, China
- Yang Yu 0023
— Wuhan University, School of Urban Design, Department of Urban Planning, Wuhan, China
- Yang Yu 0024
— Tongji University, Department of Control Science and Engineering, Shanghai, China
- Yang Yu 0025 — Rutgers University, Department of Mathematics, Piscataway, NJ, USA
- Yang Yu 0026
— China Agricultural University, College of Engineering, Beijing, China
- Yang Yu 0027
— Sun Yat-sen University, School of Data and Computer Science, Guangzhou, China
- Yang Yu 0028
— Hong Kong University of Science and Technology, Department of Electronic and Computer Engineering, Robotics and Multi-Perception Laborotary, Hong Kong
- Yang Yu 0029 — Google, Mountain View, CA, USA (and 3 more)
- Yang Yu 0030
— Tianjin University, College of Intelligence and Computing, China
- Yang Yu 0031
— Southwest Forestry University, School of Machinery and Transportation, Kunming, China (and 1 more)
- Yang Yu 0032 — National University of Defense Technology, Center of Material Science, College of Liberal Arts and Sciences, College of Advanced Interdisciplinary Studies, College of Sciences, Changsha, China
- Yang Yu 0033
— Guizhou Medical University, School of Biology and Engineering, Guiyang, China (and 1 more)
- Yang Yu 0034 — Nanjing University of Posts and Telecommunications, College of Communication & Information Engineering, China
- Yang Yu 0035
— Royal Institute of Technology, Stockholm, Sweden
- Yang Yu 0036
— University of Duisburg-Essen, Germany
- Yang Yu 0037
— Auckland University of Technology, Institute of Biomedical Technologies, New Zealand
- Yang Yu 0038
— University of Science and Technology of China, State Key Laboratory of Cognitive Intelligence, Hefei, China
- Yang Yu 0039
— Beijing Jiaotong University, Institute of Information Science, Beijing Key Laboratory of Advanced Information Science and Network Technology, Beijing, China
- Yang Yu 0040
— Northwestern Polytechnical University, School of Marine Science and Technology, Xi'an, China
- Yang Yu 0041 — Shanghai Jiao Tong University, Department of Electronic Engineering, Network Coding and Transmission Laboratory, Shanghai, China (and 1 more)
- Yang Yu 0042
— Kookmin University, Department of Computer Science, Seoul, South Korea
- Yang Yu 0043
— Qingdao University, School of Automation, Shandong Key Laboratory of Industrial Control Technology, Qingdao, China
- Yang Yu 0044
— Zhengzhou University of Light Industry, Software Engineering College, Zhengzhou, China
- Yang Yu 0045
— Chinese Academy of Sciences, Shanghai Institute of Technical Physics, Key Laboratory of Infrared System Detecting and Imaging Technology, Shanghai, China
- Yang Yu 0046
— Japan Advanced Institute of Science and Technology (JAIST), School of Knowledge Science, Nomi, Japan
- Yang Yu 0047
— Hubei Three Gorges Polytechnic, Electronic Information School, Yichang, China
- Yang Yu 0048
— Northwestern Polytechnical University, School of Electronics and Information, Xi'an, China
- Yang Yu 0049
— Lanzhou Jiaotong University, School of Traffic and Transportation, Lanzhou, China
- Yang Yu 0050
— Shanghai Jiao Tong University, Antai College of Economics and Management, Shanghai, China
- Yang Yu 0051
— Tianjin University, Tianjin Key Laboratory of Port and Ocean Engineering, State Key Laboratory of Hydraulic Engineering Simulation and Safety, Tianjin, China
- Yang Yu 0052 — Purdue University, Department of Statistics, West Lafayette, IN, USA
- Yang Yu 0053 — University of North Carolina at Chapel Hill, Department of Statistics and Operations Research, Chapel Hill, NC, USA
- Yang Yu 0054
— Halliburton Ltd, Singapore (and 1 more)
- Yang Yu 0055
— Taylor Hobson Ltd. AMETEK Ultra Precision Technologies, Leicester, UK (and 1 more)
- Yang Yu 0056
— University of Chinese Academy of Sciences, School of Artificial Intelligence, Beijing, China (and 1 more)
- Yang Yu 0057
— Qilu University of Technology, Shandong Computer Science Center, Shandong Provincial Key Laboratory of Computer Networks, Jinan, China
- Yang Yu 0058
— Beijing Jiaotong University, Institute of Data Science and Intelligent Decision Support, Beijing, China (and 2 more)
- Yang Yu 0059
— Liaoning Institute of Science and Engineering, School of Management Engineering, Jinzhou, China
- Yang Yu 0060
— Wuhan University of Technology, School of Information Engineering, Wuhan, China
- Yang Yu 0061
— Nanjing University of Posts and Telecommunications, Institute of Signal Processing Transmission, Nanjing, China
- Yang Yu 0062
— University of California Davis, Department of Land Air and Water Resources, Davis, CA, USA (and 1 more)
- Yang Yu 0063
— Pennsylvania State University, Department of Architectural Engineering, University Park, PA, USA
- Yang Yu 0064
— Beihang University, School of Aeronautic Science and Engineering, Beijing, China
- Yang Yu 0065
— Shanghai Conservatory of Music, Shanghai Key Laboratory for Music Acoustic, Shanghai, China
- Yang Yu 0066
— Semiconductor Manufacturing International Corporation, R&D Department, Shanghai, China
- Yang Yu 0067
— Tianjin University of Science and Technology, College of Artificial Intelligence, Tianjin, China
- Yang Yu 0068
— Chinese Academy of Sciences, National Space Science Center, Key Laboratory of Microwave Remote Sensing, Beijing, China (and 3 more)
- Yang Yu 0069
— Northwestern Polytechnical University, School of Astronautics, National Key Laboratory of Aerospace Flight Dynamics, Xi'an, China
- Yang Yu 0070
— Chinese University of Hong Kong, Department of Computer Science and Engineering, Hong Kong
- Yang Yu 0071
— Weifang People's Hospital, Department of Stomatology, Weifang, China
- Yang Yu 0072
— Shandong University of Technology, School of Transportation and Vehicle Engineering, Zibo, China
- Yang Yu 0073
— Wuhan University, School of Cyber Science and Engineering, Key Laboratory of Aerospace Information Security and Trusted Computing, Wuhan, China
- Yang Yu 0074
— Victoria University of Wellington, School of Marketing and International Business, Wellington, New Zealand
- Yang Yu 0075 — Victoria University of Wellington, School of Engineering and Computer Science, Wellington, New Zealand
- Yang Yu 0076
— Jilin Communications Polytechnic, Department of Physical Education, Changchun, China
- Yang Yu 0077
— Shenyang Aerospace University, School of Automation, Shenyang, China
- Yang Yu 0078
— Northwestern University, Department of Statistics, Evanston, IL, USA
Refine list

refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
2020 – today
- 2025
- [j47]Cong Guan, Ke Xue, Chunpeng Fan, Feng Chen, Lichao Zhang, Lei Yuan, Chao Qian, Yang Yu:
Open and real-world human-AI coordination by heterogeneous training with communication. Frontiers Comput. Sci. 19(4): 194314 (2025) - [j46]Zhengmao Zhu, Hong-Long Tian, Xionghui Chen, Kun Zhang, Yang Yu:
Offline model-based reinforcement learning with causal structured world models. Frontiers Comput. Sci. 19(4): 194347 (2025) - [j45]Cong Guan, Tao Jiang, Yi-Chen Li, Zongzhang Zhang, Lei Yuan, Yang Yu:
Constraining an Unconstrained Multi-agent Policy with offline data. Neural Networks 186: 107253 (2025) - [i98]Chen-Xiao Gao, Chenyang Wu, Mingjun Cao, Chenjun Xiao, Yang Yu, Zongzhang Zhang:
Behavior-Regularized Diffusion Policy Optimization for Offline Reinforcement Learning. CoRR abs/2502.04778 (2025) - 2024
- [j44]Fan-Ming Luo, Tian Xu, Hang Lai, Xiong-Hui Chen, Weinan Zhang, Yang Yu:
A survey on model-based reinforcement learning. Sci. China Inf. Sci. 67(2) (2024) - [j43]Chengxing Jia, Fuxiang Zhang, Tian Xu, Jing-Cheng Pang, Zongzhang Zhang, Yang Yu:
Model gradient: unified model and policy learning in model-based reinforcement learning. Frontiers Comput. Sci. 18(4): 184339 (2024) - [j42]Lei Yuan, Feng Chen, Zongzhang Zhang, Yang Yu:
Communication-robust multi-agent learning by adaptable auxiliary multi-agent adversary generation. Frontiers Comput. Sci. 18(6) (2024) - [j41]Ruo-Ze Liu
, Yanjie Shen, Yang Yu
, Tong Lu
:
Revisiting of AlphaStar. IEEE Trans. Games 16(2): 317-330 (2024) - [j40]Zijian Zhang
, Xin Lu
, Meng Li
, Jincheng An
, Yang Yu
, Hao Yin
, Liehuang Zhu
, Yong Liu
, Jiamou Liu
, Bakh Khoussainov
:
A Blockchain-Based Privacy-Preserving Scheme for Sealed-Bid Auction. IEEE Trans. Dependable Secur. Comput. 21(5): 4668-4683 (2024) - [j39]Ming Yang, Yiming Wang, Yang Yu
, Mingliang Zhou
, Leong Hou U
:
MixLight: Mixed-Agent Cooperative Reinforcement Learning for Traffic Light Control. IEEE Trans. Ind. Informatics 20(2): 2653-2661 (2024) - [j38]Zhengbang Zhu
, Rongjun Qin
, Junjie Huang
, Xinyi Dai
, Yang Yu
, Yong Yu
, Weinan Zhang
:
Understanding or Manipulation: Rethinking Online Performance Gains of Modern Recommender Systems. ACM Trans. Inf. Syst. 42(4): 90:1-90:32 (2024) - [c143]Chao Chen, Jiacheng Xu, Weijian Liao, Hao Ding, Zongzhang Zhang, Yang Yu, Rui Zhao:
Focus-Then-Decide: Segmentation-Assisted Reinforcement Learning. AAAI 2024: 11240-11248 - [c142]Chenxiao Gao, Chenyang Wu
, Mingjun Cao
, Rui Kong, Zongzhang Zhang, Yang Yu:
ACT: Empowering Decision Transformer with Dynamic Programming via Advantage Conditioning. AAAI 2024: 12127-12135 - [c141]Haoxin Lin, Hongqiu Wu, Jiaji Zhang, Yihao Sun, Junyin Ye, Yang Yu:
Episodic Return Decomposition by Difference of Implicitly Assigned Sub-trajectory Reward. AAAI 2024: 13808-13816 - [c140]Renzhe Zhou, Chenxiao Gao, Zongzhang Zhang, Yang Yu:
Generalizable Task Representation Learning for Offline Meta-Reinforcement Learning with Data Limitations. AAAI 2024: 17132-17140 - [c139]Chao Chen, Dawei Wang, Feng Mao, Jiacheng Xu, Zongzhang Zhang, Yang Yu:
Deep Anomaly Detection via Active Anomaly Search. AAMAS 2024: 308-316 - [c138]Ruifeng Chen, Xu-Hui Liu, Tian-Shuo Liu, Shengyi Jiang, Feng Xu, Yang Yu:
Foresight Distribution Adjustment for Off-policy Reinforcement Learning. AAMAS 2024: 317-325 - [c137]Cong Guan, Ruiqi Xue, Ziqian Zhang, Lihe Li, Yi-Chen Li, Lei Yuan, Yang Yu:
Cost-aware Offline Safe Meta Reinforcement Learning with Robust In-Distribution Online Task Adaptation. AAMAS 2024: 743-751 - [c136]Chengxing Jia, Fuxiang Zhang, Yi-Chen Li, Chenxiao Gao, Xu-Hui Liu, Lei Yuan, Zongzhang Zhang, Yang Yu:
Disentangling Policy from Offline Task Representation Learning via Adversarial Data Augmentation. AAMAS 2024: 944-953 - [c135]Chengxing Jia, Chenxiao Gao, Hao Yin, Fuxiang Zhang, Xiong-Hui Chen, Tian Xu, Lei Yuan, Zongzhang Zhang, Zhi-Hua Zhou, Yang Yu:
Policy Rehearsing: Training Generalizable Policies for Reinforcement Learning. ICLR 2024 - [c134]Ziniu Li, Tian Xu, Yang Yu:
When is RL better than DPO in RLHF? A Representation and Optimization Perspective. Tiny Papers @ ICLR 2024 - [c133]Fan-Ming Luo, Tian Xu, Xingchen Cao, Yang Yu:
Reward-Consistent Dynamics Models are Strongly Generalizable for Offline Reinforcement Learning. ICLR 2024 - [c132]Jing-Cheng Pang, Pengyuan Wang, Kaiyuan Li, Xiong-Hui Chen, Jiacheng Xu, Zongzhang Zhang, Yang Yu:
Language Model Self-improvement by Reinforcement Learning Contemplation. ICLR 2024 - [c131]Zhilong Zhang, Yihao Sun, Junyin Ye, Tian-Shuo Liu, Jiaji Zhang, Yang Yu:
Flow to Better: Offline Preference-based Reinforcement Learning via Preferred Trajectory Generation. ICLR 2024 - [c130]Ruifeng Chen, Xiong-Hui Chen, Yihao Sun, Siyuan Xiao, Minhui Li, Yang Yu:
Policy-conditioned Environment Models are More Generalizable. ICML 2024 - [c129]Ruifeng Chen, Chengxing Jia, Zefang Huang, Tian-Shuo Liu, Xu-Hui Liu, Yang Yu:
Offline Transition Modeling via Contrastive Energy Learning. ICML 2024 - [c128]Xingchen Cao, Fan-Ming Luo, Junyin Ye, Tian Xu, Zhilong Zhang, Yang Yu:
Limited Preference Aided Imitation Learning from Imperfect Demonstrations. ICML 2024 - [c127]Xiong-Hui Chen, Junyin Ye, Hang Zhao, Yi-Chen Li, XuHui Liu, Haoran Shi, Yu-Yan Xu, Zhihao Ye, Si-Hang Yang, Yang Yu, Anqi Huang, Kai Xu, Zongzhang Zhang:
Deep Demonstration Tracing: Learning Generalizable Imitator Policy for Runtime Imitation from a Single Demonstration. ICML 2024 - [c126]Ziniu Li, Tian Xu, Yushun Zhang, Zhihang Lin, Yang Yu, Ruoyu Sun, Zhi-Quan Luo:
ReMax: A Simple, Effective, and Efficient Reinforcement Learning Method for Aligning Large Language Models. ICML 2024 - [c125]Xu-Hui Liu, Tian-Shuo Liu, Shengyi Jiang, Ruifeng Chen, Zhilong Zhang, Xinwei Chen, Yang Yu:
Energy-Guided Diffusion Sampling for Offline-to-Online Reinforcement Learning. ICML 2024 - [c124]Xinyu Zhang, Wenjie Qiu, Yi-Chen Li, Lei Yuan, Chengxing Jia, Zongzhang Zhang, Yang Yu:
Debiased Offline Representation Learning for Fast Online Adaptation in Non-stationary Dynamics. ICML 2024 - [c123]Lihe Li, Ruotong Chen, Ziqian Zhang, Zhichao Wu, Yi-Chen Li, Cong Guan, Yang Yu, Lei Yuan:
Continual Multi-Objective Reinforcement Learning via Reward Model Rehearsal. IJCAI 2024: 4434-4442 - [c122]Zhi-Hao Tan
, Jian-Dong Liu
, Xiao-Dong Bi
, Peng Tan
, Qin-Cheng Zheng
, Hai-Tian Liu
, Yi Xie
, Xiao-Chuan Zou
, Yang Yu
, Zhi-Hua Zhou
:
Beimingwu: A Learnware Dock System. KDD 2024: 5773-5782 - [c121]Tao Jiang, Lei Yuan, Lihe Li, Cong Guan, Zongzhang Zhang, Yang Yu:
Multi-Agent Domain Calibration with a Handful of Offline Data. NeurIPS 2024 - [c120]Fan-Ming Luo, Zuolin Tu, Zefang Huang, Yang Yu:
Efficient Recurrent Off-Policy RL Requires a Context-Encoder-Specific Learning Rate. NeurIPS 2024 - [c119]Jing-Cheng Pang, Si-Hang Yang, Kaiyuan Li, Jiaji Zhang, Xiong-Hui Chen, Nan Tang, Yang Yu:
KALM: Knowledgeable Agents by Offline Reinforcement Learning from Large Language Model Rollouts. NeurIPS 2024 - [c118]Ruiqi Xue, Ziqian Zhang, Lihe Li, Feng Chen, Yi-Chen Li, Yang Yu, Lei Yuan:
Dynamics Adaptive Safe Reinforcement Learning with a Misspecified Simulator. ECML/PKDD (7) 2024: 74-91 - [i97]Zhi-Hao Tan, Jian-Dong Liu, Xiao-Dong Bi, Peng Tan, Qin-Cheng Zheng, Hai-Tian Liu, Yi Xie, Xiao-Chuan Zou, Yang Yu, Zhi-Hua Zhou:
Beimingwu: A Learnware Dock System. CoRR abs/2401.14427 (2024) - [i96]Jing-Cheng Pang, Heng-Bo Fan, Pengyuan Wang, Jiahao Xiao, Nan Tang, Si-Hang Yang, Chengxing Jia, Sheng-Jun Huang, Yang Yu:
Empowering Language Models with Active Inquiry for Deeper Understanding. CoRR abs/2402.03719 (2024) - [i95]Xinyu Zhang, Wenjie Qiu, Yi-Chen Li, Lei Yuan, Chengxing Jia, Zongzhang Zhang, Yang Yu:
Debiased Offline Representation Learning for Fast Online Adaptation in Non-stationary Dynamics. CoRR abs/2402.11317 (2024) - [i94]Chengxing Jia, Fuxiang Zhang, Yi-Chen Li, Chenxiao Gao, Xu-Hui Liu, Lei Yuan, Zongzhang Zhang, Yang Yu:
Disentangling Policy from Offline Task Representation Learning via Adversarial Data Augmentation. CoRR abs/2403.07261 (2024) - [i93]Jing-Cheng Pang, Si-Hang Yang, Kaiyuan Li, Jiaji Zhang, Xiong-Hui Chen, Nan Tang, Yang Yu:
Knowledgeable Agents by Offline Reinforcement Learning from Large Language Model Rollouts. CoRR abs/2404.09248 (2024) - [i92]Fan-Ming Luo, Zuolin Tu, Zefang Huang, Yang Yu:
Efficient Recurrent Off-Policy RL Requires a Context-Encoder-Specific Learning Rate. CoRR abs/2405.15384 (2024) - [i91]Haoxin Lin, Yu-Yan Xu, Yihao Sun, Zhilong Zhang, Yi-Chen Li, Chengxing Jia, Junyin Ye, Jiaji Zhang, Yang Yu:
Any-step Dynamics Model Improves Future Predictions for Online and Offline Reinforcement Learning. CoRR abs/2405.17031 (2024) - [i90]Chengxing Jia, Pengyuan Wang, Ziniu Li, Yi-Chen Li, Zhilong Zhang, Nan Tang, Yang Yu:
BWArea Model: Learning World Model, Inverse Dynamics, and Policy for Controllable Language Generation. CoRR abs/2405.17039 (2024) - [i89]Yi-Chen Li, Fuxiang Zhang, Wenjie Qiu, Lei Yuan, Chengxing Jia, Zongzhang Zhang, Yang Yu:
Q-Adapter: Training Your LLM Adapter as a Residual Q-Function. CoRR abs/2407.03856 (2024) - [i88]Fuxiang Zhang, Junyou Li, Yi-Chen Li, Zongzhang Zhang, Yang Yu, Deheng Ye:
Improving Sample Efficiency of Reinforcement Learning with Background Knowledge from Large Language Models. CoRR abs/2407.03964 (2024) - [i87]Chen-Xiao Gao, Shengjun Fang, Chenjun Xiao, Yang Yu, Zongzhang Zhang:
Hindsight Preference Learning for Offline Preference-based Reinforcement Learning. CoRR abs/2407.04451 (2024) - [i86]Xu-Hui Liu, Tian-Shuo Liu, Shengyi Jiang, Ruifeng Chen, Zhilong Zhang, Xinwei Chen, Yang Yu:
Energy-Guided Diffusion Sampling for Offline-to-Online Reinforcement Learning. CoRR abs/2407.12448 (2024) - [i85]Zhilong Zhang, Ruifeng Chen, Junyin Ye, Yihao Sun, Pengyuan Wang, Jingcheng Pang, Kaiyuan Li, Tianshuo Liu, Haoxin Lin, Yang Yu, Zhi-Hua Zhou:
WHALE: Towards Generalizable and Scalable World Models for Embodied Decision-making. CoRR abs/2411.05619 (2024) - [i84]Feng Chen, Fuguang Han, Cong Guan, Lei Yuan, Zhilong Zhang, Yang Yu, Zongzhang Zhang:
Stable Continual Reinforcement Learning via Diffusion-based Trajectory Replay. CoRR abs/2411.10809 (2024) - 2023
- [j37]Hua Yang
, Minghao Zhao, Lei Yuan, Yang Yu, Zhenhua Li
, Ming Gu:
Memory-efficient Transformer-based network model for Traveling Salesman Problem. Neural Networks 161: 589-597 (2023) - [j36]Xiong-Hui Chen
, Fan-Ming Luo
, Yang Yu
, Qingyang Li
, Zhiwei Qin
, Wenjie Shang
, Jieping Ye
:
Offline Model-Based Adaptable Policy Learning for Decision-Making in Out-of-Support Regions. IEEE Trans. Pattern Anal. Mach. Intell. 45(12): 15260-15274 (2023) - [j35]Guangda Huzhang
, Zhen-Jia Pang, Yongqing Gao, Yawen Liu, Weijie Shen, Wen-Ji Zhou
, Qianying Lin, Qing Da, Anxiang Zeng
, Han Yu
, Yang Yu
, Zhi-Hua Zhou
:
AliExpress Learning-to-Rank: Maximizing Online Model Performance Without Going Online. IEEE Trans. Knowl. Data Eng. 35(2): 1214-1226 (2023) - [j34]Han Wang
, Yang Yu
, Yuan Jiang:
Fully Decentralized Multiagent Communication via Causal Inference. IEEE Trans. Neural Networks Learn. Syst. 34(12): 10193-10202 (2023) - [j33]Hang Zhao
, Zherong Pan
, Yang Yu
, Kai Xu
:
Learning Physically Realizable Skills for Online Packing of General 3D Shapes. ACM Trans. Graph. 42(5): 165:1-165:21 (2023) - [c117]Yang Yu, Qi Liu, Likang Wu, Runlong Yu
, Sanshi Lei Yu, Zaixi Zhang:
Untargeted Attack against Federated Recommendation Systems via Poisonous Item Embeddings and the Defense. AAAI 2023: 4854-4863 - [c116]Weijian Liao, Zongzhang Zhang, Yang Yu:
Policy-Independent Behavioral Metric-Based Representation for Deep Reinforcement Learning. AAAI 2023: 8746-8754 - [c115]Lei Yuan, Ziqian Zhang, Ke Xue, Hao Yin, Feng Chen, Cong Guan, Lihe Li, Chao Qian, Yang Yu:
Robust Multi-Agent Coordination via Evolutionary Generation of Auxiliary Adversarial Attackers. AAAI 2023: 11753-11762 - [c114]Chao Chen, Dawei Wang, Feng Mao, Zongzhang Zhang, Yang Yu:
Deep Anomaly Detection and Search via Reinforcement Learning (Student Abstract). AAAI 2023: 16180-16181 - [c113]Yi-Chen Li, Wen-Jie Shen, Boyu Zhang, Feng Mao, Zongzhang Zhang, Yang Yu:
Learning Generalizable Batch Active Learning Strategies via Deep Q-networks (Student Abstract). AAAI 2023: 16258-16259 - [c112]Aoran Wang, Hongyang Yang, Feng Mao, Zongzhang Zhang, Yang Yu, Xiaoyang Liu:
Anti-drifting Feature Selection via Deep Reinforcement Learning (Student Abstract). AAAI 2023: 16356-16357 - [c111]Renzhe Zhou, Zongzhang Zhang, Yang Yu:
Model-Based Offline Weighted Policy Optimization (Student Abstract). AAAI 2023: 16392-16393 - [c110]Shaowei Zhang, Jiahan Cao, Lei Yuan, Yang Yu, De-Chuan Zhan:
Self-Motivated Multi-Agent Exploration. AAMAS 2023: 476-484 - [c109]Xu-Hui Liu, Feng Xu, Xinyu Zhang, Tianyuan Liu, Shengyi Jiang, Ruifeng Chen, Zongzhang Zhang, Yang Yu:
How To Guide Your Learner: Imitation Learning with Active Adaptive Expert Involvement. AAMAS 2023: 1276-1284 - [c108]Lei Yuan
, Lihe Li
, Ziqian Zhang
, Feng Chen
, Tianyi Zhang
, Cong Guan
, Yang Yu, Zhi-Hua Zhou
:
Learning to Coordinate with Anyone. DAI 2023: 4:1-4:9 - [c107]Haoxin Lin, Yihao Sun, Jiaji Zhang, Yang Yu:
Model-Based Reinforcement Learning with Multi-Step Plan Value Estimation. ECAI 2023: 1481-1488 - [c106]Huakang Lu, Hong Qian, Yupeng Wu, Ziqi Liu, Ya-Lin Zhang, Aimin Zhou, Yang Yu:
Degradation-Resistant Offline Optimization via Accumulative Risk Control. ECAI 2023: 1609-1616 - [c105]Xiong-Hui Chen, Bowei He
, Yang Yu, Qingyang Li, Zhiwei Tony Qin, Wenjie Shang, Jieping Ye, Chen Ma
:
Sim2Rec: A Simulator-based Decision-making Approach to Optimize Real-World Long-term User Engagement in Sequential Recommender Systems. ICDE 2023: 3389-3402 - [c104]Fuxiang Zhang, Chengxing Jia, Yi-Chen Li, Lei Yuan, Yang Yu, Zongzhang Zhang:
Discovering Generalizable Multi-agent Coordination Skills from Multi-task Offline Data. ICLR 2023 - [c103]Yuhang Ran, Yi-Chen Li, Fuxiang Zhang, Zongzhang Zhang, Yang Yu:
Policy Regularization with Dataset Constraint for Offline Reinforcement Learning. ICML 2023: 28701-28717 - [c102]Yihao Sun, Jiaji Zhang, Chengxing Jia, Haoxin Lin, Junyin Ye, Yang Yu:
Model-Bellman Inconsistency for Model-based Offline Reinforcement Learning. ICML 2023: 33177-33194 - [c101]Jing-Cheng Pang, Si-Hang Yang, Xiong-Hui Chen, Xinyu Yang, Yang Yu, Mas Ma, Ziqi Guo, Howard Yang, Bill Huang:
Object-Oriented Option Framework for Robotics Manipulation in Clutter. IROS 2023: 1230-1237 - [c100]Jiacheng Xu
, Chao Chen
, Fuxiang Zhang
, Lei Yuan
, Zongzhang Zhang
, Yang Yu:
Internal Logical Induction for Pixel-Symbolic Reinforcement Learning. KDD 2023: 2825-2837 - [c99]Xiong-Hui Chen, Yang Yu, Zhengmao Zhu, Zhihua Yu, Zhenjun Chen, Chenghe Wang, Yinan Wu, Rong-Jun Qin, Hongqiu Wu, Ruijin Ding, Fangsheng Huang:
Adversarial Counterfactual Environment Model Learning. NeurIPS 2023 - [c98]Ziniu Li, Tian Xu, Zeyu Qin, Yang Yu, Zhi-Quan Luo:
Imitation Learning from Imperfection: Theoretical Justifications and Algorithms. NeurIPS 2023 - [c97]Yuren Liu, Biwei Huang, Zhengmao Zhu, Hong-Long Tian, Mingming Gong, Yang Yu, Kun Zhang:
Learning World Models with Identifiable Factorization. NeurIPS 2023 - [c96]Jing-Cheng Pang, Xinyu Yang, Si-Hang Yang, Xiong-Hui Chen, Yang Yu:
Natural Language Instruction-following with Task-related Language Development and Translation. NeurIPS 2023 - [c95]Tian Xu, Ziniu Li, Yang Yu, Zhi-Quan Luo:
Provably Efficient Adversarial Imitation Learning with Unknown Transitions. UAI 2023: 2367-2378 - [c94]Ziqian Zhang, Lei Yuan, Lihe Li, Ke Xue, Chengxing Jia, Cong Guan, Chao Qian, Yang Yu:
Fast Teammate Adaptation in the Presence of Sudden Policy Change. UAI 2023: 2465-2476 - [i83]Shaowei Zhang, Jiahan Cao, Lei Yuan, Yang Yu, De-Chuan Zhan:
Self-Motivated Multi-Agent Exploration. CoRR abs/2301.02083 (2023) - [i82]Ziniu Li, Tian Xu, Yang Yu, Zhi-Quan Luo:
Theoretical Analysis of Offline Imitation With Supplementary Dataset. CoRR abs/2301.11687 (2023) - [i81]Jing-Cheng Pang, Xinyu Yang, Si-Hang Yang, Yang Yu:
Natural Language-conditioned Reinforcement Learning with Inside-out Task Language Development and Translation. CoRR abs/2302.09368 (2023) - [i80]Cong Guan, Feng Chen, Lei Yuan, Zongzhang Zhang, Yang Yu:
Efficient Communication via Self-supervised Information Aggregation for Online and Offline Multi-agent Reinforcement Learning. CoRR abs/2302.09605 (2023) - [i79]Xu-Hui Liu, Feng Xu, Xinyu Zhang, Tianyuan Liu, Shengyi Jiang, Ruifeng Chen, Zongzhang Zhang, Yang Yu:
How To Guide Your Learner: Imitation Learning with Active Adaptive Expert Involvement. CoRR abs/2303.02073 (2023) - [i78]Zheng-Mao Zhu, Yu-Ren Liu, Hong-Long Tian, Yang Yu, Kun Zhang:
Beware of Instantaneous Dependence in Reinforcement Learning. CoRR abs/2303.05458 (2023) - [i77]Xiong-Hui Chen, Bowei He, Yang Yu, Qingyang Li, Zhiwei Tony Qin, Wenjie Shang, Jieping Ye, Chen Ma:
Sim2Rec: A Simulator-based Decision-making Approach to Optimize Real-World Long-term User Engagement in Sequential Recommender Systems. CoRR abs/2305.04832 (2023) - [i76]Lei Yuan, Feng Chen, Zongzhang Zhang, Yang Yu:
Communication-Robust Multi-Agent Learning by Adaptable Auxiliary Multi-Agent Adversary Generation. CoRR abs/2305.05116 (2023) - [i75]Lei Yuan, Ziqian Zhang, Ke Xue, Hao Yin, Feng Chen, Cong Guan, Lihe Li, Chao Qian, Yang Yu:
Robust multi-agent coordination via evolutionary generation of auxiliary adversarial attackers. CoRR abs/2305.05909 (2023) - [i74]Ziqian Zhang, Lei Yuan, Lihe Li, Ke Xue, Chengxing Jia, Cong Guan, Chao Qian, Yang Yu:
Fast Teammate Adaptation in the Presence of Sudden Policy Change. CoRR abs/2305.05911 (2023) - [i73]Lei Yuan, Tao Jiang, Lihe Li, Feng Chen, Zongzhang Zhang, Yang Yu:
Robust Multi-agent Communication via Multi-view Message Certification. CoRR abs/2305.13936 (2023) - [i72]Lei Yuan, Lihe Li, Ziqian Zhang, Fuxiang Zhang, Cong Guan, Yang Yu:
Multi-agent Continual Coordination via Progressive Task Contextualization. CoRR abs/2305.13937 (2023) - [i71]Jing-Cheng Pang, Pengyuan Wang, Kaiyuan Li, Xiong-Hui Chen, Jiacheng Xu, Zongzhang Zhang, Yang Yu:
Language Model Self-improvement by Reinforcement Learning Contemplation. CoRR abs/2305.14483 (2023) - [i70]Yu-Ren Liu, Biwei Huang, Zheng-Mao Zhu, Hong-Long Tian, Mingming Gong, Yang Yu, Kun Zhang:
Learning World Models with Identifiable Factorization. CoRR abs/2306.06561 (2023) - [i69]Tian Xu, Ziniu Li, Yang Yu, Zhi-Quan Luo:
Provably Efficient Adversarial Imitation Learning with Unknown Transitions. CoRR abs/2306.06563 (2023) - [i68]Yuhang Ran, Yi-Chen Li, Fuxiang Zhang, Zongzhang Zhang, Yang Yu:
Policy Regularization with Dataset Constraint for Offline Reinforcement Learning. CoRR abs/2306.06569 (2023) - [i67]Chenxiao Gao, Chenyang Wu
, Mingjun Cao, Rui Kong, Zongzhang Zhang, Yang Yu:
ACT: Empowering Decision Transformer with Dynamic Programming via Advantage Conditioning. CoRR abs/2309.05915 (2023) - [i66]Lei Yuan, Lihe Li, Ziqian Zhang, Feng Chen, Tianyi Zhang, Cong Guan, Yang Yu, Zhi-Hua Zhou:
Learning to Coordinate with Anyone. CoRR abs/2309.12633 (2023) - [i65]Fan-Ming Luo, Tian Xu, Xingchen Cao, Yang Yu:
Reward-Consistent Dynamics Models are Strongly Generalizable for Offline Reinforcement Learning. CoRR abs/2310.05422 (2023) - [i64]Xiong-Hui Chen, Junyin Ye, Hang Zhao, Yi-Chen Li, Haoran Shi, Yu-Yan Xu, Zhihao Ye, Si-Hang Yang, Anqi Huang, Kai Xu, Zongzhang Zhang, Yang Yu:
Imitator Learning: Achieve Out-of-the-Box Imitation Ability in Variable Environments. CoRR abs/2310.05712 (2023) - [i63]Ziniu Li, Tian Xu, Yushun Zhang, Yang Yu, Ruoyu Sun, Zhi-Quan Luo:
ReMax: A Simple, Effective, and Efficient Reinforcement Learning Method for Aligning Large Language Models. CoRR abs/2310.10505 (2023) - [i62]Cong Guan, Lichao Zhang, Chunpeng Fan, Yichen Li, Feng Chen, Lihe Li, Yunjia Tian, Lei Yuan, Yang Yu:
Efficient Human-AI Coordination via Preparatory Language-based Convention. CoRR abs/2311.00416 (2023) - [i61]Lei Yuan, Ziqian Zhang, Lihe Li, Cong Guan, Yang Yu:
A Survey of Progress on Cooperative Multi-agent Reinforcement Learning in Open Environment. CoRR abs/2312.01058 (2023) - [i60]Ziniu Li, Tian Xu, Yang Yu:
Policy Optimization in RLHF: The Impact of Out-of-preference Data. CoRR abs/2312.10584 (2023) - [i59]Haoxin Lin, Hongqiu Wu, Jiaji Zhang, Yihao Sun, Junyin Ye, Yang Yu:
Episodic Return Decomposition by Difference of Implicitly Assigned Sub-Trajectory Reward. CoRR abs/2312.10642 (2023) - [i58]Renzhe Zhou, Chenxiao Gao, Zongzhang Zhang, Yang Yu:
Generalizable Task Representation Learning for Offline Meta-Reinforcement Learning with Data Limitations. CoRR abs/2312.15909 (2023) - 2022
- [j32]Yu-Ren Liu, Yi-Qi Hu, Hong Qian, Chao Qian, Yang Yu:
ZOOpt: a toolbox for derivative-free optimization. Sci. China Inf. Sci. 65(10) (2022) - [j31]Ruo-Ze Liu, Zhen-Jia Pang, Zhou-Yu Meng, Wenhai Wang, Yang Yu, Tong Lu:
On Efficient Reinforcement Learning for Full-length Game of StarCraft II. J. Artif. Intell. Res. 75: 213-260 (2022) - [j30]Yi-Feng Zhang
, Fan-Ming Luo, Yang Yu:
Improve generated adversarial imitation learning with reward variance regularization. Mach. Learn. 111(3): 977-995 (2022) - [j29]Yi-Qi Hu
, Xu-Hui Liu
, Shu-Qiao Li
, Yang Yu:
Cascaded Algorithm Selection With Extreme-Region UCB Bandit. IEEE Trans. Pattern Anal. Mach. Intell. 44(10): 6782-6794 (2022) - [j28]Tian Xu
, Ziniu Li, Yang Yu:
Error Bounds of Imitating Policies and Environments for Reinforcement Learning. IEEE Trans. Pattern Anal. Mach. Intell. 44(10): 6968-6980 (2022) - [j27]Ruo-Ze Liu
, Haifeng Guo, Xiaozhong Ji, Yang Yu, Zhen-Jia Pang, Zitai Xiao, Yuzhou Wu, Tong Lu
:
Efficient Reinforcement Learning for StarCraft by Abstract Forward Models and Transfer Learning. IEEE Trans. Games 14(2): 294-307 (2022) - [j26]Xin Jin, Yanping Xie, Xiu-Shen Wei
, Borui Zhao, Yongshun Zhang, Xiaoyang Tan
, Yang Yu:
A Lightweight Encoder-Decoder Path for Deep Residual Networks. IEEE Trans. Neural Networks Learn. Syst. 33(2): 866-878 (2022) - [j25]Yang Yu, Chengjie Niu, Jun Li, Kai Xu
:
Multi-view 2D-3D alignment with hybrid bundle adjustment for visual metrology. Vis. Comput. 38(4): 1483-1494 (2022) - [c93]Fan-Ming Luo, Shengyi Jiang, Yang Yu, Zongzhang Zhang, Yi-Feng Zhang:
Adapt to Environment Sudden Changes by Learning a Context Sensitive Policy. AAAI 2022: 7637-7646 - [c92]Zheng-Mao Zhu, Shengyi Jiang, Yu-Ren Liu, Yang Yu, Kun Zhang:
Invariant Action Effect Model for Reinforcement Learning. AAAI 2022: 9260-9268 - [c91]Lei Yuan, Jianhao Wang, Fuxiang Zhang, Chenghe Wang, Zongzhang Zhang, Yang Yu, Chongjie Zhang:
Multi-Agent Incentive Communication via Decentralized Teammate Modeling. AAAI 2022: 9466-9474 - [c90]Yang Yu, Rui Jin, Hao Yin
, Keke Gai, Zijian Zhang:
A Searchable Re-encryption-based Scheme for Massive Data Transactions. CSCloud/EdgeCom 2022: 135-140 - [c89]Tonghan Wang, Liang Zeng, Weijun Dong, Qianlan Yang, Yang Yu, Chongjie Zhang:
Context-Aware Sparse Deep Coordination Graphs. ICLR 2022 - [c88]Siyuan Li, Jin Zhang, Jianhao Wang, Yang Yu, Chongjie Zhang:
Active Hierarchical Exploration with Stable Subgoal Representation Learning. ICLR 2022 - [c87]Hang Zhao, Yang Yu, Kai Xu:
Learning Efficient Online 3D Bin Packing on Packing Configuration Trees. ICLR 2022 - [c86]Hong Qian, Xu-Hui Liu, Chen-Xi Su, Aimin Zhou, Yang Yu:
The Teaching Dimension of Regularized Kernel Learners. ICML 2022: 17984-18002 - [c85]Di Xue
, Lei Yuan, Zongzhang Zhang, Yang Yu:
Efficient Multi-Agent Communication via Shapley Message Value. IJCAI 2022: 578-584 - [c84]Lei Yuan, Chenghe Wang, Jianhao Wang, Fuxiang Zhang, Feng Chen, Cong Guan, Zongzhang Zhang, Chongjie Zhang, Yang Yu:
Multi-Agent Concentrative Coordination with Decentralized Task Representation. IJCAI 2022: 599-605 - [c83]Ke Xue, Jiacheng Xu, Lei Yuan, Miqing Li, Chao Qian, Zongzhang Zhang, Yang Yu:
Multi-agent Dynamic Algorithm Configuration. NeurIPS 2022 - [c82]Cong Guan, Feng Chen, Lei Yuan, Chenghe Wang, Hao Yin, Zongzhang Zhang, Yang Yu:
Efficient Multi-agent Communication via Self-supervised Information Aggregation. NeurIPS 2022 - [c81]Rongjun Qin, Xingyuan Zhang, Songyi Gao, Xiong-Hui Chen, Zewen Li, Weinan Zhang, Yang Yu:
NeoRL: A Near Real-World Benchmark for Offline Reinforcement Learning. NeurIPS 2022 - [c80]Chenyang Wu, Tianci Li, Zongzhang Zhang, Yang Yu:
Bayesian Optimistic Optimization: Optimistic Exploration for Model-based Reinforcement Learning. NeurIPS 2022 - [e4]João Gama, Tianrui Li
, Yang Yu, Enhong Chen, Yu Zheng, Fei Teng:
Advances in Knowledge Discovery and Data Mining - 26th Pacific-Asia Conference, PAKDD 2022, Chengdu, China, May 16-19, 2022, Proceedings, Part I. Lecture Notes in Computer Science 13280, Springer 2022, ISBN 978-3-031-05932-2 [contents] - [e3]João Gama, Tianrui Li
, Yang Yu, Enhong Chen, Yu Zheng, Fei Teng:
Advances in Knowledge Discovery and Data Mining - 26th Pacific-Asia Conference, PAKDD 2022, Chengdu, China, May 16-19, 2022, Proceedings, Part II. Lecture Notes in Computer Science 13281, Springer 2022, ISBN 978-3-031-05935-3 [contents] - [e2]João Gama, Tianrui Li
, Yang Yu, Enhong Chen, Yu Zheng, Fei Teng:
Advances in Knowledge Discovery and Data Mining - 26th Pacific-Asia Conference, PAKDD 2022, Chengdu, China, May 16-19, 2022, Proceedings, Part III. Lecture Notes in Computer Science 13282, Springer 2022, ISBN 978-3-031-05980-3 [contents] - [i57]Ziniu Li, Tian Xu, Yang Yu, Zhi-Quan Luo:
Rethinking ValueDice: Does It Really Improve Performance? CoRR abs/2202.02468 (2022) - [i56]Rongjun Qin, Feng Chen, Tonghan Wang, Lei Yuan, Xiaoran Wu, Zongzhang Zhang, Chongjie Zhang, Yang Yu:
Multi-Agent Policy Transfer via Task Relationship Modeling. CoRR abs/2203.04482 (2022) - [i55]Ziniu Li, Tian Xu, Yang Yu:
A Note on Target Q-learning For Solving Finite MDPs with A Generative Oracle. CoRR abs/2203.11489 (2022) - [i54]Fan-Ming Luo, Xingchen Cao, Yang Yu:
Transferable Reward Learning by Dynamics-Agnostic Discriminator Ensemble. CoRR abs/2206.00238 (2022) - [i53]Zheng-Mao Zhu, Xiong-Hui Chen, Hong-Long Tian, Kun Zhang, Yang Yu:
Offline Reinforcement Learning with Causal Structured World Models. CoRR abs/2206.01474 (2022) - [i52]Xue-Kun Jin, Xu-Hui Liu, Shengyi Jiang, Yang Yu:
Hybrid Value Estimation for Off-policy Evaluation and Offline Reinforcement Learning. CoRR abs/2206.02000 (2022) - [i51]Xiong-Hui Chen, Yang Yu, Zheng-Mao Zhu, Zhihua Yu, Zhenjun Chen, Chenghe Wang, Yinan Wu, Hongqiu Wu, Rong-Jun Qin, Ruijin Ding, Fangsheng Huang:
Adversarial Counterfactual Environment Model Learning. CoRR abs/2206.04890 (2022) - [i50]Fan-Ming Luo, Tian Xu, Hang Lai, Xiong-Hui Chen, Weinan Zhang, Yang Yu:
A Survey on Model-based Reinforcement Learning. CoRR abs/2206.09328 (2022) - [i49]Tian Xu, Ziniu Li, Yang Yu, Zhi-Quan Luo:
Understanding Adversarial Imitation Learning in Small Sample Regime: A Stage-coupled Analysis. CoRR abs/2208.01899 (2022) - [i48]Ke Xue
, Yutong Wang, Lei Yuan, Cong Guan, Chao Qian, Yang Yu:
Heterogeneous Multi-agent Zero-Shot Coordination by Coevolution. CoRR abs/2208.04957 (2022) - [i47]Rong-Jun Qin, Fan-Ming Luo, Hong Qian, Yang Yu:
Unified Policy Optimization for Continuous-action Reinforcement Learning in Non-stationary Tasks and Games. CoRR abs/2208.09452 (2022) - [i46]Haoxin Lin, Yihao Sun, Jiaji Zhang, Yang Yu:
Model-based Reinforcement Learning with Multi-step Plan Value Estimation. CoRR abs/2209.05530 (2022) - [i45]Ruo-Ze Liu, Zhen-Jia Pang, Zhou-Yu Meng, Wenhai Wang, Yang Yu, Tong Lu:
On Efficient Reinforcement Learning for Full-length Game of StarCraft II. CoRR abs/2209.11553 (2022) - [i44]Zhengbang Zhu, Rongjun Qin, Junjie Huang, Xinyi Dai
, Yang Yu, Yong Yu, Weinan Zhang:
Understanding or Manipulation: Rethinking Online Performance Gains of Modern Recommender Systems. CoRR abs/2210.05662 (2022) - [i43]Ke Xue
, Jiacheng Xu
, Lei Yuan, Miqing Li, Chao Qian, Zongzhang Zhang, Yang Yu:
Multi-agent Dynamic Algorithm Configuration. CoRR abs/2210.06835 (2022) - [i42]Hang Zhao, Zherong Pan, Yang Yu, Kai Xu:
Learning Physically Realizable Skills for Online Packing of General 3D Shapes. CoRR abs/2212.02094 (2022) - [i41]Yang Yu, Qi Liu, Likang Wu, Runlong Yu, Sanshi Lei Yu, Zaixi Zhang:
Untargeted Attack against Federated Recommendation Systems via Poisonous Item Embeddings and the Defense. CoRR abs/2212.05399 (2022) - 2021
- [j24]Anxiang Zeng, Han Yu, Qing Da, Yusen Zhan, Yang Yu, Jingren Zhou, Chunyan Miao:
Improving Search Engine Efficiency through Contextual Factor Selection. AI Mag. 42(2): 50-58 (2021) - [j23]Chao Qian
, Chao Bian, Yang Yu, Ke Tang, Xin Yao:
Analysis of Noisy Evolutionary Optimization When Sampling Fails. Algorithmica 83(4): 940-975 (2021) - [j22]Chao Bian, Chao Qian, Yang Yu, Ke Tang:
On the robustness of median sampling in noisy evolutionary optimization. Sci. China Inf. Sci. 64(5) (2021) - [j21]Lei Bu
, Yongjuan Liang, Zhunyi Xie, Hong Qian, Yi-Qi Hu, Yang Yu, Xin Chen, Xuandong Li:
Machine learning steered symbolic execution framework for complex software code. Formal Aspects Comput. 33(3): 301-323 (2021) - [j20]Wenjie Shang
, Qingyang Li, Zhiwei (Tony) Qin, Yang Yu, Yiping Meng, Jieping Ye:
Partially observable environment estimation with uplift inference for reinforcement learning based recommendation. Mach. Learn. 110(9): 2603-2640 (2021) - [j19]Hugo Jair Escalante
, Quanming Yao
, Wei-Wei Tu, Nelishia Pillay, Rong Qu
, Yang Yu, Neil Houlsby:
Guest Editorial: Automated Machine Learning. IEEE Trans. Pattern Anal. Mach. Intell. 43(9): 2887-2890 (2021) - [c79]Chenyang Wu, Rui Kong, Guoyu Yang, Xianghan Kong, Zongzhang Zhang, Yang Yu, Dong Li, Wulong Liu:
LB-DESPOT: Efficient Online POMDP Planning Considering Lower Bound in Action Selection (Student Abstract). AAAI 2021: 15927-15928 - [c78]Feng Xu, Shengyi Jiang, Hao Yin, Zongzhang Zhang, Yang Yu, Ming Li, Dong Li, Wulong Liu:
Enhancing Context-Based Meta-Reinforcement Learning Algorithms via An Efficient Task Encoder (Student Abstract). AAAI 2021: 15937-15938 - [c77]Jianhao Wang, Zhizhou Ren, Terry Liu, Yang Yu, Chongjie Zhang:
QPLEX: Duplex Dueling Multi-Agent Q-Learning. ICLR 2021 - [c76]Chao Bian, Chao Qian, Frank Neumann, Yang Yu:
Fast Pareto Optimization for Subset Selection with Dynamic Cost Constraints. IJCAI 2021: 2191-2197 - [c75]Weijie Shen, Lei Yuan, Junfu Huang, Songyi Gao, Yuyang Huang, Yang Yu:
Sequential and Dynamic constraint Contrastive Learning for Reinforcement Learning. IJCNN 2021: 1-9 - [c74]Xiong-Hui Chen, Yang Yu, Qingyang Li, Fan-Ming Luo, Zhiwei (Tony) Qin, Wenjie Shang, Jieping Ye:
Offline Model-based Adaptable Policy Learning. NeurIPS 2021: 8432-8443 - [c73]Xiong-Hui Chen, Shengyi Jiang, Feng Xu, Zongzhang Zhang, Yang Yu:
Cross-modal Domain Adaptation for Cost-Efficient Visual Reinforcement Learning. NeurIPS 2021: 12520-12532 - [c72]Xu-Hui Liu, Zhenghai Xue, Jing-Cheng Pang, Shengyi Jiang, Feng Xu, Yang Yu:
Regret Minimization Experience Replay in Off-Policy Reinforcement Learning. NeurIPS 2021: 17604-17615 - [c71]Chenyang Wu, Guoyu Yang, Zongzhang Zhang, Yang Yu, Dong Li, Wulong Liu, Jianye Hao:
Adaptive Online Packing-guided Search for POMDPs. NeurIPS 2021: 28419-28430 - [i40]Rongjun Qin, Songyi Gao, Xingyuan Zhang, Zhen Xu, Shengkai Huang, Zewen Li, Weinan Zhang, Yang Yu:
NeoRL: A Near Real-World Benchmark for Offline Reinforcement Learning. CoRR abs/2102.00714 (2021) - [i39]Hong Qian, Yang Yu:
Derivative-Free Reinforcement Learning: A Review. CoRR abs/2102.05710 (2021) - [i38]Ruo-Ze Liu, Wenhai Wang, Yanjie Shen, Zhiqi Li, Yang Yu, Tong Lu:
An Introduction of mini-AlphaStar. CoRR abs/2104.06890 (2021) - [i37]Zhenghai Xue, Xu-Hui Liu, Jing-Cheng Pang, Shengyi Jiang, Feng Xu, Yang Yu:
Regret Minimization Experience Replay. CoRR abs/2105.07253 (2021) - [i36]Jing-Cheng Pang, Tian Xu, Shengyi Jiang, Yu-Ren Liu, Yang Yu:
Sparsity Prior Regularized Q-learning for Sparse Action Tasks. CoRR abs/2105.08666 (2021) - [i35]Tonghan Wang, Liang Zeng, Weijun Dong, Qianlan Yang, Yang Yu, Chongjie Zhang:
Context-Aware Sparse Deep Coordination Graphs. CoRR abs/2106.02886 (2021) - [i34]Tian Xu, Ziniu Li, Yang Yu:
Nearly Minimax Optimal Adversarial Imitation Learning with Known and Unknown Transitions. CoRR abs/2106.10424 (2021) - [i33]Yongqing Gao, Guangda Huzhang, Weijie Shen, Yawen Liu, Wen-Ji Zhou, Qing Da, Dan Shen, Yang Yu:
Imitate TheWorld: A Search Engine Simulation Platform. CoRR abs/2107.07693 (2021) - [i32]Zhao-Hua Li, Yang Yu, Yingfeng Chen, Ke Chen, Zhipeng Hu, Changjie Fan:
Neural-to-Tree Policy Distillation with Policy Improvement Criterion. CoRR abs/2108.06898 (2021) - [i31]Jiahan Cao, Lei Yuan, Jianhao Wang, Shaowei Zhang, Chongjie Zhang, Yang Yu, De-Chuan Zhan:
LINDA: Multi-Agent Local Information Decomposition for Awareness of Teammates. CoRR abs/2109.12508 (2021) - [i30]Qixin Zhang, Wenbing Ye, Zaiyi Chen, Haoyuan Hu, Enhong Chen, Yang Yu:
Online Allocation with Two-sided Resource Constraints. CoRR abs/2112.13964 (2021) - 2020
- [j18]Chengjie Niu, Yang Yu, Zhenwei Bian, Jun Li, Kai Xu:
Weakly Supervised Part-wise 3D Shape Reconstruction from Single-View RGB Images. Comput. Graph. Forum 39(7): 447-457 (2020) - [j17]Yi-Qi Hu
, Yang Yu:
A technical view on neural architecture search. Int. J. Mach. Learn. Cybern. 11(4): 795-811 (2020) - [j16]Chao Bian, Chao Qian, Ke Tang, Yang Yu:
Running time analysis of the (1+1)-EA for robust linear optimization. Theor. Comput. Sci. 843: 57-72 (2020) - [c70]Chao Bian, Chao Feng, Chao Qian, Yang Yu:
An Efficient Evolutionary Algorithm for Subset Selection with General Cost Constraints. AAAI 2020: 3267-3274 - [c69]Meng Wang, Yingfeng Chen, Tangjie Lv, Yan Song, Kai Guan, Changjie Fan, Yang Yu:
Reinforcement Learning with Action-Specific Focuses in Video Games. CoG 2020: 9-16 - [c68]Yi-Qi Hu, Zelin Liu, Hua Yang, Yang Yu, Yunfeng Liu:
Derivative-Free Optimization with Adaptive Experience for Efficient Hyper-Parameter Tuning. ECAI 2020: 1207-1214 - [c67]Shengyi Jiang, Jing-Cheng Pang, Yang Yu:
Offline Imitation Learning with a Misspecified Simulator. NeurIPS 2020 - [c66]Tian Xu, Ziniu Li, Yang Yu:
Error Bounds of Imitating Policies and Environments. NeurIPS 2020 - [e1]Matthew E. Taylor, Yang Yu, Edith Elkind, Yang Gao:
Distributed Artificial Intelligence - Second International Conference, DAI 2020, Nanjing, China, October 24-27, 2020, Proceedings. Lecture Notes in Computer Science 12547, Springer 2020, ISBN 978-3-030-64095-8 [contents] - [i29]Wen-Ji Zhou, Yang Yu:
Temporal-adaptive Hierarchical Reinforcement Learning. CoRR abs/2002.02080 (2020) - [i28]Chao Wang, Ruo-Ze Liu, Han-Jia Ye, Yang Yu:
Novelty-Prepared Few-Shot Classification. CoRR abs/2003.00497 (2020) - [i27]Guangda Huzhang, Zhen-Jia Pang, Yongqing Gao, Wen-Ji Zhou, Qing Da, Anxiang Zeng, Yang Yu:
Validation Set Evaluation can be Wrong: An Evaluator-Generator Approach for Maximizing Online Performance of Ranking in E-commerce. CoRR abs/2003.11941 (2020) - [i26]Jianhao Wang, Zhizhou Ren, Terry Liu, Yang Yu, Chongjie Zhang:
QPLEX: Duplex Dueling Multi-Agent Q-Learning. CoRR abs/2008.01062 (2020) - [i25]Tian Xu, Ziniu Li, Yang Yu:
Error Bounds of Imitating Policies and Environments. CoRR abs/2010.11876 (2020)
2010 – 2019
- 2019
- [b1]Zhi-Hua Zhou, Yang Yu, Chao Qian:
Evolutionary Learning: Advances in Theories and Algorithms. Springer 2019, ISBN 978-981-13-5955-2, pp. 3-293 - [j15]Chao Qian, Yang Yu, Ke Tang, Xin Yao, Zhi-Hua Zhou:
Maximizing submodular or monotone approximately submodular functions by multi-objective evolutionary algorithms. Artif. Intell. 275: 279-294 (2019) - [c65]Yi-Qi Hu, Yang Yu, Wei-Wei Tu, Qiang Yang, Yuqiang Chen, Wenyuan Dai:
Multi-Fidelity Automatic Hyper-Parameter Tuning via Transfer Series Expansion. AAAI 2019: 3846-3853 - [c64]Zhen-Jia Pang, Ruo-Ze Liu, Zhou-Yu Meng, Yi Zhang
, Yang Yu, Tong Lu:
On Reinforcement Learning for Full-Length Game of StarCraft. AAAI 2019: 4691-4698 - [c63]Jing-Cheng Shi, Yang Yu, Qing Da, Shi-Yong Chen, Anxiang Zeng:
Virtual-Taobao: Virtualizing Real-World Online Retail Environment for Reinforcement Learning. AAAI 2019: 4902-4909 - [c62]Xiong-Hui Chen, Yang Yu:
Reinforcement Learning with Derivative-Free Exploration. AAMAS 2019: 1880-1882 - [c61]Yu-Ren Liu, Yi-Qi Hu, Hong Qian, Yang Yu:
Asynchronous classification-based optimization. DAI 2019: 9:1-9:8 - [c60]Songyi Gao, Weijie Shen, Zelin Liu, An Zhu, Yang Yu:
Only Image Cosine Embedding for Few-Shot Learning. ICONIP (2) 2019: 83-94 - [c59]Yi-Qi Hu, Yang Yu, Jun-Da Liao:
Cascaded Algorithm-Selection and Hyper-Parameter Optimization with Extreme-Region Upper Confidence Bound Bandit. IJCAI 2019: 2528-2534 - [c58]Wen-Ji Zhou
, Yang Yu, Yingfeng Chen, Kai Guan, Tangjie Lv, Changjie Fan, Zhi-Hua Zhou:
Reinforcement Learning Experience Reuse with Policy Residual Representation. IJCAI 2019: 4447-4453 - [c57]Wenjie Shang, Yang Yu, Qingyang Li, Zhiwei (Tony) Qin, Yiping Meng, Jieping Ye:
Environment Reconstruction with Hidden Confounders for Reinforcement Learning based Recommendation. KDD 2019: 566-576 - [c56]Wang-Zhou Dai, Qiu-Ling Xu, Yang Yu, Zhi-Hua Zhou:
Bridging Machine Learning and Logical Reasoning by Abductive Learning. NeurIPS 2019: 2811-2822 - [i24]Ruo-Ze Liu, Haifeng Guo, Xiaozhong Ji, Yang Yu, Zitai Xiao, Yuzhou Wu, Zhen-Jia Pang, Tong Lu:
Efficient Reinforcement Learning with a Mind-Game for Full-Length StarCraft II. CoRR abs/1903.00715 (2019) - [i23]Yi-Qi Hu, Yang Yu, Jun-Da Liao:
Cascaded Algorithm-Selection and Hyper-Parameter Optimization with Extreme-Region Upper Confidence Bound Bandit. CoRR abs/1905.13703 (2019) - [i22]Wen-Ji Zhou, Yang Yu, Yingfeng Chen, Kai Guan, Tangjie Lv, Changjie Fan, Zhi-Hua Zhou:
Reinforcement Learning Experience Reuse with Policy Residual Representation. CoRR abs/1905.13719 (2019) - [i21]Wenjie Shang, Yang Yu, Qingyang Li, Zhiwei (Tony) Qin, Yiping Meng, Jieping Ye:
Environment Reconstruction with Hidden Confounders for Reinforcement Learning based Recommendation. CoRR abs/1907.06584 (2019) - [i20]Jorge G. Madrid, Hugo Jair Escalante, Eduardo F. Morales, Wei-Wei Tu, Yang Yu, Lisheng Sun-Hosoya, Isabelle Guyon, Michèle Sebag:
Towards AutoML in the presence of Drift: first results. CoRR abs/1907.10772 (2019) - [i19]Chao Bian, Chao Qian, Yang Yu:
On the Robustness of Median Sampling in Noisy Evolutionary Optimization. CoRR abs/1907.13100 (2019) - [i18]Tian Xu, Ziniu Li, Yang Yu:
On Value Discrepancy of Imitation Learning. CoRR abs/1911.07027 (2019) - [i17]Rong-Jun Qin, Jing-Cheng Pang, Yang Yu:
Improving Fictitious Play Reinforcement Learning with Expanding Models. CoRR abs/1911.11928 (2019) - 2018
- [j14]Chao Qian, Yang Yu, Zhi-Hua Zhou:
Analyzing Evolutionary Optimization in Noisy Environments. Evol. Comput. 26(1) (2018) - [j13]Chao Qian, Yang Yu, Ke Tang, Yaochu Jin
, Xin Yao
, Zhi-Hua Zhou:
On the Effectiveness of Sampling for Evolutionary Optimization in Noisy Environments. Evol. Comput. 26(2) (2018) - [j12]Yang Yu
, Shi-Yong Chen
, Qing Da, Zhi-Hua Zhou
:
Reusable Reinforcement Learning via Shallow Trails. IEEE Trans. Neural Networks Learn. Syst. 29(6): 2204-2215 (2018) - [c55]Hong Wang, Hong Qian, Yang Yu:
Noisy Derivative-Free Optimization With Value Suppression. AAAI 2018: 1447-1454 - [c54]Chao Qian, Chao Bian, Yang Yu, Ke Tang, Xin Yao
:
Analysis of noisy evolutionary optimization when sampling fails. GECCO 2018: 1507-1514 - [c53]Wenqiang Pu, Yang Yu, Shuhua Yu, Zhi-Quan Luo:
An Alternating Minimization Approach to Optimizing Subarray Configuration for a Large Phased Array. SAM 2018: 361-365 - [c52]Chao Qian, Yang Yu, Ke Tang:
Approximation Guarantees of Stochastic Greedy Algorithms for Subset Selection. IJCAI 2018: 1478-1484 - [c51]Yi-Qi Hu, Yang Yu, Zhi-Hua Zhou:
Experienced Optimization with Reusable Directional Model for Hyper-Parameter Search. IJCAI 2018: 2276-2282 - [c50]Yang Yu, Wen-Ji Zhou
:
Mixture of GANs for Clustering. IJCAI 2018: 3047-3053 - [c49]Chao Zhang, Yang Yu, Zhi-Hua Zhou:
Learning Environmental Calibration Actions for Policy Self-Evolution. IJCAI 2018: 3061-3067 - [c48]Yang Yu:
Towards Sample Efficient Reinforcement Learning. IJCAI 2018: 5739-5743 - [c47]Yujing Hu, Qing Da, Anxiang Zeng, Yang Yu, Yinghui Xu:
Reinforcement Learning to Rank in E-Commerce Search Engine: Formalization, Analysis, and Application. KDD 2018: 368-377 - [c46]Shi-Yong Chen, Yang Yu, Qing Da, Jun Tan, Hai-Kuan Huang, Hai-Hong Tang:
Stabilizing Reinforcement Learning in Dynamic Environment with Application to Online Recommendation. KDD 2018: 1187-1196 - [c45]Ji Feng, Yang Yu, Zhi-Hua Zhou:
Multi-Layered Gradient Boosting Decision Trees. NeurIPS 2018: 3555-3565 - [i16]Yu-Ren Liu, Yi-Qi Hu, Hong Qian, Yang Yu, Chao Qian:
ZOOpt/ZOOjl: Toolbox for Derivative-Free Optimization. CoRR abs/1801.00329 (2018) - [i15]Wang-Zhou Dai, Qiu-Ling Xu, Yang Yu, Zhi-Hua Zhou:
Tunneling Neural Perception and Logic Reasoning through Abductive Learning. CoRR abs/1802.01173 (2018) - [i14]Yusen Zhan, Qing Da, Fei Xiao, Anxiang Zeng, Yang Yu:
Accelerating E-Commerce Search Engine Ranking by Contextual Factor Selection. CoRR abs/1803.00693 (2018) - [i13]Yujing Hu, Qing Da, Anxiang Zeng, Yang Yu, Yinghui Xu:
Reinforcement Learning to Rank in E-Commerce Search Engine: Formalization, Analysis, and Application. CoRR abs/1803.00710 (2018) - [i12]Jing-Cheng Shi, Yang Yu, Qing Da, Shi-Yong Chen, Anxiang Zeng:
Virtual-Taobao: Virtualizing Real-world Online Retail Environment for Reinforcement Learning. CoRR abs/1805.10000 (2018) - [i11]Ji Feng, Yang Yu, Zhi-Hua Zhou:
Multi-Layered Gradient Boosting Decision Trees. CoRR abs/1806.00007 (2018) - [i10]Zhen-Jia Pang, Ruo-Ze Liu, Zhou-Yu Meng, Yi Zhang, Yang Yu, Tong Lu:
On Reinforcement Learning for Full-length Game of StarCraft. CoRR abs/1809.09095 (2018) - [i9]Chao Qian, Chao Bian, Yang Yu, Ke Tang, Xin Yao:
Analysis of Noisy Evolutionary Optimization When Sampling Fails. CoRR abs/1810.05045 (2018) - [i8]Quanming Yao, Mengshuo Wang, Hugo Jair Escalante, Isabelle Guyon, Yi-Qi Hu, Yufeng Li, Wei-Wei Tu, Qiang Yang, Yang Yu:
Taking Human out of Learning Applications: A Survey on Automated Machine Learning. CoRR abs/1810.13306 (2018) - 2017
- [c44]Hong Qian, Yang Yu:
Solving High-Dimensional Multi-Objective Optimization Problems with Low Effective Dimensions. AAAI 2017: 875-881 - [c43]Yi-Qi Hu, Hong Qian, Yang Yu:
Sequential Classification-Based Optimization for Direct Policy Search. AAAI 2017: 2029-2035 - [c42]Jing-Cheng Shi, Chao Qian, Yang Yu:
Evolutionary multi-objective optimization made faster by sequential decomposition. CEC 2017: 2488-2493 - [c41]Chao Qian, Jing-Cheng Shi, Yang Yu, Ke Tang, Zhi-Hua Zhou:
Optimizing Ratio of Monotone Set Functions. IJCAI 2017: 2606-2612 - [c40]Chao Qian, Jing-Cheng Shi, Yang Yu, Ke Tang:
On Subset Selection with General Cost Constraints. IJCAI 2017: 2613-2619 - [c39]Jing-Wen Yang, Yang Yu, Xiao-Peng Zhang:
Life-Stage Modeling by Customer-Manifold Embedding. IJCAI 2017: 3259-3265 - [c38]Yang Yu, Wei-Yang Qu, Nan Li, Zimin Guo:
Open Category Classification by Adversarial Sample Generation. IJCAI 2017: 3357-3363 - [c37]Wen-Ji Zhou
, Yang Yu, Min-Ling Zhang
:
Binary Linear Compression for Multi-label Classification. IJCAI 2017: 3546-3552 - [c36]Jianbing Zhang, Yixin Sun, Shujian Huang, Cam-Tu Nguyen, Xiaoliang Wang, Xinyu Dai, Jiajun Chen, Yang Yu:
AGRA: An Analysis-Generation-Ranking Framework for Automatic Abbreviation from Paper Titles. IJCAI 2017: 4221-4227 - [c35]Chao Qian, Jing-Cheng Shi, Yang Yu, Ke Tang, Zhi-Hua Zhou:
Subset Selection under Noise. NIPS 2017: 3560-3570 - [i7]Yang Yu, Wei-Yang Qu, Nan Li, Zimin Guo:
Open-Category Classification by Adversarial Sample Generation. CoRR abs/1705.08722 (2017) - [i6]Chao Qian, Yang Yu, Ke Tang, Xin Yao, Zhi-Hua Zhou:
Maximizing Non-monotone/Non-submodular Functions by Multi-objective Evolutionary Algorithms. CoRR abs/1711.07214 (2017) - 2016
- [c34]Hong Qian, Yang Yu:
Scaling Simultaneous Optimistic Optimization for High-Dimensional Non-Convex Functions with Low Effective Dimensions. AAAI 2016: 2000-2006 - [c33]Yang Yu, Hong Qian, Yi-Qi Hu:
Derivative-Free Optimization via Classification. AAAI 2016: 2286-2292 - [c32]Yang Yu, Peng-Fei Hou, Qing Da, Yu Qian:
Boosting Nonparametric Policies. AAMAS 2016: 477-484 - [c31]Yi-Qi Hu, Yang Yu:
A Multi-task Learning Approach by Combining Derivative-Free and Gradient Methods. BIC-TA (1) 2016: 456-465 - [c30]Hong Qian, Yang Yu:
On sampling-and-classification optimization in discrete domains. CEC 2016: 4374-4381 - [c29]Chao Qian, Yang Yu, Zhi-Hua Zhou:
A Lower Bound Analysis of Population-Based Evolutionary Algorithms for Pseudo-Boolean Functions. IDEAL 2016: 457-467 - [c28]Chao Qian, Jing-Cheng Shi, Yang Yu, Ke Tang, Zhi-Hua Zhou:
Parallel Pareto Optimization for Subset Selection. IJCAI 2016: 1939-1945 - [c27]Hong Qian, Yi-Qi Hu, Yang Yu:
Derivative-Free Optimization of High-Dimensional Non-Convex Functions by Sequential Random Embeddings. IJCAI 2016: 1946-1952 - [c26]Xin Li, Yongjuan Liang, Hong Qian, Yi-Qi Hu, Lei Bu
, Yang Yu, Xin Chen, Xuandong Li:
Symbolic execution of complex program driven by machine learning based constraint solving. ASE 2016: 554-559 - [c25]Han Wang, Yang Yu:
Exploring Multi-action Relationship in Reinforcement Learning. PRICAI 2016: 574-587 - [i5]Chao Qian, Yang Yu, Zhi-Hua Zhou:
A Lower Bound Analysis of Population-based Evolutionary Algorithms for Pseudo-Boolean Functions. CoRR abs/1606.03326 (2016) - 2015
- [j11]Chao Qian, Yang Yu, Zhi-Hua Zhou:
Variable solution structure can be helpful in evolutionary optimization. Sci. China Inf. Sci. 58(11): 1-17 (2015) - [j10]Chaoli Sun, Yaochu Jin
, Jianchao Zeng, Yang Yu:
A two-layer surrogate-assisted particle swarm optimization algorithm. Soft Comput. 19(6): 1461-1475 (2015) - [j9]Yang Yu, Chao Qian, Zhi-Hua Zhou:
Switch Analysis for Running Time Analysis of Evolutionary Algorithms. IEEE Trans. Evol. Comput. 19(6): 777-792 (2015) - [c24]Chao Qian, Yang Yu, Zhi-Hua Zhou:
Pareto Ensemble Pruning. AAAI 2015: 2935-2941 - [c23]Yang Yu, Chao Qian:
Running time analysis: Convergence-based analysis reduces to switch analysis. CEC 2015: 2603-2610 - [c22]Chao Qian, Yang Yu, Zhi-Hua Zhou:
On Constrained Boolean Pareto Optimization. IJCAI 2015: 389-395 - [c21]Chao Qian, Yang Yu, Zhi-Hua Zhou:
Subset Selection by Pareto Optimization. NIPS 2015: 1774-1782 - 2014
- [c20]Qing Da, Yang Yu, Zhi-Hua Zhou:
Learning with Augmented Class by Exploiting Unlabeled Data. AAAI 2014: 1760-1766 - [c19]Qing Da, Yang Yu, Zhi-Hua Zhou:
Napping for functional representation of policy. AAMAS 2014: 189-196 - [c18]Yang Yu, Hong Qian:
The sampling-and-learning framework: A statistical view of evolutionary algorithms. IEEE Congress on Evolutionary Computation 2014: 149-158 - [c17]Chao Qian, Yang Yu, Yaochu Jin, Zhi-Hua Zhou:
On the Effectiveness of Sampling for Evolutionary Optimization in Noisy Environments. PPSN 2014: 302-311 - [i4]Yang Yu, Hong Qian:
The Sampling-and-Learning Framework: A Statistical View of Evolutionary Algorithms. CoRR abs/1401.6333 (2014) - 2013
- [j8]Chao Qian, Yang Yu, Zhi-Hua Zhou:
An analysis on recombination in multi-objective evolutionary optimization. Artif. Intell. 204: 99-119 (2013) - [c16]Yang Yu, Xin Yao, Zhi-Hua Zhou:
On the Approximation Ability of Evolutionary Optimization with Application to Minimum Set Cover: Extended Abstract. IJCAI 2013: 3190-3194 - [c15]Qing Da, Yang Yu, Zhi-Hua Zhou:
Self-Practice Imitation Learning from Weak Policy. PSL 2013: 9-20 - [i3]Chao Qian, Yang Yu, Zhi-Hua Zhou:
Analyzing Evolutionary Optimization in Noisy Environments. CoRR abs/1311.4987 (2013) - 2012
- [j7]Yang Yu, Xin Yao
, Zhi-Hua Zhou:
On the approximation ability of evolutionary optimization with application to minimum set cover. Artif. Intell. 180-181: 20-33 (2012) - [c14]Sheng-Jun Huang, Yang Yu, Zhi-Hua Zhou:
Multi-label hypothesis reuse. KDD 2012: 525-533 - [c13]Nan Li, Yang Yu, Zhi-Hua Zhou:
Diversity Regularized Ensemble Pruning. ECML/PKDD (1) 2012: 330-345 - [c12]Chao Qian, Yang Yu, Zhi-Hua Zhou:
On Algorithm-Dependent Boundary Case Identification for Problem Classes. PPSN (1) 2012: 62-71 - 2011
- [c11]Chao Qian, Yang Yu, Zhi-Hua Zhou:
Collisions are helpful for computing unique input-output sequences. GECCO (Companion) 2011: 265-266 - [c10]Chao Qian, Yang Yu, Zhi-Hua Zhou:
An analysis on recombination in multi-objective evolutionary optimization. GECCO 2011: 2051-2058 - [c9]Wang-Zhou Dai, Yang Yu, Zhi-Hua Zhou:
Lifted-Rollout for Approximate Policy Iteration of Markov Decision Process. ICDM Workshops 2011: 689-696 - [c8]Yang Yu, Yufeng Li, Zhi-Hua Zhou:
Diversity Regularized Machine. IJCAI 2011: 1603-1608 - [i2]Yang Yu, Chao Qian, Zhi-Hua Zhou:
Towards Analyzing Crossover Operators in Evolutionary Search via General Markov Chain Switching Theorem. CoRR abs/1111.0907 (2011) - 2010
- [j6]Yang Yu, Zhi-Hua Zhou:
A framework for modeling positive class expansion with single snapshot. Knowl. Inf. Syst. 25(2): 211-227 (2010) - [c7]Yang Yu, Chao Qian, Zhi-Hua Zhou:
Towards Analyzing Recombination Operators in Evolutionary Search. PPSN (1) 2010: 144-153 - [i1]Yang Yu, Xin Yao, Zhi-Hua Zhou:
Evolutionary Algorithms as Guaranteed Approximation Optimizers. CoRR abs/1011.4028 (2010)
2000 – 2009
- 2009
- [c6]Nan Li, Yang Yu, Zhi-Hua Zhou:
Semi-naive Exploitation of One-Dependence Estimators. ICDM 2009: 278-287 - 2008
- [j5]Yang Yu, Zhi-Hua Zhou:
A new approach to estimating the expected first hitting time of evolutionary algorithms. Artif. Intell. 172(15): 1809-1832 (2008) - [j4]Fei Tony Liu, Kai Ming Ting, Yang Yu, Zhi-Hua Zhou:
Spectrum of Variable-Random Trees. J. Artif. Intell. Res. 32: 355-384 (2008) - [c5]Yang Yu, Zhi-Hua Zhou:
On the usefulness of infeasible solutions in evolutionary search: A theoretical study. IEEE Congress on Evolutionary Computation 2008: 835-840 - [c4]Li-Ping Liu, Yang Yu, Yuan Jiang, Zhi-Hua Zhou:
TEFE: A Time-Efficient Approach to Feature Extraction. ICDM 2008: 423-432 - [c3]Yang Yu, Zhi-Hua Zhou:
A Framework for Modeling Positive Class Expansion with Single Snapshot. PAKDD 2008: 429-440 - 2007
- [j3]Yang Yu, De-Chuan Zhan, Xu-Ying Liu, Ming Li, Zhi-Hua Zhou:
Predicting Future Customers via Ensembling Gradually Expanded Trees. Int. J. Data Warehous. Min. 3(2): 12-21 (2007) - [c2]Yang Yu, Zhi-Hua Zhou, Kai Ming Ting:
Cocktail Ensemble for Regression. ICDM 2007: 721-726 - 2006
- [c1]Yang Yu, Zhi-Hua Zhou:
A New Approach to Estimating the Expected First Hitting Time of Evolutionary Algorithms. AAAI 2006: 555-560 - 2005
- [j2]Zhi-Hua Zhou, Yang Yu:
Adapt Bagging to Nearest Neighbor Classifiers. J. Comput. Sci. Technol. 20(1): 48-54 (2005) - [j1]Zhi-Hua Zhou, Yang Yu:
Ensembling local learners ThroughMultimodal perturbation. IEEE Trans. Syst. Man Cybern. Part B 35(4): 725-735 (2005)
Coauthor Index

manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.
Unpaywalled article links
Add open access links from to the list of external document links (if available).
Privacy notice: By enabling the option above, your browser will contact the API of unpaywall.org to load hyperlinks to open access articles. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Unpaywall privacy policy.
Archived links via Wayback Machine
For web page which are no longer available, try to retrieve content from the of the Internet Archive (if available).
Privacy notice: By enabling the option above, your browser will contact the API of archive.org to check for archived content of web pages that are no longer available. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Internet Archive privacy policy.
Reference lists
Add a list of references from ,
, and
to record detail pages.
load references from crossref.org and opencitations.net
Privacy notice: By enabling the option above, your browser will contact the APIs of crossref.org, opencitations.net, and semanticscholar.org to load article reference information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Crossref privacy policy and the OpenCitations privacy policy, as well as the AI2 Privacy Policy covering Semantic Scholar.
Citation data
Add a list of citing articles from and
to record detail pages.
load citations from opencitations.net
Privacy notice: By enabling the option above, your browser will contact the API of opencitations.net and semanticscholar.org to load citation information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the OpenCitations privacy policy as well as the AI2 Privacy Policy covering Semantic Scholar.
OpenAlex data
Load additional information about publications from .
Privacy notice: By enabling the option above, your browser will contact the API of openalex.org to load additional information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the information given by OpenAlex.
last updated on 2025-03-24 00:27 CET by the dblp team
all metadata released as open data under CC0 1.0 license
see also: Terms of Use | Privacy Policy | Imprint