default search action

combined dblp search
author search
venue search
publication search

ask others

Yaqi Duan

> Home > Persons

Person information

Refine list

refinements active!

zoomed in on ?? of ?? records

view refined list in

export refined list as

showing all ?? records

2020 – today

see FAQ

What is the meaning of the colors in the publication lists?

2024
[j5]
- view
  authority control:
- export record
  dblp key:
  - journals/gpb/HuoDZXZCSWCGXLDMZZLKCQW24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/gpb/HuoDZXZCSWCGXLDMZZLKCQW24
Zitian Huo, Yaqi Duan, Dongdong Zhan, Xizhen Xu, Nairen Zheng, Jing Cai, Ruifang Sun, Jianping Wang, Fang Cheng, Zhan Gao, Caixia Xu, Wanlin Liu, Yuting Dong, Sailong Ma, Qian Zhang, Yiyun Zheng, Liping Lou, Dong Kuang, Qian Chu, Jun Qin, Guoping Wang, Yi Wang:
Proteomic Stratification of Prognosis and Treatment Options for Small Cell Lung Cancer. Genom. Proteom. Bioinform. 22(1) (2024)
[i13]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2401-05233
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2401-05233
Yaqi Duan, Martin J. Wainwright:
Taming "data-hungry" reinforcement learning? Stability in continuous state-action spaces. CoRR abs/2401.05233 (2024)
2023
[j4]
- view
  - electronic edition @ jmlr.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/jmlr/NiDDWZ23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/jmlr/NiDDWZ23
Chengzhuo Ni, Yaqi Duan, Munther A. Dahleh, Mengdi Wang, Anru R. Zhang:
Learning Good State and Action Representations for Markov Decision Process via Tensor Decomposition. J. Mach. Learn. Res. 24: 115:1-115:53 (2023)
[j3]
- view
  authority control:
- export record
  dblp key:
  - journals/tvcg/MaoDHDLH23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/tvcg/MaoDHDLH23
Aihua Mao, Zihui Du, Junhui Hou, Yaqi Duan, Yong-Jin Liu, Ying He:
PU-Flow: A Point Cloud Upsampling Network With Normalizing Flows. IEEE Trans. Vis. Comput. Graph. 29(12): 4964-4977 (2023)
[c10]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/ijcai/MaoDWDCL23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/ijcai/MaoDWDCL23
Aihua Mao, Yaqi Duan, Yu-Hui Wen, Zihui Du, Hongmin Cai, Yong-Jin Liu:
Invertible Residual Neural Networks with Conditional Injector and Interpolator for Point Cloud Upsampling. IJCAI 2023: 1267-1275
[c9]
- view
  - electronic edition @ mlr.press (open access)
  - details & citations
- export record
  dblp key:
  - conf/l4dc/DuanW23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/l4dc/DuanW23
Yaqi Duan, Martin J. Wainwright:
A finite-sample analysis of multi-step temporal difference estimates. L4DC 2023: 612-624
2022
[b1]
- view
  - electronic edition @ princeton.edu
  - details & citations
- export record
  dblp key:
  - phd/us/Duan22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/phd/us/Duan22
Yaqi Duan:
Policy Evaluation in Batch Reinforcement Learning. Princeton University, USA, 2022
[j2]
- view
  authority control:
- export record
  dblp key:
  - journals/kbs/DuanCZHFXX22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/kbs/DuanCZHFXX22
Yaqi Duan, Jinglong Chen, Tianci Zhang, Shuilong He, Yong Feng, Jingsong Xie, Wenrong Xiao:
High-temperature augmented neighborhood metric learning for cross-domain fault diagnosis with imbalanced data. Knowl. Based Syst. 257: 109930 (2022)
[c8]
- view
  - electronic edition @ openreview.net (open access)
  - details & citations
- export record
  dblp key:
  - conf/iclr/YinDW022
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iclr/YinDW022
Ming Yin, Yaqi Duan, Mengdi Wang, Yu-Xiang Wang:
Near-optimal Offline Reinforcement Learning with Linear Representation: Leveraging Variance Information with Pessimism. ICLR 2022
[i12]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2202-05250
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2202-05250
Yaqi Duan, Kaizheng Wang:
Adaptive and Robust Multi-task Learning. CoRR abs/2202.05250 (2022)
[i11]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2203-05804
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2203-05804
Ming Yin, Yaqi Duan, Mengdi Wang, Yu-Xiang Wang:
Near-optimal Offline Reinforcement Learning with Linear Representation: Leveraging Variance Information with Pessimism. CoRR abs/2203.05804 (2022)
[i10]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2211-03899
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2211-03899
Yaqi Duan, Martin J. Wainwright:
Policy evaluation from a single path: Multi-step methods, mixing and mis-specification. CoRR abs/2211.03899 (2022)
2021
[c7]
- view
  - electronic edition @ mlr.press (open access)
  - details & citations
- export record
  dblp key:
  - conf/icml/DuanJL21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icml/DuanJL21
Yaqi Duan, Chi Jin, Zhiyuan Li:
Risk Bounds and Rademacher Complexity in Batch Reinforcement Learning. ICML 2021: 2892-2902
[c6]
- view
  - electronic edition @ mlr.press (open access)
  - details & citations
- export record
  dblp key:
  - conf/icml/HaoDLSW21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icml/HaoDLSW21
Botao Hao, Yaqi Duan, Tor Lattimore, Csaba Szepesvári, Mengdi Wang:
Sparse Feature Selection Makes Batch Reinforcement Learning More Sample Efficient. ICML 2021: 4063-4073
[c5]
- view
  - electronic edition @ mlr.press (open access)
  - details & citations
- export record
  dblp key:
  - conf/icml/HaoJDLSW21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icml/HaoJDLSW21
Botao Hao, Xiang Ji, Yaqi Duan, Hao Lu, Csaba Szepesvári, Mengdi Wang:
Bootstrapping Fitted Q-Evaluation for Off-Policy Inference. ICML 2021: 4074-4084
[c4]
- view
  authority control:
- export record
  dblp key:
  - conf/isit/NiZDW21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/isit/NiZDW21
Chengzhuo Ni, Anru R. Zhang, Yaqi Duan, Mengdi Wang:
Learning Good State and Action Representations via Tensor Decomposition. ISIT 2021: 1682-1687
[i9]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2102-03607
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2102-03607
Botao Hao, Xiang Ji, Yaqi Duan, Hao Lu, Csaba Szepesvári, Mengdi Wang:
Bootstrapping Statistical Inference for Off-Policy Evaluation. CoRR abs/2102.03607 (2021)
[i8]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2103-13883
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2103-13883
Yaqi Duan, Chi Jin, Zhiyuan Li:
Risk Bounds and Rademacher Complexity in Batch Reinforcement Learning. CoRR abs/2103.13883 (2021)
[i7]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2105-01136
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2105-01136
Chengzhuo Ni, Anru Zhang, Yaqi Duan, Mengdi Wang:
Learning Good State and Action Representations via Tensor Decomposition. CoRR abs/2105.01136 (2021)
[i6]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2107-05893
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2107-05893
Aihua Mao, Zihui Du, Junhui Hou, Yaqi Duan, Yong-Jin Liu, Ying He:
PU-Flow: a Point Cloud Upsampling Networkwith Normalizing Flows. CoRR abs/2107.05893 (2021)
[i5]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2109-12002
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2109-12002
Yaqi Duan, Mengdi Wang, Martin J. Wainwright:
Optimal policy evaluation using kernel-based temporal difference methods. CoRR abs/2109.12002 (2021)
2020
[j1]
- view
  authority control:
- export record
  dblp key:
  - journals/siammax/DuanWWY20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/siammax/DuanWWY20
Yaqi Duan, Mengdi Wang, Zaiwen Wen, Yaxiang Yuan:
Adaptive Low-Nonnegative-Rank Approximation for State Aggregation of Markov Chains. SIAM J. Matrix Anal. Appl. 41(1): 244-278 (2020)
[c3]
- view
  - electronic edition @ mlr.press (open access)
  - details & citations
- export record
  dblp key:
  - conf/icml/DuanJW20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icml/DuanJW20
Yaqi Duan, Zeyu Jia, Mengdi Wang:
Minimax-Optimal Off-Policy Evaluation with Linear Function Approximation. ICML 2020: 2701-2709
[i4]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2002-09516
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2002-09516
Yaqi Duan, Mengdi Wang:
Minimax-Optimal Off-Policy Evaluation with Linear Function Approximation. CoRR abs/2002.09516 (2020)
[i3]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2011-04019
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2011-04019
Botao Hao, Yaqi Duan, Tor Lattimore, Csaba Szepesvári, Mengdi Wang:
Sparse Feature Selection Makes Batch Reinforcement Learning More Sample Efficient. CoRR abs/2011.04019 (2020)

2010 – 2019

see FAQ

What is the meaning of the colors in the publication lists?

2019
[c2]
- view
- export record
  dblp key:
  - conf/nips/DuanKW19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/nips/DuanKW19
Yaqi Duan, Zheng Tracy Ke, Mengdi Wang:
State Aggregation Learning from Markov Transition Data. NeurIPS 2019: 4488-4497
[c1]
- view
- export record
  dblp key:
  - conf/nips/SunDGW19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/nips/SunDGW19
Yifan Sun, Yaqi Duan, Hao Gong, Mengdi Wang:
Learning low-dimensional state embeddings and metastable clusters from time series data. NeurIPS 2019: 4563-4572
[i2]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1906-00302
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1906-00302
Yifan Sun, Yaqi Duan, Hao Gong, Mengdi Wang:
Learning low-dimensional state embeddings and metastable clusters from time series data. CoRR abs/1906.00302 (2019)
2018
[i1]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1811-02619
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1811-02619
Yaqi Duan, Zheng Tracy Ke, Mengdi Wang:
State Aggregation Learning from Markov Transition Data. CoRR abs/1811.02619 (2018)

Coauthor Index

see FAQ

manage site settings

To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.