default search action

combined dblp search
author search
venue search
publication search

ask others

Ajin Joseph 0001

Ajin George Joseph

> Home > Persons

Person information

affiliation: University of Alberta, Edmonton, AB, Canada
affiliation (former): Indian Institute of Science, Bangalore, India

Refine list

refinements active!

zoomed in on ?? of ?? records

view refined list in

export refined list as

showing all ?? records

2020 – today

see FAQ

What is the meaning of the colors in the publication lists?

2023
[c11]
- view
  authority control:
- export record
  dblp key:
  - conf/cdc/SundarVJ23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/cdc/SundarVJ23
Poorna Syama Sundar, Manjunath Vasam, Ajin George Joseph:
Monotonic Model Improvement Self-Play Algorithm for Adversarial Games. CDC 2023: 5600-5605
[c10]
- view
  - electronic edition @ openreview.net (open access)
  - no references & citations available
- export record
  dblp key:
  - conf/iclr/NeumannLJP0W23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iclr/NeumannLJP0W23
Samuel Neumann, Sungsu Lim, Ajin George Joseph, Yangchen Pan, Adam White, Martha White:
Greedy Actor-Critic: A New Conditional Cross-Entropy Method for Policy Improvement. ICLR 2023

2010 – 2019

see FAQ

What is the meaning of the colors in the publication lists?

2019
[c9]
- view
  authority control:
- export record
  dblp key:
  - conf/allerton/JosephB19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/allerton/JosephB19
Ajin George Joseph, Shalabh Bhatnagar:
Stochastic Approximation Trackers for Model-Based Search. Allerton 2019: 741-748
[c8]
- view
  authority control:
- export record
  dblp key:
  - conf/cdc/JosephB19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/cdc/JosephB19
Ajin George Joseph, Shalabh Bhatnagar:
An Adaptive and Incremental Approach to Quantile Estimation. CDC 2019: 6025-6031
[c7]
- view
  - electronic edition @ openreview.net (open access)
  - no references & citations available
- export record
  dblp key:
  - conf/iclr/ChungNJW19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iclr/ChungNJW19
Wesley Chung, Somjit Nath, Ajin Joseph, Martha White:
Two-Timescale Networks for Nonlinear Value Function Approximation. ICLR (Poster) 2019
2018
[j2]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/ml/JosephB18
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/ml/JosephB18
Ajin George Joseph, Shalabh Bhatnagar:
An incremental off-policy search in a model-free Markov decision process using a single sample path. Mach. Learn. 107(6): 969-1011 (2018)
[j1]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/ml/JosephB18a
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/ml/JosephB18a
Ajin George Joseph, Shalabh Bhatnagar:
An online prediction algorithm for reinforcement learning with linear function approximation using cross entropy method. Mach. Learn. 107(8-10): 1385-1429 (2018)
[i5]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-1801-10287
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1801-10287
Ajin George Joseph, Shalabh Bhatnagar:
An Incremental Off-policy Search in a Model-free Markov Decision Process Using a Single Sample Path. CoRR abs/1801.10287 (2018)
[i4]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-1801-10291
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1801-10291
Ajin George Joseph, Shalabh Bhatnagar:
A Cross Entropy based Optimization Algorithm with Global Convergence Guarantees. CoRR abs/1801.10291 (2018)
[i3]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-1806-06720
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1806-06720
Ajin George Joseph, Shalabh Bhatnagar:
An Online Prediction Algorithm for Reinforcement Learning with Linear Function Approximation using Cross Entropy Method. CoRR abs/1806.06720 (2018)
[i2]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-1810-09103
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1810-09103
Sungsu Lim, Ajin Joseph, Lei Le, Yangchen Pan, Martha White:
Actor-Expert: A Framework for using Action-Value Methods in Continuous Action Spaces. CoRR abs/1810.09103 (2018)
2017
[c6]
- view
  authority control:
- export record
  dblp key:
  - conf/ijcnn/JosephB17
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/ijcnn/JosephB17
Ajin George Joseph, Shalabh Bhatnagar:
A model based search method for prediction in model-free Markov decision process. IJCNN 2017: 170-177
[c5]
- view
  authority control:
- export record
  dblp key:
  - conf/ijcnn/JosephB17a
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/ijcnn/JosephB17a
Ajin George Joseph, Shalabh Bhatnagar:
Bounds for off-policy prediction in reinforcement learning. IJCNN 2017: 3991-3997
[c4]
- view
  authority control:
- export record
  dblp key:
  - conf/premi/JosephB17
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/premi/JosephB17
Ajin George Joseph, Shalabh Bhatnagar:
An Incremental Fast Policy Search Using a Single Sample Path. PReMI 2017: 3-10
2016
[c3]
- view
  authority control:
- export record
  dblp key:
  - conf/ecai/JosephB16
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/ecai/JosephB16
Ajin George Joseph, Shalabh Bhatnagar:
Revisiting the Cross Entropy Method with Applications in Stochastic Global Optimization and Reinforcement Learning. ECAI 2016: 1026-1034
[c2]
- view
  authority control:
- export record
  dblp key:
  - conf/wsc/JosephB16
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/wsc/JosephB16
Ajin George Joseph, Shalabh Bhatnagar:
A randomized algorithm for continuous optimization. WSC 2016: 907-918
[i1]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/JosephB16
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/JosephB16
Ajin George Joseph, Shalabh Bhatnagar:
A Cross Entropy based Stochastic Approximation Algorithm for Reinforcement Learning with Linear Function Approximation. CoRR abs/1609.09449 (2016)
2015
[c1]
- view
  authority control:
- export record
  dblp key:
  - conf/iconip/JosephB15
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iconip/JosephB15
Ajin George Joseph, Shalabh Bhatnagar:
A Stochastic Approximation Algorithm for Quantile Estimation. ICONIP (2) 2015: 311-319

manage site settings

To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.