Google Scholar

User profiles for Alan Fern

Alan Fern

Oregon State University

Verified email at oregonstate.edu

Cited by 10436

[PDF] aaai.org

[PDF][PDF] FF-Replan: A Baseline for Probabilistic Planning.

SW Yoon, A Fern, R Givan - ICAPS, 2007 - cdn.aaai.org

FF-Replan was the winner of the 2004 International Probabilistic Planning Competition (IPPC-04)(Younes
& Littman 2004a) and was also the top performer on IPPC-06 domains, …

Save Cite Cited by 462 Related articles All 6 versions View as HTML

[PDF] mlr.press

Visualizing and understanding atari agents

…, A Koul, J Dodge, A Fern - … conference on machine …, 2018 - proceedings.mlr.press

While deep reinforcement learning (deep RL) agents are effective at maximizing rewards, it
is often unclear what strategies they use to do so. In this paper, we take a step toward …

Save Cite Cited by 445 Related articles All 12 versions View as HTML

[PDF] academia.edu

Multi-task reinforcement learning: a hierarchical bayesian approach

A Wilson, A Fern, S Ray, P Tadepalli - Proceedings of the 24th …, 2007 - dl.acm.org

We consider the problem of multi-task reinforcement learning, where the agent needs to
solve a sequence of Markov Decision Processes (MDPs) chosen randomly from a fixed but …

Save Cite Cited by 412 Related articles All 16 versions

[PDF] arxiv.org

Blind bipedal stair traversal via sim-to-real reinforcement learning

J Siekmann, K Green, J Warila, A Fern… - arXiv preprint arXiv …, 2021 - arxiv.org

Accurate and precise terrain estimation is a difficult problem for robot locomotion in real-world
environments. Thus, it is useful to have systems that do not depend on accurate estimation …

Save Cite Cited by 222 Related articles All 8 versions View as HTML

Related searches

[PDF] psu.edu

Discriminatively trained particle filters for complex multi-object tracking

R Hess, A Fern - 2009 IEEE conference on computer vision …, 2009 - ieeexplore.ieee.org

This work presents a discriminative training method for particle filters in the context of multi-object
tracking. We are motivated by the difficulty of hand-tuning the many model parameters …

Save Cite Cited by 195 Related articles All 9 versions

[PDF] jair.org

A decision-theoretic model of assistance

A Fern, S Natarajan, K Judah, P Tadepalli - Journal of Artificial Intelligence …, 2014 - jair.org

There is a growing interest in intelligent assistants for a variety of applications from sorting
email to helping people with disabilities to do their daily chores. In this paper, we formulate the …

Save Cite Cited by 136 Related articles All 20 versions View as HTML

[PDF] neurips.cc

Approximate policy iteration with a policy language bias

A Fern, S Yoon, R Givan - Advances in neural information …, 2003 - proceedings.neurips.cc

We explore approximate policy iteration, replacing the usual costfunction learning step with
a learning step in policy space. We give policy-language biases that enable solution of very …

Save Cite Cited by 276 Related articles All 20 versions View as HTML

[PDF] nsf.gov

[PDF][PDF] Explainable reinforcement learning via reward decomposition

Z Juozapaitis, A Koul, A Fern, M Erwig… - IJCAI/ECAI Workshop on …, 2019 - par.nsf.gov

We study reward decomposition for explaining the decisions of reinforcement learning (RL)
agents. The approach decomposes rewards into sums of semantically meaningful reward …

Save Cite Cited by 259 Related articles All 7 versions View as HTML

[PDF] neurips.cc

A bayesian approach for policy learning from trajectory preference queries

A Wilson, A Fern, P Tadepalli - Advances in neural …, 2012 - proceedings.neurips.cc

We consider the problem of learning control policies via trajectory preference queries to an
expert. In particular, the learning agent can present an expert with short runs of a pair of …

Save Cite Cited by 229 Related articles All 8 versions View as HTML

[PDF] roboticsproceedings.org

[PDF][PDF] Fast Online Trajectory Optimization for the Bipedal Robot Cassie.

T Apgar, P Clary, K Green, A Fern… - … Science and Systems, 2018 - roboticsproceedings.org

We apply fast online trajectory optimization for multi-step motion planning to Cassie, a
bipedal robot designed to exploit natural spring-mass locomotion dynamics using lightweight, …

Save Cite Cited by 186 Related articles All 6 versions View as HTML

Create alert

Cite

Advanced search

Saved to My library