Google Scholar

User profiles for Pavel Osinenko

Pavel Osinenko

Professor (Assistant), Skolkovo Institute of Science and Technology

Verified email at skoltech.ru

Cited by 413

Reinforcement learning with guarantees: a review

P Osinenko, D Dobriborsci, W Aumer - IFAC-PapersOnLine, 2022 - Elsevier

Reinforcement learning is concerned with a generic concept of an agent acting in an
environment. From the control theory standpoint, reinforcement learning may be considered as an …

Save Cite Cited by 23 Related articles

A method of optimal traction control for farm tractors with feedback of drive torque

PV Osinenko, M Geissler, T Herlitzius - Biosystems engineering, 2015 - Elsevier

Traction efficiency of farm tractors barely reaches 50% in field operations (Renius et al., 1985).
On the other hand, modern trends in agriculture show growth of the global tractor markets …

Save Cite Cited by 75 Related articles All 7 versions

[PDF] ieee.org

An actor-critic framework for online control with environment stability guarantee

P Osinenko, G Yaremenko, G Malaniya… - IEEE Access, 2023 - ieeexplore.ieee.org

Online actor-critic reinforcement learning is concerned with training an agent on-the-fly via
dynamic interaction with the environment. Due to the specifics of the application, it is not …

Save Cite Cited by 1 Related articles All 2 versions

[PDF] arxiv.org

On Stochastic Stabilization via Nonsmooth Control Lyapunov Functions

P Osinenko, G Yaremenko… - IEEE Transactions on …, 2022 - ieeexplore.ieee.org

Control Lyapunov function is a central tool in stabilization. It generalizes an abstract energy
function—a Lyapunov function—to the case of controlled systems. It is a known fact that most …

Save Cite Cited by 7 Related articles All 4 versions

[PDF] arxiv.org

An approach to improve agent learning via guaranteeing goal reaching in all episodes

P Osinenko, G Yaremenko, G Malaniya… - arXiv preprint arXiv …, 2024 - arxiv.org

Reinforcement learning is commonly concerned with problems of maximizing accumulated
rewards in Markov decision processes. Oftentimes, a certain goal state or a subset of the …

Application of non-destructive sensors and big data analysis to predict physiological storage disorders and fruit firmness in 'Braeburn'apples

P Osinenko, K Biegert, RJ McCormick, T Göhrt… - … and Electronics in …, 2021 - Elsevier

Physiological storage disorders affect a range of commercially important pomefruit and result
in fruit losses and wastage of resources. Disorders can develop during and/or after storage …

Save Cite Cited by 24 Related articles All 6 versions

Optimal traction control for heavy-duty vehicles

P Osinenko, S Streif - Control Engineering Practice, 2017 - Elsevier

Heavy-duty vehicles such as tractors, bulldozers, certain construction and municipal
vehicles, soil millers, forestry machinery etc. have a high demand for propulsion force and …

Save Cite Cited by 29 Related articles All 2 versions

[PDF] arxiv.org

Comprehensive overview of reward engineering and shaping in advancing reinforcement learning applications

S Ibrahim, M Mostafa, A Jnadi, P Osinenko - arXiv preprint arXiv …, 2024 - arxiv.org

The aim of Reinforcement Learning (RL) in real-world applications is to create systems
capable of making autonomous decisions by learning from their environment through trial and …

Save Cite Cited by 1 Related articles All 3 versions View as HTML

[PDF] arxiv.org

A reinforcement learning method with closed-loop stability guarantee

P Osinenko, L Beckenbach, T Göhrt, S Streif - IFAC-PapersOnLine, 2020 - Elsevier

Reinforcement learning (RL) in the context of control systems offers wide possibilities of
controller adaptation. Given an infinite-horizon cost function, the so-called critic of RL …

Save Cite Cited by 14 Related articles All 5 versions

[PDF] ieee.org

Optimal flow factor determination in vanadium redox flow battery control

…, S Parsegov, M Pugach, A Polyakov, P Osinenko… - IEEE …, 2024 - ieeexplore.ieee.org

The optimization of vanadium redox flow batteries (VRFBs) is closely related to the flow rate
control: a proper regulation of the electrolyte flow rate reduces losses and prolongs battery …

Save Cite Cited by 4 Related articles All 2 versions

Create alert

Cite

Advanced search

Saved to My library

User profiles for Pavel Osinenko

Pavel Osinenko

Reinforcement learning with guarantees: a review

A method of optimal traction control for farm tractors with feedback of drive torque

An actor-critic framework for online control with environment stability guarantee

On Stochastic Stabilization via Nonsmooth Control Lyapunov Functions

An approach to improve agent learning via guaranteeing goal reaching in all episodes

Application of non-destructive sensors and big data analysis to predict physiological storage disorders and fruit firmness in 'Braeburn'apples

Optimal traction control for heavy-duty vehicles

Comprehensive overview of reward engineering and shaping in advancing reinforcement learning applications

A reinforcement learning method with closed-loop stability guarantee

Optimal flow factor determination in vanadium redox flow battery control