User profiles for Pavel Osinenko

Pavel Osinenko

Professor (Assistant), Skolkovo Institute of Science and Technology
Verified email at skoltech.ru
Cited by 413

Reinforcement learning with guarantees: a review

P Osinenko, D Dobriborsci, W Aumer - IFAC-PapersOnLine, 2022 - Elsevier
Reinforcement learning is concerned with a generic concept of an agent acting in an
environment. From the control theory standpoint, reinforcement learning may be considered as an …

A method of optimal traction control for farm tractors with feedback of drive torque

PV Osinenko, M Geissler, T Herlitzius - Biosystems engineering, 2015 - Elsevier
Traction efficiency of farm tractors barely reaches 50% in field operations (Renius et al., 1985).
On the other hand, modern trends in agriculture show growth of the global tractor markets …

An actor-critic framework for online control with environment stability guarantee

P Osinenko, G Yaremenko, G Malaniya… - IEEE Access, 2023 - ieeexplore.ieee.org
Online actor-critic reinforcement learning is concerned with training an agent on-the-fly via
dynamic interaction with the environment. Due to the specifics of the application, it is not …

On Stochastic Stabilization via Nonsmooth Control Lyapunov Functions

P Osinenko, G Yaremenko… - IEEE Transactions on …, 2022 - ieeexplore.ieee.org
Control Lyapunov function is a central tool in stabilization. It generalizes an abstract energy
function—a Lyapunov function—to the case of controlled systems. It is a known fact that most …

An approach to improve agent learning via guaranteeing goal reaching in all episodes

P Osinenko, G Yaremenko, G Malaniya… - arXiv preprint arXiv …, 2024 - arxiv.org
Reinforcement learning is commonly concerned with problems of maximizing accumulated
rewards in Markov decision processes. Oftentimes, a certain goal state or a subset of the …

Application of non-destructive sensors and big data analysis to predict physiological storage disorders and fruit firmness in 'Braeburn'apples

P Osinenko, K Biegert, RJ McCormick, T Göhrt… - … and Electronics in …, 2021 - Elsevier
Physiological storage disorders affect a range of commercially important pomefruit and result
in fruit losses and wastage of resources. Disorders can develop during and/or after storage …

Optimal traction control for heavy-duty vehicles

P Osinenko, S Streif - Control Engineering Practice, 2017 - Elsevier
Heavy-duty vehicles such as tractors, bulldozers, certain construction and municipal
vehicles, soil millers, forestry machinery etc. have a high demand for propulsion force and …

Comprehensive overview of reward engineering and shaping in advancing reinforcement learning applications

S Ibrahim, M Mostafa, A Jnadi, P Osinenko - arXiv preprint arXiv …, 2024 - arxiv.org
The aim of Reinforcement Learning (RL) in real-world applications is to create systems
capable of making autonomous decisions by learning from their environment through trial and …

A reinforcement learning method with closed-loop stability guarantee

P Osinenko, L Beckenbach, T Göhrt, S Streif - IFAC-PapersOnLine, 2020 - Elsevier
Reinforcement learning (RL) in the context of control systems offers wide possibilities of
controller adaptation. Given an infinite-horizon cost function, the so-called critic of RL …

Optimal flow factor determination in vanadium redox flow battery control

…, S Parsegov, M Pugach, A Polyakov, P Osinenko… - IEEE …, 2024 - ieeexplore.ieee.org
The optimization of vanadium redox flow batteries (VRFBs) is closely related to the flow rate
control: a proper regulation of the electrolyte flow rate reduces losses and prolongs battery …