User profiles for Pavel Osinenko
Pavel OsinenkoProfessor (Assistant), Skolkovo Institute of Science and Technology Verified email at skoltech.ru Cited by 413 |
Reinforcement learning with guarantees: a review
Reinforcement learning is concerned with a generic concept of an agent acting in an
environment. From the control theory standpoint, reinforcement learning may be considered as an …
environment. From the control theory standpoint, reinforcement learning may be considered as an …
A method of optimal traction control for farm tractors with feedback of drive torque
Traction efficiency of farm tractors barely reaches 50% in field operations (Renius et al., 1985).
On the other hand, modern trends in agriculture show growth of the global tractor markets …
On the other hand, modern trends in agriculture show growth of the global tractor markets …
An actor-critic framework for online control with environment stability guarantee
P Osinenko, G Yaremenko, G Malaniya… - IEEE Access, 2023 - ieeexplore.ieee.org
Online actor-critic reinforcement learning is concerned with training an agent on-the-fly via
dynamic interaction with the environment. Due to the specifics of the application, it is not …
dynamic interaction with the environment. Due to the specifics of the application, it is not …
On Stochastic Stabilization via Nonsmooth Control Lyapunov Functions
P Osinenko, G Yaremenko… - IEEE Transactions on …, 2022 - ieeexplore.ieee.org
Control Lyapunov function is a central tool in stabilization. It generalizes an abstract energy
function—a Lyapunov function—to the case of controlled systems. It is a known fact that most …
function—a Lyapunov function—to the case of controlled systems. It is a known fact that most …
An approach to improve agent learning via guaranteeing goal reaching in all episodes
P Osinenko, G Yaremenko, G Malaniya… - arXiv preprint arXiv …, 2024 - arxiv.org
Reinforcement learning is commonly concerned with problems of maximizing accumulated
rewards in Markov decision processes. Oftentimes, a certain goal state or a subset of the …
rewards in Markov decision processes. Oftentimes, a certain goal state or a subset of the …
Application of non-destructive sensors and big data analysis to predict physiological storage disorders and fruit firmness in 'Braeburn'apples
P Osinenko, K Biegert, RJ McCormick, T Göhrt… - … and Electronics in …, 2021 - Elsevier
Physiological storage disorders affect a range of commercially important pomefruit and result
in fruit losses and wastage of resources. Disorders can develop during and/or after storage …
in fruit losses and wastage of resources. Disorders can develop during and/or after storage …
Optimal traction control for heavy-duty vehicles
P Osinenko, S Streif - Control Engineering Practice, 2017 - Elsevier
Heavy-duty vehicles such as tractors, bulldozers, certain construction and municipal
vehicles, soil millers, forestry machinery etc. have a high demand for propulsion force and …
vehicles, soil millers, forestry machinery etc. have a high demand for propulsion force and …
Comprehensive overview of reward engineering and shaping in advancing reinforcement learning applications
The aim of Reinforcement Learning (RL) in real-world applications is to create systems
capable of making autonomous decisions by learning from their environment through trial and …
capable of making autonomous decisions by learning from their environment through trial and …
A reinforcement learning method with closed-loop stability guarantee
Reinforcement learning (RL) in the context of control systems offers wide possibilities of
controller adaptation. Given an infinite-horizon cost function, the so-called critic of RL …
controller adaptation. Given an infinite-horizon cost function, the so-called critic of RL …
Optimal flow factor determination in vanadium redox flow battery control
The optimization of vanadium redox flow batteries (VRFBs) is closely related to the flow rate
control: a proper regulation of the electrolyte flow rate reduces losses and prolongs battery …
control: a proper regulation of the electrolyte flow rate reduces losses and prolongs battery …