Please use this identifier to cite or link to this item: https://dspace.ncfu.ru/handle/20.500.12258/18122
Title: Method for reconfiguring the kinematic structure of a mechatronic-modular robot in non-deterministic conditions
Authors: Petrenko, V. I.
Петренко, В. И.
Tebueva, F. B.
Тебуева, Ф. Б.
Pavlov, A. S.
Павлов, А. С.
Gurchinsky, M. M.
Гурчинский, М. М.
Keywords: Modular robotic;Q-learning;Reinforcement learning;Path planning;Multi-agent systems
Issue Date: 2020
Publisher: ATLANTIS PRESS
Citation: Petrenko, V. I.; Tebueva, F. B.; Pavlov A. S.; Gurchinsky, M. M. Method for reconfiguring the kinematic structure of a mechatronic-modular robot in non-deterministic conditions // PROCEEDINGS OF THE 8TH SCIENTIFIC CONFERENCE ON INFORMATION TECHNOLOGIES FOR INTELLIGENT DECISION MAKING SUPPORT (ITIDS 2020). - 2020. - Book Series: Advances in Intelligent Systems Research. - Volume 174. - Page 347-352
Series/Report no.: PROCEEDINGS OF THE 8TH SCIENTIFIC CONFERENCE ON INFORMATION TECHNOLOGIES FOR INTELLIGENT DECISION MAKING SUPPORT (ITIDS 2020)
Abstract: Modular robots, consisting of many identical modules, are one of the most difficult areas of robotics. Each newly added element changes the shape and capabilities of the end device, for example, adds functionality or allows the robot to move in new planes. The reconfiguration of the kinematic structure is a sequence of movements of each robot module from the initial position of the initial configuration to the final position of the desired configuration. The paper considers a method for reconfiguration the kinematic structure of a mechatronic-modular robot using reinforcement learning. The proposed method will be built on the basis of a learning algorithm, where the information for training will be the actions taken and the "reward" is a value characterizing the quality of the robot's completion of the target task. The purpose of the training is to build a control algorithm that maximizes the total reward for a certain period of time. The effectiveness of the learning algorithm was tested by computer simulation of a robot, consisting of 5, 10 and 15 modules, in the formation of the target configuration
URI: http://hdl.handle.net/20.500.12258/18122
Appears in Collections:Статьи, проиндексированные в SCOPUS, WOS

Files in This Item:
File SizeFormat 
WoS 1186 .pdf
  Restricted Access
191.04 kBAdobe PDFView/Open


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.