• Medientyp: E-Artikel
  • Titel: Scheduling in Multiagent Systems Using Reinforcement Learning
  • Beteiligte: Minashina, I. K.; Gorbachev, R. A.; Zakharova, E. M.
  • Erschienen: Pleiades Publishing Ltd, 2022
  • Erschienen in: Doklady Mathematics, 106 (2022) S1, Seite S70-S78
  • Sprache: Englisch
  • DOI: 10.1134/s1064562422060175
  • ISSN: 1064-5624; 1531-8362
  • Schlagwörter: General Mathematics
  • Entstehung:
  • Anmerkungen:
  • Beschreibung: Abstract The paper is devoted to scheduling in multiagent systems in the framework of the Flatland 3 competition. The main aim of this competition is to develop an algorithm for the effective control of dense traffic in complex railroad networks according to a given schedule. The proposed solution is based on reinforcement learning. To adapt this method to the particular scheduling problem, a novel approach based on structuring the reward function that stimulates an agent to adhere to its schedule was developed. The architecture of the proposed model is based on a multiagent version of centralized critic with proximal policy optimization (PPO) learning. In addition, a curriculum learning strategy was developed and implemented. This allowed the agent to cope with each level of complexity on time and train the model in more difficult conditions. The proposed solution won first place in the Flatland 3 competition in the reinforcement learning track.