Explore chapters and articles related to this topic
A multi-objective reinforcement learning approach for resequencing scheduling problems in automotive manufacturing systems
Published in International Journal of Production Research, 2023
Jinling Leng, Xingyuan Wang, Shiping Wu, Chun Jin, Meng Tang, Rui Liu, Alexander Vogl, Huiyu Liu
Second, MRSP is described as a Multi-objective Markov Decision Process (MOMDP) and a MORL-based MODQN algorithm is proposed to obtain the Pareto frontier set from a multi-policy network. A MODQN agent implements a shaped reward and preference generation approach that obeys a two-dimensional folded normal distribution (2dFND) to minimize the CC and ST of the MRSP.