Use of Proximal Policy Optimization for the Joint Replenishment Problem

Nathalie Vanvuchelen*, Joren Gijsbrechts, Robert Boute

*Corresponding author for this work

Research output: Contribution to journalArticlepeer-review

74 Scopus citations

Abstract

Deep reinforcement learning has been coined as a promising research avenue to solve sequential decision-making problems, especially if few is known about the optimal policy structure. We apply the proximal policy optimization algorithm to the intractable joint replenishment problem. We demonstrate how the algorithm approaches the optimal policy structure and outperforms two other heuristics. Its deployment in supply chain control towers can orchestrate and facilitate collaborative shipping in the Physical Internet.

Original languageEnglish (US)
Article number103239
JournalComputers in Industry
Volume119
DOIs
StatePublished - Aug 2020

Keywords

  • Collaborative Shipping
  • Deep Reinforcement Learning
  • Joint Replenishment Problem
  • Machine Learning
  • Physical Internet
  • Proximal Policy Optimization

ASJC Scopus subject areas

  • General Computer Science
  • General Engineering

Fingerprint

Dive into the research topics of 'Use of Proximal Policy Optimization for the Joint Replenishment Problem'. Together they form a unique fingerprint.

Cite this