Papers
arxiv:2605.02461

Middle-mile logistics through the lens of goal-conditioned reinforcement learning

Published on May 4
Authors:
,
,
,

Abstract

Multi-object goal-conditioned MDP reformulation of middle-mile logistics integrates graph neural networks with model-free reinforcement learning using feature graphs extracted from environmental states.

AI-generated summary

Middle-mile logistics describes the problem of routing parcels through a network of hubs linked by trucks with finite capacity. We rephrase this as a multi-object goal-conditioned MDP. Our method combines graph neural networks with model-free RL, extracting small feature graphs from the environment state.

Community

Sign up or log in to comment

Models citing this paper 0

No model linking this paper

Cite arxiv.org/abs/2605.02461 in a model README.md to link it from this page.

Datasets citing this paper 0

No dataset linking this paper

Cite arxiv.org/abs/2605.02461 in a dataset README.md to link it from this page.

Spaces citing this paper 0

No Space linking this paper

Cite arxiv.org/abs/2605.02461 in a Space README.md to link it from this page.

Collections including this paper 0

No Collection including this paper

Add this paper to a collection to link it from this page.