arxiv:2605.02461
Middle-mile logistics through the lens of goal-conditioned reinforcement learning
Published on May 4
AI-generated summary
The paper reformulates middle-mile logistics as a multi-object goal-conditioned MDP and combines graph neural networks with model-free reinforcement learning, using small feature graphs extracted from the environment state.
Abstract
Middle-mile logistics describes the problem of routing parcels through a network of hubs linked by trucks with finite capacity. We rephrase this as a multi-object goal-conditioned MDP. Our method combines graph neural networks with model-free RL, extracting small feature graphs from the environment state.
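To make the problem statement concrete, here is a minimal toy sketch of a state and transition for parcels moving between hubs on capacity-limited trucks, where each parcel carries its own goal hub. Everything in it is an illustrative assumption rather than the paper's formulation: the `Parcel` class, the `TRUCKS` table, the `step` function, the three-hub network, and the reward (a count of newly delivered parcels) are all hypothetical.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Parcel:
    location: str  # hub where the parcel currently sits
    goal: str      # destination hub, i.e. this parcel's goal


# Hypothetical toy network: directed truck links and how many parcels
# each truck can carry per decision step.
TRUCKS = {("A", "B"): 2, ("B", "C"): 1, ("A", "C"): 1}


def step(parcels, action):
    """Apply one decision step of the toy model.

    `action` maps a parcel index to the hub it should be sent to next.
    A parcel stays put if no truck serves that link or the truck is full.
    Returns the new parcel list and the number of newly delivered parcels.
    """
    load = {}  # parcels already loaded onto each truck in this step
    new_parcels = list(parcels)
    delivered = 0
    for i, next_hub in sorted(action.items()):
        p = parcels[i]
        link = (p.location, next_hub)
        if p.location == p.goal or link not in TRUCKS:
            continue  # already delivered, or no truck on this link
        if load.get(link, 0) >= TRUCKS[link]:
            continue  # truck capacity exhausted
        load[link] = load.get(link, 0) + 1
        new_parcels[i] = Parcel(next_hub, p.goal)
        if next_hub == p.goal:
            delivered += 1
    return new_parcels, delivered


parcels = [Parcel("A", "C"), Parcel("A", "C"), Parcel("A", "B")]
parcels, reward = step(parcels, {0: "C", 1: "C", 2: "B"})
```

In this sketch the two parcels bound for hub C compete for a single slot on the direct A-to-C truck, so one of them is left behind and has to be routed again in a later step. That capacity coupling between parcels is what separates the problem from independent shortest-path routing.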