File size: 3,274 Bytes
744b468
 
26c10c5
 
 
 
 
 
 
 
 
744b468
 
26c10c5
744b468
26c10c5
 
cb38347
 
 
 
 
 
 
26c10c5
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
744b468
 
 
 
 
 
 
26c10c5
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
---
license: mit
library_name: pytorch
tags:
  - reinforcement-learning
  - multi-agent-reinforcement-learning
  - offline-rl
  - flow-matching
  - generative-models
  - pytorch
  - arxiv:2605.01457
---

# CoFlow Checkpoints

Official checkpoints for **CoFlow: Coordinated Few-Step Flow for Offline Multi-Agent Decision Making**.

<p align="center">
  <a href="https://arxiv.org/abs/2605.01457"><img src="https://img.shields.io/badge/Paper-arXiv%3A2605.01457-b31b1b?style=for-the-badge&logo=arxiv" alt="Paper"></a>
  <a href="https://guowei-zou.github.io/coflow/"><img src="https://img.shields.io/badge/Project-Page-4169e1?style=for-the-badge&logo=githubpages" alt="Project Page"></a>
  <a href="https://github.com/Guowei-Zou/coflow-release"><img src="https://img.shields.io/badge/Code-GitHub-111111?style=for-the-badge&logo=github" alt="Code"></a>
  <a href="#citation"><img src="https://img.shields.io/badge/Citation-BibTeX-ff8c00?style=for-the-badge" alt="Citation"></a>
</p>

CoFlow is a coordinated few-step generative model for offline multi-agent reinforcement learning. It combines Coordinated Velocity Attention with adaptive coordination gating so multi-agent actions can be generated in one to a few model calls while preserving inter-agent coordination.

## Repository Contents

This repository contains the 120 checkpoints used in the paper:

- 30 task-quality configurations
- 4 model variants per configuration

The task-quality configurations cover:

- MPE: Spread, Tag, and World with `expert`, `medium-replay`, `medium`, and `random` data qualities
- SMAC: `3m`, `8m`, `2s3z`, and `5m_vs_6m` with `Good`, `Medium`, and `Poor` data qualities
- MA-MuJoCo: `2xAnt` and `4xAnt` with `Good`, `Medium`, and `Poor` data qualities

Model variants:

- `coflow-c`: CoFlow with centralized execution
- `coflow-d`: CoFlow with decentralized execution
- `coflow-base-c`: CoFlow-base with centralized execution
- `coflow-base-d`: CoFlow-base with decentralized execution

Each leaf directory contains one paper-used `state_*.pt` checkpoint. See `MANIFEST.tsv` for the mapping from paper configuration to source run, seed, checkpoint step, and file size.

## Download

Download the full checkpoint release:

```bash
hf download Guowei-Zou/CoFlow-checkpoints --local-dir CoFlow-checkpoints
```

Download one configuration, for example MPE Spread Expert with CoFlow-C:

```bash
hf download Guowei-Zou/CoFlow-checkpoints \
  --include "mpe/simple_spread/expert/coflow-c/*" \
  --local-dir CoFlow-checkpoints
```

## Usage

The checkpoints are intended to be used with the official code release:

```bash
git clone https://github.com/Guowei-Zou/coflow-release.git
```

Please follow the setup, evaluation, and configuration instructions in the GitHub repository. The directory structure in this checkpoint repository is aligned with the paper task names and model variants.

## Citation

```bibtex
@misc{zou2026coflowcoordinatedfewstepflow,
      title={CoFlow: Coordinated Few-Step Flow for Offline Multi-Agent Decision Making},
      author={Guowei Zou and Haitao Wang and Beiwen Zhang and Boning Zhang and Hejun Wu},
      year={2026},
      eprint={2605.01457},
      archivePrefix={arXiv},
      primaryClass={cs.AI},
      url={https://arxiv.org/abs/2605.01457},
}
```