Guowei-Zou commited on
Commit
26c10c5
·
verified ·
1 Parent(s): 43858a3

Update CoFlow checkpoint model card

Browse files
Files changed (1) hide show
  1. README.md +74 -3
README.md CHANGED
@@ -1,15 +1,86 @@
1
  ---
2
  license: mit
 
 
 
 
 
 
 
 
 
3
  ---
4
 
5
- # CoFlow Paper Checkpoints
6
 
7
- This private repository contains the 120 checkpoint files used for the CoFlow paper step-sweep experiments: 30 task/quality configurations times 4 variants.
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
8
 
9
- Variants:
10
  - `coflow-c`: CoFlow with centralized execution
11
  - `coflow-d`: CoFlow with decentralized execution
12
  - `coflow-base-c`: CoFlow-base with centralized execution
13
  - `coflow-base-d`: CoFlow-base with decentralized execution
14
 
15
  Each leaf directory contains one paper-used `state_*.pt` checkpoint. See `MANIFEST.tsv` for the mapping from paper configuration to source run, seed, checkpoint step, and file size.
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
  ---
2
  license: mit
3
+ library_name: pytorch
4
+ tags:
5
+ - reinforcement-learning
6
+ - multi-agent-reinforcement-learning
7
+ - offline-rl
8
+ - flow-matching
9
+ - generative-models
10
+ - pytorch
11
+ - arxiv:2605.01457
12
  ---
13
 
14
+ # CoFlow Checkpoints
15
 
16
+ Official checkpoints for **CoFlow: Coordinated Few-Step Flow for Offline Multi-Agent Decision Making**.
17
+
18
+ CoFlow is a coordinated few-step generative model for offline multi-agent reinforcement learning. It combines Coordinated Velocity Attention with adaptive coordination gating so multi-agent actions can be generated in one to a few model calls while preserving inter-agent coordination.
19
+
20
+ ## Links
21
+
22
+ - Paper: https://arxiv.org/abs/2605.01457
23
+ - Project page: https://guowei-zou.github.io/coflow/
24
+ - Code: https://github.com/Guowei-Zou/coflow-release
25
+
26
+ ## Repository Contents
27
+
28
+ This repository contains the 120 checkpoints used in the paper:
29
+
30
+ - 30 task-quality configurations
31
+ - 4 model variants per configuration
32
+
33
+ The task-quality configurations cover:
34
+
35
+ - MPE: Spread, Tag, and World with `expert`, `medium-replay`, `medium`, and `random` data qualities
36
+ - SMAC: `3m`, `8m`, `2s3z`, and `5m_vs_6m` with `Good`, `Medium`, and `Poor` data qualities
37
+ - MA-MuJoCo: `2xAnt` and `4xAnt` with `Good`, `Medium`, and `Poor` data qualities
38
+
39
+ Model variants:
40
 
 
41
  - `coflow-c`: CoFlow with centralized execution
42
  - `coflow-d`: CoFlow with decentralized execution
43
  - `coflow-base-c`: CoFlow-base with centralized execution
44
  - `coflow-base-d`: CoFlow-base with decentralized execution
45
 
46
  Each leaf directory contains one paper-used `state_*.pt` checkpoint. See `MANIFEST.tsv` for the mapping from paper configuration to source run, seed, checkpoint step, and file size.
47
+
48
+ ## Download
49
+
50
+ Download the full checkpoint release:
51
+
52
+ ```bash
53
+ hf download Guowei-Zou/CoFlow-checkpoints --local-dir CoFlow-checkpoints
54
+ ```
55
+
56
+ Download one configuration, for example MPE Spread Expert with CoFlow-C:
57
+
58
+ ```bash
59
+ hf download Guowei-Zou/CoFlow-checkpoints \
60
+ --include "mpe/simple_spread/expert/coflow-c/*" \
61
+ --local-dir CoFlow-checkpoints
62
+ ```
63
+
64
+ ## Usage
65
+
66
+ The checkpoints are intended to be used with the official code release:
67
+
68
+ ```bash
69
+ git clone https://github.com/Guowei-Zou/coflow-release.git
70
+ ```
71
+
72
+ Please follow the setup, evaluation, and configuration instructions in the GitHub repository. The directory structure in this checkpoint repository is aligned with the paper task names and model variants.
73
+
74
+ ## Citation
75
+
76
+ ```bibtex
77
+ @misc{zou2026coflowcoordinatedfewstepflow,
78
+ title={CoFlow: Coordinated Few-Step Flow for Offline Multi-Agent Decision Making},
79
+ author={Guowei Zou and Haitao Wang and Beiwen Zhang and Boning Zhang and Hejun Wu},
80
+ year={2026},
81
+ eprint={2605.01457},
82
+ archivePrefix={arXiv},
83
+ primaryClass={cs.AI},
84
+ url={https://arxiv.org/abs/2605.01457},
85
+ }
86
+ ```