Guowei-Zou
/

CoFlow-checkpoints

Reinforcement Learning

multi-agent-reinforcement-learning

generative-models

Model card Files Files and versions

CoFlow-checkpoints / README.md

Guowei-Zou's picture

Remove duplicate quick links from model card

7284558 verified about 14 hours ago

|

history blame contribute delete

3.27 kB

	---
	license: mit
	library_name: pytorch
	tags:
	- reinforcement-learning
	- multi-agent-reinforcement-learning
	- offline-rl
	- flow-matching
	- generative-models
	- pytorch
	- arxiv:2605.01457
	---

	# CoFlow Checkpoints

	Official checkpoints for CoFlow: Coordinated Few-Step Flow for Offline Multi-Agent Decision Making.

	<p align="center">
	<a href="https://arxiv.org/abs/2605.01457"><img src="https://img.shields.io/badge/Paper-arXiv%3A2605.01457-b31b1b?style=for-the-badge&logo=arxiv" alt="Paper"></a>
	<a href="https://guowei-zou.github.io/coflow/"><img src="https://img.shields.io/badge/Project-Page-4169e1?style=for-the-badge&logo=githubpages" alt="Project Page"></a>
	<a href="https://github.com/Guowei-Zou/coflow-release"><img src="https://img.shields.io/badge/Code-GitHub-111111?style=for-the-badge&logo=github" alt="Code"></a>
	<a href="#citation"><img src="https://img.shields.io/badge/Citation-BibTeX-ff8c00?style=for-the-badge" alt="Citation"></a>
	</p>

	CoFlow is a coordinated few-step generative model for offline multi-agent reinforcement learning. It combines Coordinated Velocity Attention with adaptive coordination gating so multi-agent actions can be generated in one to a few model calls while preserving inter-agent coordination.

	## Repository Contents

	This repository contains the 120 checkpoints used in the paper:

	- 30 task-quality configurations
	- 4 model variants per configuration

	The task-quality configurations cover:

	- MPE: Spread, Tag, and World with `expert`, `medium-replay`, `medium`, and `random` data qualities
	- SMAC: `3m`, `8m`, `2s3z`, and `5m_vs_6m` with `Good`, `Medium`, and `Poor` data qualities
	- MA-MuJoCo: `2xAnt` and `4xAnt` with `Good`, `Medium`, and `Poor` data qualities

	Model variants:

	- `coflow-c`: CoFlow with centralized execution
	- `coflow-d`: CoFlow with decentralized execution
	- `coflow-base-c`: CoFlow-base with centralized execution
	- `coflow-base-d`: CoFlow-base with decentralized execution

	Each leaf directory contains one paper-used `state_*.pt` checkpoint. See `MANIFEST.tsv` for the mapping from paper configuration to source run, seed, checkpoint step, and file size.

	## Download

	Download the full checkpoint release:

	```bash
	hf download Guowei-Zou/CoFlow-checkpoints --local-dir CoFlow-checkpoints
	```

	Download one configuration, for example MPE Spread Expert with CoFlow-C:

	```bash
	hf download Guowei-Zou/CoFlow-checkpoints \
	--include "mpe/simple_spread/expert/coflow-c/*" \
	--local-dir CoFlow-checkpoints
	```

	## Usage

	The checkpoints are intended to be used with the official code release:

	```bash
	git clone https://github.com/Guowei-Zou/coflow-release.git
	```

	Please follow the setup, evaluation, and configuration instructions in the GitHub repository. The directory structure in this checkpoint repository is aligned with the paper task names and model variants.

	## Citation

	```bibtex
	@misc{zou2026coflowcoordinatedfewstepflow,
	title={CoFlow: Coordinated Few-Step Flow for Offline Multi-Agent Decision Making},
	author={Guowei Zou and Haitao Wang and Beiwen Zhang and Boning Zhang and Hejun Wu},
	year={2026},
	eprint={2605.01457},
	archivePrefix={arXiv},
	primaryClass={cs.AI},
	url={https://arxiv.org/abs/2605.01457},
	}
	```