bruAristimunha committed on
Commit cbc3460 · 1 Parent(s): 35b6030

Rewrite org card: pure markdown, matches HF-conventional pattern (Google/Meta/Qwen refs)

  1. org-readme/README.md +42 -68
org-readme/README.md CHANGED
@@ -1,47 +1,32 @@
1
- <div align="center">
2
 
3
- <img src="https://raw.githubusercontent.com/eegdash/EEGDash/main/docs/source/_static/eegdash_long.svg" width="340" alt="EEGDash" />
4
-
5
- ### The open catalog of EEG / MEG datasets — indexed, described, and loadable with one line of Python.
6
 
7
  [![PyPI](https://img.shields.io/pypi/v/eegdash?style=flat-square&logo=pypi&logoColor=white&color=0072B2)](https://pypi.org/project/eegdash/)
8
  [![Python](https://img.shields.io/pypi/pyversions/eegdash?style=flat-square&color=0072B2)](https://pypi.org/project/eegdash/)
9
  [![License](https://img.shields.io/badge/license-BSD--3--Clause-009E73?style=flat-square)](https://github.com/eegdash/EEGDash/blob/main/LICENSE)
10
- [![GitHub](https://img.shields.io/github/stars/eegdash/EEGDash?style=flat-square&logo=github&color=E69F00)](https://github.com/eegdash/EEGDash)
11
  [![Downloads](https://static.pepy.tech/badge/eegdash)](https://pepy.tech/project/eegdash)
 
12
 
13
- [**🗺️ Browse the catalog**](https://huggingface.co/spaces/EEGDash/catalog) · [**📚 Docs**](https://eegdash.org) · [**💻 GitHub**](https://github.com/eegdash/EEGDash) · [**📦 PyPI**](https://pypi.org/project/eegdash/)
14
-
15
- </div>
16
-
17
- ---
18
-
19
- ## Why this exists
20
 
21
- Publicly funded neuroscience produces a river of EEG/MEG data — but most of it is stuck: BIDS trees on S3, idiosyncratic loaders, inconsistent metadata, no way to search across studies. **EEGDash is the index.** We catalog every publicly shared EEG/MEG study we can find, normalize the metadata, and expose each one as a single Python object that plugs straight into [braindecode](https://braindecode.org) and PyTorch.
22
 
23
- Raw data **never gets rehosted** — each repo here is a pointer to its canonical source (OpenNeuro, NEMAR, lab S3). `EEGDashDataset` handles the download, caching, and conversion on demand.
24
 
25
- ## By the numbers
 
 
 
 
26
 
27
- <div align="center">
28
 
29
- | | | | |
30
- |:---:|:---:|:---:|:---:|
31
- | **736** | **40,361** | **222,750** | **85,298** |
32
- | datasets | subjects | recordings | hours of data |
33
 
34
- | **600** | **571** | **73** | **55** | **22** |
35
- |:---:|:---:|:---:|:---:|:---:|
36
- | 🤗 mirrored | EEG | iEEG | MEG | fNIRS |
37
 
38
- </div>
39
-
40
- **Clinical populations covered:** Parkinson's · Alzheimer's · ADHD · Schizophrenia · Depression · Epilepsy · Dementia · Dyslexia · Development · post-surgical · healthy controls.
41
-
42
- **Experimental paradigms:** Visual · Auditory · Motor · Multisensory · Tactile · Resting State · Sleep · Anesthesia.
43
-
44
- **Data sources:** 546 from [OpenNeuro](https://openneuro.org) · 190 from [NEMAR](https://nemar.org).
45
 
46
  ## Get started in 30 seconds
47
 
@@ -52,79 +37,68 @@ pip install eegdash
52
  ```python
53
  from eegdash import EEGDashDataset
54
 
55
- # Load any dataset in the catalog by its ID...
56
  ds = EEGDashDataset(dataset="ds002718", cache_dir="./cache")
57
  print(len(ds), "recordings")
58
 
59
- # ...or by its canonical name — every known alias is registered:
60
  from eegdash.dataset import Wakeman2015
61
  ds = Wakeman2015(cache_dir="./cache")
62
 
63
- # ...or pull a Hub-mirrored Zarr copy directly:
64
  from braindecode.datasets import BaseConcatDataset
65
  ds = BaseConcatDataset.pull_from_hub("EEGDash/ds002718")
66
 
67
- # Plug into PyTorch — EEGDash datasets ARE braindecode datasets.
68
  from torch.utils.data import DataLoader
69
  loader = DataLoader(ds, batch_size=32, shuffle=True)
70
  ```
71
 
72
- ## Start here
73
 
74
- - 🗺️ **[Catalog Space](https://huggingface.co/spaces/EEGDash/catalog)** — searchable, filterable view over all 736 datasets with interactive treemap/sankey/growth plots.
75
- - 📚 **[Documentation](https://eegdash.org)** — tutorials, per-dataset cards, preprocessing recipes, API reference.
76
- - 🎓 **[Tutorials](https://eegdash.org/generated/auto_examples/index.html)** — end-to-end examples from load to trained model.
77
- - 🧪 **[EEG 2025 challenge](https://eegdash.org/eeg2025/)** — benchmarks on Healthy Brain Network data, 22 mini-releases ready to download.
78
 
79
- ## Featured datasets
 
 
 
 
 
80
 
81
- | Dataset | What it is | Population | Size | Canonical |
82
- |---|---|---|---|---|
83
- | [`ds002718`](https://huggingface.co/datasets/EEGDash/ds002718) | Face recognition (Wakeman & Henson) | Healthy (18) | 4.3 GB | `Wakeman2015` |
84
- | [`ds003800`](https://huggingface.co/datasets/EEGDash/ds003800) | Resting state / auditory | Parkinson's | small | — |
85
- | [`ds002799`](https://huggingface.co/datasets/EEGDash/ds002799) | Patient-day recording | Dementia | — | — |
86
- | [`EEG2025r1`](https://huggingface.co/datasets/EEGDash/eeg2025r1) | HBN multi-task — 136 subjects, 10 paradigms | Development | 20.6 GB | `HBN_r1_bdf` |
87
- | [`ds004551`](https://huggingface.co/datasets/EEGDash/ds004551) | iEEG recordings | Surgery | — | — |
88
- | [`ds000117`](https://huggingface.co/datasets/EEGDash/ds000117) | MEG+EEG multi-modal face | Healthy | — | `WakemanHenson_MEEG` |
89
 
90
- **[Browse all 600 mirrored datasets →](https://huggingface.co/EEGDash)**
91
 
92
  ## Backed by
93
 
94
- EEGDash is a **U.S.–Israel collaboration** supported by the **National Science Foundation** as part of the EEG-DaSh initiative:
95
 
96
- - **Swartz Center for Computational Neuroscience (SCCN)** University of California, San Diego
97
- - **Ben-Gurion University of the Negev** Beer-Sheva, Israel
98
 
99
- Built on top of (and giving back to) the wider open-neuroscience ecosystem: [MNE-Python](https://mne.tools), [braindecode](https://braindecode.org), [EEGLAB](https://eeglab.org), [BIDS](https://bids.neuroimaging.io), [OpenNeuro](https://openneuro.org), [NEMAR](https://nemar.org).
100
 
101
  ## Contribute
102
 
103
- The entire catalog is regenerated from **one CSV** (`eegdash/dataset/dataset_summary.csv`) and the EEGDash API. Missing a dataset? Spotted wrong metadata?
104
 
105
- - [Open an issue](https://github.com/eegdash/EEGDash/issues) — we add datasets on request.
106
- - Fix the CSV or push a new `dataset_description.json` — every stub on HF regenerates automatically from the single source.
107
- - Bigger ideas? [CONTRIBUTING.md](https://github.com/eegdash/EEGDash/blob/main/CONTRIBUTING.md).
108
 
109
  ## Cite
110
 
111
  ```bibtex
112
  @software{eegdash,
113
- title = {EEGDash: An open catalog and loader for EEG/MEG datasets},
114
- author = {{EEG-DaSh contributors}},
115
- url = {https://github.com/eegdash/EEGDash},
116
- year = {2025},
117
  license = {BSD-3-Clause}
118
  }
119
  ```
120
 
121
- When you use a specific dataset, **follow its upstream citation policy** — the link is in every dataset's card under *How to cite*.
122
 
123
  ---
124
 
125
- <div align="center">
126
- <sub>
127
- EEGDash code is <b>BSD-3-Clause</b>. Each dataset retains its upstream license — check the card before redistribution.<br/>
128
- <em>Open, indexed, loadable.</em>
129
- </sub>
130
- </div>
 
1
+ ![EEGDash — the open catalog of EEG/MEG datasets](https://raw.githubusercontent.com/eegdash/EEGDash/main/docs/source/_static/eegdash_long.svg)
2
 
3
+ *The open catalog of EEG / MEG datasets — indexed, described, and loadable with one line of Python.*
 
 
4
 
5
  [![PyPI](https://img.shields.io/pypi/v/eegdash?style=flat-square&logo=pypi&logoColor=white&color=0072B2)](https://pypi.org/project/eegdash/)
6
  [![Python](https://img.shields.io/pypi/pyversions/eegdash?style=flat-square&color=0072B2)](https://pypi.org/project/eegdash/)
7
  [![License](https://img.shields.io/badge/license-BSD--3--Clause-009E73?style=flat-square)](https://github.com/eegdash/EEGDash/blob/main/LICENSE)
 
8
  [![Downloads](https://static.pepy.tech/badge/eegdash)](https://pepy.tech/project/eegdash)
9
+ [![Stars](https://img.shields.io/github/stars/eegdash/EEGDash?style=flat-square&logo=github&color=E69F00)](https://github.com/eegdash/EEGDash)
10
 
11
+ Welcome to the official Hugging Face org for **[EEGDash](https://eegdash.org)** — an open archive of publicly shared EEG and MEG studies, built to remove the friction between the river of publicly funded neuroscience data and modern ML pipelines. Raw recordings never get rehosted here; every dataset on this page is a **pointer** to its canonical source (OpenNeuro, NEMAR, or the lab that collected it), and `EEGDashDataset` handles the download, caching, and conversion on demand.
 
 
 
 
 
 
12
 
13
+ **[🗺️ Browse the interactive catalog](https://huggingface.co/spaces/EEGDash/catalog)** · [📚 Docs](https://eegdash.org) · [💻 GitHub](https://github.com/eegdash/EEGDash) · [📦 PyPI](https://pypi.org/project/eegdash/)
14
 
15
+ ## The archive in one glance
16
 
17
+ - **736** EEG / MEG datasets indexed, of which **600+** are already mirrored on 🤗, with more added daily
18
+ - **40,361** subjects · **222,750** recordings · **85,298 hours** of signal
19
+ - **571 EEG** · **73 iEEG** · **55 MEG** · **22 fNIRS** · a handful of multimodal combos
20
+ - Every dataset reachable by a stable slug (`ds002718`) *and* by canonical alias (`Wakeman2015`)
21
+ - Sourced from **[OpenNeuro](https://openneuro.org)** (546) and **[NEMAR](https://nemar.org)** (190)
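
The headline figures above can be cross-checked with a few lines of arithmetic. This is only a sketch using the numbers quoted in the list. Reading the gap between the total and the per-modality counts as the "multimodal combos" entry is an assumption, since the card does not give that count.

```python
# Headline figures copied from the list above.
datasets = 736
subjects = 40_361
recordings = 222_750
hours = 85_298

by_source = {"OpenNeuro": 546, "NEMAR": 190}
by_modality = {"EEG": 571, "iEEG": 73, "MEG": 55, "fNIRS": 22}

# The two sources together should account for every indexed dataset.
source_total = sum(by_source.values())

# Datasets not covered by the four per-modality counts. Treating this gap as
# the multimodal entries is an assumption, not something the card states.
modality_gap = datasets - sum(by_modality.values())

print("sources match total:", source_total == datasets)
print("datasets outside the four modality counts:", modality_gap)
print("mean subjects per dataset:", round(subjects / datasets, 1))
print("mean recordings per dataset:", round(recordings / datasets, 1))
print("mean minutes per recording:", round(hours / recordings * 60, 1))
```

The averages are rough orientation only; dataset sizes in the catalog vary widely.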
22
 
23
+ ### Clinical populations covered
24
 
25
+ Healthy controls (349) · Epilepsy (37) · Development — HBN releases (36+) · post-Surgery (17) · Dementia · Parkinson's · Schizophrenia · Depression · Dyslexia · ADHD · and more.
 
 
 
26
 
27
+ ### Experimental paradigms
 
 
28
 
29
+ Visual (300) · Auditory (59) · Multisensory (35) · Resting State (22) · Motor (17) · Tactile (16) · Sleep (13) · Anesthesia (4).
 
 
 
 
 
 
30
 
31
  ## Get started in 30 seconds
32
 
 
37
  ```python
38
  from eegdash import EEGDashDataset
39
 
40
+ # Load any dataset in the catalog by its ID
41
  ds = EEGDashDataset(dataset="ds002718", cache_dir="./cache")
42
  print(len(ds), "recordings")
43
 
44
+ # or by canonical alias — every known name is registered as a class:
45
  from eegdash.dataset import Wakeman2015
46
  ds = Wakeman2015(cache_dir="./cache")
47
 
48
+ # or pull a Hub-mirrored, pre-windowed Zarr copy:
49
  from braindecode.datasets import BaseConcatDataset
50
  ds = BaseConcatDataset.pull_from_hub("EEGDash/ds002718")
51
 
52
+ # EEGDash datasets ARE braindecode datasets — plug into PyTorch unchanged.
53
  from torch.utils.data import DataLoader
54
  loader = DataLoader(ds, batch_size=32, shuffle=True)
55
  ```
56
 
57
+ ## Featured datasets
58
 
59
+ A handful of representative entries — the full catalog is at the [Space](https://huggingface.co/spaces/EEGDash/catalog).
 
 
 
60
 
61
+ - **[`ds002718`](https://huggingface.co/datasets/EEGDash/ds002718)** — *Face processing EEG dataset for EEGLAB* (Wakeman & Henson, 2020). 18 healthy subjects, 250 Hz, CC0. Canonical alias: `Wakeman2015`.
62
+ - **[`ds000117`](https://huggingface.co/datasets/EEGDash/ds000117)** — Multi-subject multimodal **MEG + EEG** face processing, source-localization-ready.
63
+ - **[`EEG2025r1`](https://huggingface.co/datasets/EEGDash/eeg2025r1)** — **Healthy Brain Network** release 1 — 136 participants, 10 paradigms, 20 GB. The canonical developmental-EEG benchmark.
64
+ - **[`ds003800`](https://huggingface.co/datasets/EEGDash/ds003800)** — EEG in **Parkinson's disease**.
65
+ - **[`ds002799`](https://huggingface.co/datasets/EEGDash/ds002799)** — Patient-day recording in **dementia**.
66
+ - **[`ds004551`](https://huggingface.co/datasets/EEGDash/ds004551)** — **iEEG** / intracranial recordings from neurosurgical patients.
67
 
68
+ ## Spaces & tools
 
 
 
 
 
 
 
69
 
70
+ - **[EEGDash Catalog](https://huggingface.co/spaces/EEGDash/catalog)** — interactive search with modality × population × source × license filters, plus live treemap / sankey / growth views of the whole archive.
71
 
72
  ## Backed by
73
 
74
+ EEGDash is a **U.S.–Israel collaboration** funded by the **U.S. National Science Foundation** as part of the EEG-DaSh initiative:
75
 
76
+ - **Swartz Center for Computational Neuroscience (SCCN)**, University of California, San Diego
77
+ - **Ben-Gurion University of the Negev**, Beer-Sheva, Israel
78
 
79
+ Built on and contributing back to the open-neuroscience ecosystem: [MNE-Python](https://mne.tools), [braindecode](https://braindecode.org), [EEGLAB](https://eeglab.org), [BIDS](https://bids.neuroimaging.io), [OpenNeuro](https://openneuro.org), [NEMAR](https://nemar.org).
80
 
81
  ## Contribute
82
 
83
+ The entire catalog regenerates from **one CSV** (`eegdash/dataset/dataset_summary.csv`) plus the public EEGDash API. Missing a dataset? Wrong metadata? The fix surfaces everywhere at once.
84
 
85
+ - Request a new dataset or report bad metadata → [open an issue](https://github.com/eegdash/EEGDash/issues).
86
+ - Bigger contributions → see [CONTRIBUTING.md](https://github.com/eegdash/EEGDash/blob/main/CONTRIBUTING.md).
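
As a rough illustration of the single-source idea, the sketch below derives catalog-style totals from one CSV using only the standard library. The column names (`dataset`, `modality`, `source`, `n_subjects`) and the sample rows are invented for the example; the real `dataset_summary.csv` may be laid out differently, and this is not the project's actual regeneration script.

```python
import csv
import io
from collections import Counter

# Invented sample rows; the IDs and column names are hypothetical.
SAMPLE_CSV = """dataset,modality,source,n_subjects
demo001,EEG,OpenNeuro,18
demo002,MEG,OpenNeuro,16
demo003,iEEG,NEMAR,9
demo004,EEG,NEMAR,136
"""


def summarize(csv_text):
    """Compute headline counts from a summary CSV given as text."""
    rows = list(csv.DictReader(io.StringIO(csv_text)))
    return {
        "datasets": len(rows),
        "subjects": sum(int(row["n_subjects"]) for row in rows),
        "by_modality": Counter(row["modality"] for row in rows),
        "by_source": Counter(row["source"] for row in rows),
    }


summary = summarize(SAMPLE_CSV)
print(summary["datasets"], "datasets,", summary["subjects"], "subjects")
print(dict(summary["by_modality"]))
print(dict(summary["by_source"]))
```

Because every figure is computed from the same rows, correcting one CSV entry changes all derived counts together, which is the property the paragraph above describes.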
 
87
 
88
  ## Cite
89
 
90
  ```bibtex
91
  @software{eegdash,
92
+ title = {EEGDash: An open catalog and loader for EEG/MEG datasets},
93
+ author = {{EEG-DaSh contributors}},
94
+ url = {https://github.com/eegdash/EEGDash},
95
+ year = {2025},
96
  license = {BSD-3-Clause}
97
  }
98
  ```
99
 
100
+ When you use a specific dataset, **follow its upstream citation policy** — the link is in every dataset's HF card under *How to cite*.
101
 
102
  ---
103
 
104
+ *EEGDash code is BSD-3-Clause. Each dataset retains its upstream license — always check the card before redistribution. Open, indexed, loadable.*