---
license: bsd-3-clause
library_name: braindecode
pipeline_tag: feature-extraction
tags:
- eeg
- biosignal
- pytorch
- neuroscience
- braindecode
- foundation-model
- transformer
---

# CodeBrain

CodeBrain: Scalable Code EEG Pre-Training for Unified Downstream BCI Tasks.

> **Architecture-only repository.** This repo documents the
> `braindecode.models.CodeBrain` class. **No pretrained weights are
> distributed here** — instantiate the model and train it on your own
> data, or fine-tune from a published foundation-model checkpoint
> separately.

## Quick start

```bash
pip install braindecode
```

```python
from braindecode.models import CodeBrain

model = CodeBrain(
    n_chans=22,
    sfreq=200,
    input_window_seconds=4.0,
    n_outputs=2,
)
```

The signal-shape arguments above are example defaults — adjust them to match your recording.

## Documentation

- Full API reference (parameters, references, architecture figure):
- Interactive browser with live instantiation:
- Source on GitHub:

## Architecture description

The block below is the rendered class docstring (parameters, references, architecture figure where available).

CodeBrain: Scalable Code EEG Pre-Training for Unified Downstream BCI Tasks.

.. figure:: https://raw.githubusercontent.com/jingyingma01/CodeBrain/refs/heads/main/assets/intro.png
   :align: center
   :alt: CodeBrain pre-training overview
   :width: 1000px

CodeBrain is a foundation model for EEG that pre-trains on large unlabelled corpora using a two-stage vector-quantised masking strategy, then fine-tunes on downstream BCI tasks. It segments EEG signals into fixed-size patches, embeds them with convolutional and spectral projections, and processes them through stacked residual blocks that combine a multi-scale convolutional structured state-space model (``_GConv``) with sliding-window self-attention.

.. rubric:: Stage 2: EEGSSM Backbone (this implementation)

This class implements Stage 2 of CodeBrain — the EEGSSM backbone described in Section 3.3 of [codebrain]_. Following :class:`Labram`, CodeBrain discretises EEG patches into codebook tokens via VQ-VAE (Stage 1, not implemented here), then trains the backbone to predict masked token indices via cross-entropy. CodeBrain extends this with a *dual* tokenizer that decouples temporal and frequency representations, as stated in the paper: *"the TFDual-Tokenizer, which decouples heterogeneous temporal and frequency EEG signals into discrete tokens to enhance discriminative power."*

.. rubric:: Macro Components

- **PatchEmbedding**: Splits ``(batch, n_chans, n_times)`` into ``(batch, n_chans, seq_len, patch_size)`` patches, projects each patch with a 2-D convolutional stack, adds FFT-based spectral embeddings, and applies depth-wise convolutional positional encoding.
- **Residual blocks** (``ResidualGroup``): Each block applies RMSNorm, a ``_GConv`` SSM layer, and sliding-window multi-head attention, with gated activation and separate residual/skip paths.
- **Classification head** (``final_layer``): Flattens the output and maps to ``n_outputs`` classes.
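As a rough, standalone illustration of the patch layout described above (plain ``torch`` reshaping for intuition only, not the library's internal ``PatchEmbedding`` code; the sizes are example values):

.. code:: python

    import torch

    # Example sizes only: 22 channels, 4 s at 200 Hz, default patch_size.
    batch, n_chans, n_times = 8, 22, 800
    patch_size = 200

    x = torch.randn(batch, n_chans, n_times)

    # Trim to the nearest multiple of patch_size, then split into patches,
    # matching the (batch, n_chans, seq_len, patch_size) layout above.
    n_keep = (n_times // patch_size) * patch_size
    seq_len = n_keep // patch_size
    patches = x[..., :n_keep].reshape(batch, n_chans, seq_len, patch_size)

    print(patches.shape)  # torch.Size([8, 22, 4, 200])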
.. important::

   **Pre-trained Weights Available**

   This model has pre-trained weights available on the Hugging Face Hub. You can load them using:

   .. code:: python

      from braindecode.models import CodeBrain

      # Load pre-trained model from Hugging Face Hub
      model = CodeBrain.from_pretrained("braindecode/codebrain-pretrained")

   To push your own trained model to the Hub:

   .. code:: python

      model.push_to_hub("my-username/my-codebrain")

Parameters
----------
patch_size : int, default=200
    Number of time samples per patch. Input length is trimmed to the nearest multiple of ``patch_size``.
res_channels : int, default=200
    Width of the residual stream inside each ``ResidualBlock``.
skip_channels : int, default=200
    Width of the skip-connection stream aggregated across blocks.
out_channels : int, default=200
    Output channels of ``final_conv`` before the classification head.
num_res_layers : int, default=8
    Number of stacked ``ResidualBlock`` modules.
drop_prob : float, default=0.1
    Dropout rate used inside the ``_GConv`` SSM and attention layers.
s4_bidirectional : bool, default=True
    Whether the ``_GConv`` SSM processes the sequence bidirectionally.
s4_layernorm : bool, default=False
    Whether to apply layer normalisation inside the ``_GConv`` SSM. Set to ``False`` to match the released pretrained checkpoint.
s4_lmax : int, default=570
    Maximum sequence length for the ``_GConv`` SSM kernel. Also determines the patch embedding dimension as ``s4_lmax // n_chans``.
s4_d_state : int, default=64
    State dimension of the ``_GConv`` SSM.
conv_out_chans : int, default=25
    Number of output channels in the patch projection convolutions.
conv_groups : int, default=5
    Number of groups for ``GroupNorm`` in the patch projection.
activation : type[nn.Module], default=nn.ReLU
    Non-linear activation class used in ``init_conv`` and ``final_conv``.

References
----------
.. [codebrain] Yi Ding, Xuyang Chen, Yong Li, Rui Yan, Tao Wang, Le Wu (2025).
   CodeBrain: Scalable Code EEG Pre-Training for Unified Downstream BCI Tasks.
   https://arxiv.org/abs/2506.09110

.. rubric:: Hugging Face Hub integration

When the optional ``huggingface_hub`` package is installed, all models automatically gain the ability to be pushed to and loaded from the Hugging Face Hub. Install with::

    pip install braindecode[hub]

**Pushing a model to the Hub:**

.. code::

    from braindecode.models import CodeBrain

    # Train your model
    model = CodeBrain(n_chans=22, n_outputs=4, n_times=1000)
    # ... training code ...

    # Push to the Hub
    model.push_to_hub(
        repo_id="username/my-codebrain-model",
        commit_message="Initial model upload",
    )

**Loading a model from the Hub:**

.. code::

    from braindecode.models import CodeBrain

    # Load pretrained model
    model = CodeBrain.from_pretrained("username/my-codebrain-model")

    # Load with a different number of outputs (head is rebuilt automatically)
    model = CodeBrain.from_pretrained("username/my-codebrain-model", n_outputs=4)

**Extracting features and replacing the head:**

.. code::

    import torch

    x = torch.randn(1, model.n_chans, model.n_times)

    # Extract encoder features (consistent dict across all models)
    out = model(x, return_features=True)
    features = out["features"]

    # Replace the classification head
    model.reset_head(n_outputs=10)

**Saving and restoring full configuration:**

.. code::

    import json

    config = model.get_config()  # all __init__ params
    with open("config.json", "w") as f:
        json.dump(config, f)

    model2 = CodeBrain.from_config(config)  # reconstruct (no weights)

All model parameters (both EEG-specific and model-specific such as dropout rates, activation functions, number of filters) are automatically saved to the Hub and restored when loading.

See :ref:`load-pretrained-models` for a complete tutorial.
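## Example: custom configuration and forward pass

The snippet below is a minimal sketch of setting a few of the hyperparameters documented above and checking the output shape of a forward pass. The keyword names (`n_chans`, `n_outputs`, `n_times`, `patch_size`, `num_res_layers`, `drop_prob`) follow the Parameters list and the Hub example; the values are arbitrary illustrations, not recommendations, and the printed shape assumes the usual braindecode convention of `(batch, n_outputs)` logits.

```python
import torch
from braindecode.models import CodeBrain

# Example configuration; values chosen for illustration only.
model = CodeBrain(
    n_chans=22,
    n_outputs=4,
    n_times=1000,
    patch_size=200,    # time samples per patch
    num_res_layers=8,  # stacked ResidualBlock modules
    drop_prob=0.1,     # dropout inside the SSM and attention layers
)

# Dummy batch of shape (batch, n_chans, n_times); the input length is
# trimmed to the nearest multiple of patch_size internally.
x = torch.randn(2, 22, 1000)

with torch.no_grad():
    logits = model(x)

print(logits.shape)  # expected: torch.Size([2, 4])
```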
## Citation

Please cite both the original paper for this architecture (see the *References* section above) and braindecode:

```bibtex
@article{aristimunha2025braindecode,
  title   = {Braindecode: a deep learning library for raw electrophysiological data},
  author  = {Aristimunha, Bruno and others},
  journal = {Zenodo},
  year    = {2025},
  doi     = {10.5281/zenodo.17699192},
}
```

## License

BSD-3-Clause for the model code (matching braindecode). If you fine-tune from a published checkpoint, the resulting weights inherit the licence of that checkpoint and its training corpus.