TEDBench/miae-b

@@ -1,24 +1,24 @@
 ---
 library_name: tedbench
-tags:
-  - protein
-  - structure
-  - fold-classification
-  - tedbench
-pipeline_tag: feature-extraction
 license: bsd-3-clause
 ---
 # TEDBench — Pretrained autoencoder (structure only)
 **Variant:** `miae_b` &nbsp;|&nbsp; **Parameters:** 102M &nbsp;|&nbsp; **Layers:** 12 &nbsp;|&nbsp; **Hidden dim:** 768 &nbsp;|&nbsp; **Attn heads:** 12
-This is a **pretrained MiAE** checkpoint. Use it as a feature extractor or as the starting point for fine-tuning.
-Part of the [TEDBench](https://github.com/BorgwardtLab/TEDBench) benchmark for
-protein fold classification (ICML 2026). MiAE is an SE(3)-invariant masked
-autoencoder that masks up to 90% of backbone frames and reconstructs the full
-structure with a lightweight decoder.
 ## Architecture sizes
@@ -34,6 +34,8 @@ Append `+model.use_seq_input=true` to `miae_b` for the **+seq** variant.
 ### Load from the HuggingFace Hub
 ```python
 from tedbench.utils.io import load_from_hf
@@ -55,8 +57,8 @@ model.eval()
 ```bibtex
 @inproceedings{chen2026tedbench,
   title={Protein Fold Classification at Scale: Benchmarking and Pretraining},
-  author={Chen, Dexiong and Manolache, Andrei and Niepert, Mathias and Borgwardt, Karsten},
-  booktitle={Proceedings of the 43rd International Conference on Machine Learning},
   year={2026}
 }
-```

 ---
 library_name: tedbench
 license: bsd-3-clause
+pipeline_tag: graph-ml
+tags:
+- protein
+- structure
+- fold-classification
+- tedbench
 ---
 # TEDBench — Pretrained autoencoder (structure only)
 **Variant:** `miae_b` &nbsp;|&nbsp; **Parameters:** 102M &nbsp;|&nbsp; **Layers:** 12 &nbsp;|&nbsp; **Hidden dim:** 768 &nbsp;|&nbsp; **Attn heads:** 12
+This repository contains the **pretrained MiAE-B** (base) checkpoint, a self-supervised model for protein structure representation learning introduced in the paper [Protein Fold Classification at Scale: Benchmarking and Pretraining](https://huggingface.co/papers/2605.18552).
+MiAE (Masked Invariant Autoencoders) is an $\mathrm{SE(3)}$-invariant masked autoencoder: it masks up to 90% of backbone frames and reconstructs the full protein structure with a lightweight decoder. The checkpoint can be used as a feature extractor or as the starting point for fine-tuning on protein fold classification tasks.
+- **Authors:** Dexiong Chen, Andrei Manolache, Mathias Niepert, Karsten Borgwardt
+- **Official Code:** [BorgwardtLab/TEDBench](https://github.com/BorgwardtLab/TEDBench)
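To make the masking ratio concrete, here is a minimal sketch of choosing which residue frames to hide at a 90% ratio. It is an illustration only, not the TEDBench implementation: the function name `sample_frame_mask` is hypothetical, and uniform random selection of residues is an assumption (the card states the ratio, not the sampling scheme).

```python
import random


def sample_frame_mask(num_residues, mask_ratio=0.9, seed=None):
    """Return a list of booleans; True marks a masked backbone frame.

    Hypothetical helper for illustration. Assumes residues are masked
    uniformly at random, which the model card does not specify.
    """
    if num_residues < 1:
        raise ValueError("num_residues must be at least 1")
    if not 0.0 <= mask_ratio < 1.0:
        raise ValueError("mask_ratio must be in [0, 1)")
    rng = random.Random(seed)
    # Always keep at least one frame visible so the encoder has input.
    num_masked = min(round(mask_ratio * num_residues), num_residues - 1)
    masked = set(rng.sample(range(num_residues), num_masked))
    return [i in masked for i in range(num_residues)]


# A 200-residue domain at a 90% ratio leaves 20 frames for the encoder.
mask = sample_frame_mask(200, mask_ratio=0.9, seed=0)
visible_idx = [i for i, is_masked in enumerate(mask) if not is_masked]
print(len(visible_idx), "visible frames out of", len(mask))
```

In a masked autoencoder of this kind, the encoder would typically see only the frames at `visible_idx`, and the decoder would be asked to reconstruct all positions.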
 ## Architecture sizes
 ### Load from the HuggingFace Hub
+Using the `tedbench` library:
 ```python
 from tedbench.utils.io import load_from_hf
 ```bibtex
 @inproceedings{chen2026tedbench,
   title={Protein Fold Classification at Scale: Benchmarking and Pretraining},
+  author={Chen, Dexiong and Manolache, Andrei and Niepert, Mathias and Borgwardt, Karsten},
+  booktitle={Proceedings of the 43rd International Conference on Machine Learning (ICML)},
   year={2026}
 }
+```