nielsr HF Staff commited on
Commit
65edee0
·
verified ·
1 Parent(s): 9859ff5

Improve model card and update metadata

Browse files

Hi, I'm Niels from the Hugging Face community science team.

This PR improves your model card by:
- Updating the `pipeline_tag` to `graph-ml` to improve discoverability in the Graph ML section of the Hub.
- Adding a direct link to the associated paper: [Protein Fold Classification at Scale: Benchmarking and Pretraining](https://huggingface.co/papers/2605.18552).
- Adding a link to the official GitHub repository.
- Ensuring metadata like `library_name` is properly set.

Please review and merge if this looks good!

Files changed (1) hide show
  1. README.md +16 -14
README.md CHANGED
@@ -1,24 +1,24 @@
1
  ---
2
  library_name: tedbench
3
- tags:
4
- - protein
5
- - structure
6
- - fold-classification
7
- - tedbench
8
- pipeline_tag: feature-extraction
9
  license: bsd-3-clause
 
 
 
 
 
 
10
  ---
11
 
12
  # TEDBench — Pretrained autoencoder (structure only)
13
 
14
  **Variant:** `miae_b`  |  **Parameters:** 102M  |  **Layers:** 12  |  **Hidden dim:** 768  |  **Attn heads:** 12
15
 
16
- This is a **pretrained MiAE** checkpoint. Use it as a feature extractor or as the starting point for fine-tuning.
 
 
17
 
18
- Part of the [TEDBench](https://github.com/BorgwardtLab/TEDBench) benchmark for
19
- protein fold classification (ICML 2026). MiAE is an SE(3)-invariant masked
20
- autoencoder that masks up to 90% of backbone frames and reconstructs the full
21
- structure with a lightweight decoder.
22
 
23
  ## Architecture sizes
24
 
@@ -34,6 +34,8 @@ Append `+model.use_seq_input=true` to `miae_b` for the **+seq** variant.
34
 
35
  ### Load from the HuggingFace Hub
36
 
 
 
37
  ```python
38
  from tedbench.utils.io import load_from_hf
39
 
@@ -55,8 +57,8 @@ model.eval()
55
  ```bibtex
56
  @inproceedings{chen2026tedbench,
57
  title={Protein Fold Classification at Scale: Benchmarking and Pretraining},
58
- author={Chen, Dexiong and Manolache, Andrei and Niepert, Mathias and Borgwardt, Karsten},
59
- booktitle={Proceedings of the 43rd International Conference on Machine Learning},
60
  year={2026}
61
  }
62
- ```
 
1
  ---
2
  library_name: tedbench
 
 
 
 
 
 
3
  license: bsd-3-clause
4
+ pipeline_tag: graph-ml
5
+ tags:
6
+ - protein
7
+ - structure
8
+ - fold-classification
9
+ - tedbench
10
  ---
11
 
12
  # TEDBench — Pretrained autoencoder (structure only)
13
 
14
  **Variant:** `miae_b`  |  **Parameters:** 102M  |  **Layers:** 12  |  **Hidden dim:** 768  |  **Attn heads:** 12
15
 
16
+ This repository contains the **pretrained MiAE-B** (base) checkpoint, a self-supervised model for protein structure representation learning introduced in the paper [Protein Fold Classification at Scale: Benchmarking and Pretraining](https://huggingface.co/papers/2605.18552).
17
+
18
+ MiAE (Masked Invariant Autoencoders) is an $\mathrm{SE(3)}$-invariant autoencoder that uses an extremely high masking ratio (up to 90%) of backbone frames to reconstruct full protein structures. It can be used as a feature extractor or as the starting point for fine-tuning on protein fold classification tasks.
19
 
20
+ - **Authors:** Dexiong Chen, Andrei Manolache, Mathias Niepert, Karsten Borgwardt
21
+ - **Official Code:** [BorgwardtLab/TEDBench](https://github.com/BorgwardtLab/TEDBench)
 
 
22
 
23
  ## Architecture sizes
24
 
 
34
 
35
  ### Load from the HuggingFace Hub
36
 
37
+ Using the `tedbench` library:
38
+
39
  ```python
40
  from tedbench.utils.io import load_from_hf
41
 
 
57
  ```bibtex
58
  @inproceedings{chen2026tedbench,
59
  title={Protein Fold Classification at Scale: Benchmarking and Pretraining},
60
+ author={Chen, Dexiong explorer and Manolache, Andrei and Niepert, Mathias and Borgwardt, Karsten},
61
+ booktitle={Proceedings of the 43rd International Conference on Machine Learning (ICML)},
62
  year={2026}
63
  }
64
+ ```