AmayaGS nielsr HF Staff commited on
Commit
0ad1f1a
·
1 Parent(s): f09d9a7

Improve model card metadata and links (#1)

Browse files

- Improve model card metadata and links (f598a6d4347f30760c4437b8bd22a610045f537a)


Co-authored-by: Niels Rogge <nielsr@users.noreply.huggingface.co>

Files changed (1) hide show
  1. README.md +70 -71
README.md CHANGED
@@ -1,71 +1,70 @@
1
- ---
2
- license: mit
3
- tags:
4
- - medical-imaging
5
- - computational-pathology
6
- - survival-analysis
7
- - multimodal
8
- - tcga
9
- - interpretable
10
- datasets:
11
- - TCGA
12
- library_name: pytorch
13
- ---
14
-
15
- # ProtoPathway
16
-
17
- Pretrained checkpoints, preprocessed cohort data, and the curated pathway
18
- graph for **ProtoPathway**, an interpretable-by-design multimodal framework
19
- for cancer survival prediction.
20
-
21
- See the [code repository](https://github.com/AmayaGS/ProtoPathway) for usage,
22
- training, and evaluation instructions.
23
-
24
- ## Layout
25
-
26
- ```
27
- pathways/pathways_base_*.pkl curated Reactome + Hallmark pathway graph
28
- raw_inputs/ raw files for re-running preprocessing from scratch
29
- Reactome/ Reactome hierarchy files (GMT, relations, names)
30
- Hallmark/ MSigDB Hallmark gene sets
31
- {cohort}/ rna_clean.csv, clinical CSV, SurvPath splits
32
- cohorts/{cohort}/ preprocessed cohort data and trained models
33
- gene_expression.csv preprocessed expression matrix
34
- bipartite_graph.pt cohort-specific gene-pathway graph
35
- labels.csv survival times, events, and bins
36
- data_splits.pkl 5-fold CV splits (SurvPath-compatible)
37
- checkpoints/best_fold_{0..4}.pt trained model weights
38
- ```
39
-
40
- ## Cohorts
41
-
42
- Five TCGA cohorts: BRCA (N=714), BLCA (N=359), COADREAD (N=227),
43
- HNSC (N=392), STAD (N=318). Gene expression is the preprocessed
44
- SurvPath release. WSI patch features (UNI2-h) are not redistributed
45
- here and should be obtained from the
46
- [Mahmood Lab](https://huggingface.co/MahmoodLab/UNI2-h) directly.
47
-
48
- ## Quick load
49
-
50
- ```python
51
- from huggingface_hub import snapshot_download
52
-
53
- # Everything for one cohort plus the shared pathway file
54
- snapshot_download(
55
- repo_id="AmayaGS/ProtoPathway",
56
- local_dir="./assets",
57
- allow_patterns=["cohorts/TCGA-BLCA/*", "pathways/*"],
58
- )
59
- ```
60
-
61
- ## Citation
62
-
63
- ```bibtex
64
- @article{protopathway2026,
65
- title = {ProtoPathway: Biologically Structured Prototype-Pathway Fusion for Multimodal Cancer Survival Prediction},
66
- author = {Amaya Gallagher-Syed, Costantino Pitzalis, Myles J. Lewis, Michael R.
67
- Barnes, Gregory Slabaugh},
68
- journal = {arXiv preprint arXiv:2605.21454},
69
- year = {2026},
70
- }
71
- ```
 
1
+ ---
2
+ datasets:
3
+ - TCGA
4
+ library_name: pytorch
5
+ license: mit
6
+ pipeline_tag: other
7
+ tags:
8
+ - medical-imaging
9
+ - computational-pathology
10
+ - survival-analysis
11
+ - multimodal
12
+ - tcga
13
+ - interpretable
14
+ ---
15
+
16
+ # ProtoPathway
17
+
18
+ Pretrained checkpoints, preprocessed cohort data, and the curated pathway graph for **ProtoPathway**, an interpretable-by-design multimodal framework for cancer survival prediction.
19
+
20
+ - **Paper:** [ProtoPathway: Biologically Structured Prototype-Pathway Fusion for Multimodal Cancer Survival Prediction](https://huggingface.co/papers/2605.21454)
21
+ - **Code:** [https://github.com/AmayaGS/ProtoPathway](https://github.com/AmayaGS/ProtoPathway)
22
+
23
+ ## Layout
24
+
25
+ ```
26
+ pathways/pathways_base_*.pkl curated Reactome + Hallmark pathway graph
27
+ raw_inputs/ raw files for re-running preprocessing from scratch
28
+ Reactome/ Reactome hierarchy files (GMT, relations, names)
29
+ Hallmark/ MSigDB Hallmark gene sets
30
+ {cohort}/ rna_clean.csv, clinical CSV, SurvPath splits
31
+ cohorts/{cohort}/ preprocessed cohort data and trained models
32
+ gene_expression.csv preprocessed expression matrix
33
+ bipartite_graph.pt cohort-specific gene-pathway graph
34
+ labels.csv survival times, events, and bins
35
+ data_splits.pkl 5-fold CV splits (SurvPath-compatible)
36
+ checkpoints/best_fold_{0..4}.pt trained model weights
37
+ ```
38
+
39
+ ## Cohorts
40
+
41
+ Five TCGA cohorts: BRCA (N=714), BLCA (N=359), COADREAD (N=227),
42
+ HNSC (N=392), STAD (N=318). Gene expression is the preprocessed
43
+ SurvPath release. WSI patch features (UNI2-h) are not redistributed
44
+ here and should be obtained from the
45
+ [Mahmood Lab](https://huggingface.co/MahmoodLab/UNI2-h) directly.
46
+
47
+ ## Quick load
48
+
49
+ ```python
50
+ from huggingface_hub import snapshot_download
51
+
52
+ # Everything for one cohort plus the shared pathway file
53
+ snapshot_download(
54
+ repo_id="AmayaGS/ProtoPathway",
55
+ local_dir="./assets",
56
+ allow_patterns=["cohorts/TCGA-BLCA/*", "pathways/*"],
57
+ )
58
+ ```
59
+
60
+ ## Citation
61
+
62
+ ```bibtex
63
+ @article{protopathway2026,
64
+ title = {ProtoPathway: Biologically Structured Prototype-Pathway Fusion for Multimodal Cancer Survival Prediction},
65
+ author = {Amaya Gallagher-Syed, Costantino Pitzalis, Myles J. Lewis, Michael R.
66
+ Barnes, Gregory Slabaugh},
67
+ journal = {arXiv preprint arXiv:2605.21454},
68
+ year = {2026},
69
+ }
70
+ ```