| # PD Cognition BioClinicalBERT Fine Tuned spaCy Model |
|
|
| This model is a spaCy pipeline for span categorization trained on clinical text related to Parkinson disease cognition. It uses BioClinicalBERT as the transformer backbone and predicts labeled spans stored under the key sc. |
|
|
| ## Model Details |
|
|
| Base model emilyalsentzer Bio ClinicalBERT |
| Framework spaCy with spacy transformers |
| Task span categorization |
| Language English |
|
|
| ## Labels |
|
|
| The label set is defined in labels.json included in the model directory. |
|
|
| ## Input: |
| Raw clinical text string |
|
|
| ## Output: |
| Predicted spans in doc.spans["sc"] with start end label and score |
|
|
| ## Usage |
|
|
| ```python |
| import spacy |
| |
| nlp = spacy.load("path_to_model_best") |
| doc = nlp("Patient shows cognitive decline and memory impairment") |
| |
| for span in doc.spans["sc"]: |
| print(span.text, span.label_, span.score) |
| ``` |
|
|
| ## Files |
|
|
| model best spaCy pipeline |
| config cfg training configuration |
| meta json pipeline metadata |
| labels json list of span labels |
|
|
| ## Notes |
|
|
| This is a spaCy model and should be loaded with spacy.load not transformers |
| Performance depends on span alignment and threshold tuning |
| Intended for research use on clinical text |
|
|
| ## Citation |
|
|
| ```bibtex |
| @article{khanna2024cognitive, |
| title={Toward Automated Cognitive Assessment in Parkinson’s Disease Using Pretrained Language Models}, |
| author={Khanna, Varada and Bhatt, Nilay and Shin, Ikgyu and Rosso, Mattia and Tinaz, Sule and Ren, Yang and Xu, Hua and Keloth, Vipina K}, |
| journal={arXiv preprint arXiv:2511.08806}, |
| year={2025} |
| } |
| ``` |