| # Alzheimer's Disease & Neuroscience Dataset Access Guide |
| **Compiled: 2026-04-05** |
|
|
| --- |
|
|
| ## TABLE OF CONTENTS |
|
|
| 1. [NACC](#1-nacc) |
| 2. [OASIS-3](#2-oasis-3) |
| 3. [Bio-Hermes-001](#3-bio-hermes-001) |
| 4. [ADSP / NIAGADS](#4-adsp--niagads) |
| 5. [HCP-Aging](#5-hcp-aging) |
| 6. [PREVENT-AD](#6-prevent-ad) |
| 7. [ANMerge (AddNeuroMed)](#7-anmerge-addneuromed) |
| 8. [ROSMAP](#8-rosmap) |
| 9. [Mayo Clinic Study of Aging (MCSA)](#9-mayo-clinic-study-of-aging) |
| 10. [AIBL](#10-aibl) |
| 11. [DIAN](#11-dian) |
| 12. [AD Workbench / AD Discovery Portal](#12-ad-workbench--ad-discovery-portal) |
| 13. [GAAIN](#13-gaain) |
| 14. [AMP-AD (via Synapse)](#14-amp-ad-via-synapse) |
| 15. [ADNI](#15-adni---bonus-essential-dataset) |
| 16. [Allen SEA-AD](#16-allen-sea-ad) |
| 17. [UK Biobank](#17-uk-biobank) |
| 18. [EEG/MEG Datasets](#18-eegmeg-datasets-for-ad) |
| 19. [Kaggle Datasets](#19-kaggle-ad-datasets) |
| 20. [Additional/Recent Datasets](#20-additional--recent-datasets-2025-2026) |
|
|
| --- |
|
|
| ## 1. NACC |
| **National Alzheimer's Coordinating Center** |
|
|
| | Field | Details | |
| |-------|---------| |
| | **Subjects** | 54,000+ from 39+ Alzheimer's Disease Research Centers | |
| | **Apply URL** | https://naccdata.org/requesting-data/nacc-data/ | |
| | **DUA PDF** | https://files.alz.washington.edu/documentation/nacc_data_use_agreement.pdf | |
| |
| ### Application Requirements |
| - Sign a Data Use Agreement (DUA) -- takes ~15 minutes |
| - Submit electronic data request describing your research project |
| - Must search for similar existing projects to ensure no significant overlap |
| - Must identify a unique research hypothesis |
| - Any researcher affiliated with a scientific/educational institution |
| |
| ### Approval Timeline |
| - NACC acknowledges request within **3 business days** |
| - Approved researchers receive data within **48 hours** (excluding weekends/holidays) |
| - One of the fastest turnarounds of any AD dataset |
| |
| ### What You Get |
| - **Quick Access File**: UDS (Uniform Data Set) standardized longitudinal data (CSV) |
| - **Imaging**: MRI (T1w, FLAIR, DTI, T2) and PET scans in DICOM and NIfTI (.zip archives) |
| - **Biomarkers**: CSF biomarkers, APOE genotypes, fluid biomarker data (via NCRAD) |
| - **Genomics**: Genetic/genomic data (via NIAGADS) |
| - **Disease modules**: FTLD, Lewy Body Dementia, Down Syndrome modules |
| - **Modalities**: Clinical, neuropathology, MRI/PET, biospecimen, digital, EHR/Claims |
| |
| ### Restrictions |
| - Data solely for identified individuals in the request |
| - Must acknowledge NACC in publications |
| - Must notify NACC before and during publication submission |
| - Cannot attempt to re-identify participants |
| |
| --- |
| |
| ## 2. OASIS-3 |
| **Open Access Series of Imaging Studies - 3** |
| |
| | Field | Details | |
| |-------|---------| |
| | **Subjects** | 1,378 participants (ages 42-95); 755 cognitively normal, 622 with cognitive decline | |
| | **Apply URL** | https://www.nitrc.org/projects/oasis3/ (main) | |
| | **Tau PET Access** | https://sites.wustl.edu/oasisbrains/home/oasis-3/request-tau-access/ | |
| | **Contact** | oasis-brains@nrg.wustl.edu | |
| |
| ### Application Requirements |
| - Register through NITRC (Neuroimaging Informatics Tools and Resources Clearinghouse) |
| - Click "REQUEST ACCESS TO DATASETS" on the OASIS website |
| - For Tau PET data: send a detailed research statement to oasis-brains@nrg.wustl.edu |
| - Must already have OASIS-3 main dataset approval before requesting Tau data |
| |
| ### Approval Timeline |
| - Not explicitly stated; typically **1-2 weeks** for NITRC registration |
| - Tau data requires separate approval |
| |
| ### What You Get |
| - **MRI**: 2,842 sessions -- T1w, T2w, FLAIR, ASL, SWI, time-of-flight, resting-state BOLD, DTI |
| - **PET**: 2,157+ scans -- PIB, AV45 (amyloid), FDG, AV1451 (Tau; 451 baseline + 85 longitudinal) |
| - **CT**: 1,472 sessions |
| - **FreeSurfer**: Volumetric segmentations for many MR sessions |
| - **Clinical**: Cognitive assessments, UDS forms |
| - **Formats**: NIfTI, BIDS-compatible |
| - **Size**: Multiple TB total |
| |
| ### Restrictions |
| - Cannot attempt to identify participants (including facial recognition/3D rendering) |
| - Publications using AV45 or AV1451 PET data must be submitted to **Avid Radiopharmaceuticals** for review **30 days** before publication/presentation |
| - As of Dec 2025, Tau data integrated into main OASIS-3 project |
| |
| --- |
| |
| ## 3. Bio-Hermes-001 |
| **Global Alzheimer's Platform Foundation** |
| |
| | Field | Details | |
| |-------|---------| |
| | **Subjects** | 80,000+ blood and digital test results | |
| | **Apply URL** | https://www.alzheimersdata.org/ad-workbench (via AD Discovery Portal) | |
| | **Study Page** | https://globalalzplatform.org/biohermesstudy/ | |
| |
| ### Application Requirements |
| - Create a free account on AD Workbench (alzheimersdata.org) |
| - Request access through the AD Discovery Portal |
| - Accept Terms of Use |
| - Available since August 1, 2025 |
| |
| ### Approval Timeline |
| - Varies by dataset; generally **days to weeks** after account approval |
| |
| ### What You Get |
| - **Blood biomarkers**: Comprehensive plasma/serum biomarker panels |
| - **Digital cognitive tests**: Novel digital cognitive assessment data |
| - **Retinal imaging**: Retinal exam data |
| - **Speech analysis**: Voice/speech biomarker data |
| - **PET imaging**: Traditional amyloid/tau PET |
| - **Deeply diverse cohort**: One of the most diverse AD datasets available |
| - Most comprehensive biomarker dataset in Alzheimer's research history |
| |
| ### Restrictions |
| - Must use data for Alzheimer's/dementia research purposes |
| - Must accept AD Workbench Terms of Use |
| - Free access, no cost |
| |
| --- |
| |
| ## 4. ADSP / NIAGADS |
| **Alzheimer's Disease Sequencing Project / NIA Genetics of AD Data Storage Site** |
| |
| | Field | Details | |
| |-------|---------| |
| | **Subjects** | 110,270+ (58,507 whole genomes + 20,503 whole exomes + more) | |
| | **Apply URL** | https://dss.niagads.org/ | |
| | **Application Instructions** | https://dss.niagads.org/documentation/data-application-and-submission/application-instructions/ | |
| | **ADSP Umbrella Dataset** | https://dss.niagads.org/datasets/ng00067/ | |
| |
| ### Application Requirements |
| 1. Current **IRB approval** and protocol for the proposed project (must have 6+ months remaining) |
| 2. **NIA Genomic Data Sharing Plan** (signed by PI and Institutional Signing Official) |
| 3. **NIAGADS Data Distribution Agreement** |
| 4. **Derived/Secondary Data Return Plan** describing what data you will return |
| 5. Research use statement (technical and non-technical) |
| |
| ### Approval Timeline |
| - Applications reviewed by Data Access Committee |
| - Typically **4-8 weeks** (depends on completeness of application) |
| - Approval valid for **1 year** (renewable) |
| - IRB must have 6+ months validity at time of review |
| |
| ### What You Get |
| - **Whole Genome Sequences**: 58,507 in CRAMs, gVCFs |
| - **Whole Exome Sequences**: 20,503 in CRAMs, gVCFs |
| - **Quality-controlled VCFs**: Project-level variant calls |
| - **Harmonized phenotypes**: Standardized clinical data |
| - **Formats**: CRAM, gVCF, VCF, CSV |
| |
| ### Restrictions |
| - Must return derived/secondary data to NIAGADS upon publication or DAR expiration |
| - Data only for approved research use statement |
| - Local IRB approval required |
| - Cannot share with unauthorized users |
| |
| --- |
| |
| ## 5. HCP-Aging |
| **Human Connectome Project - Aging / AABC** |
| |
| | Field | Details | |
| |-------|---------| |
| | **Subjects** | 1,396+ adults (ages 36-100+), 2,878 sessions | |
| | **Apply URL** | https://nda.nih.gov/ (via NIMH Data Archive permissions dashboard) | |
| | **Data Use Terms** | https://www.humanconnectome.org/study/hcp-lifespan-aging/data-use-terms | |
| | **Instructions** | https://www.humanconnectome.org/study/hcp-lifespan-aging/article/instructions-accessing-hcp-aging-data-releases-nda | |
| |
| ### Application Requirements |
| 1. Create NDA (NIMH Data Archive) account |
| 2. Submit **Data Use Certification (DUC)** via NDA Permissions Dashboard |
| 3. Describe how data will be accessed, managed, and eventually deleted |
| 4. Institutional sign-off required |
| 5. Annual renewal with progress report |
| |
| ### Approval Timeline |
| - **2-4 weeks** for DUC approval (varies) |
| - Contact: NDAhelp@mail.nih.gov for status |
| |
| ### What You Get |
| - **Lifespan 2.0 Release**: 725 HCP-A participants + AABC Release 2 (1,396 participants) |
| - **Structural MRI**: T1w, T2w, high-res hippocampal T2 |
| - **Functional MRI**: Resting state fMRI, task fMRI |
| - **Diffusion MRI**: DTI |
| - **ASL**: Arterial spin labeling perfusion |
| - **Phenotypic data**: Demographics, behavioral assessments |
| - **Total size**: 22+ TB |
| - **Formats**: NIfTI, CIFTI, CSV |
| |
| ### Restrictions |
| - Must adhere to consent-based data use limitations |
| - Cannot attempt to re-identify participants |
| - Must describe data management/deletion plan |
| - Annual renewal required |
| |
| --- |
| |
| ## 6. PREVENT-AD |
| **Pre-symptomatic Evaluation of Novel Treatments for AD** |
| |
| | Field | Details | |
| |-------|---------| |
| | **Subjects** | 349 cognitively healthy at-risk participants (mean age 63) | |
| | **Open Data Portal** | https://openpreventad.loris.ca | |
| | **Registered Data Portal** | https://registeredpreventad.loris.ca | |
| | **Publication** | https://doi.org/10.1016/j.nicl.2021.102733 | |
| |
| ### Application Requirements |
| - **Open imaging data**: Freely accessible at openpreventad.loris.ca -- just register |
| - **Sensitive data** (CSF, genetics, cognition): Apply at registeredpreventad.loris.ca |
| - Must agree to standard good data use practices |
| - Must meet ethics requirements and keep data secure |
| - Findable through Canadian Open Neuroscience Platform (CONP) |
| |
| ### Approval Timeline |
| - **Open data**: Immediate after registration |
| - **Registered data**: Days to weeks for qualified researcher approval |
| |
| ### What You Get |
| - **Imaging**: Up to 5 years longitudinal MRI data (structural, functional) |
| - **Biomarkers**: Cerebrospinal fluid biochemistry |
| - **Genetics**: Genetic information |
| - **Cognitive**: Neurocognitive assessments |
| - **Neurosensory**: Sensory capacity measurements |
| - **Medical**: Clinical/medical information |
| - **Formats**: BIDS-compatible NIfTI, CSV |
| |
| ### Restrictions |
| - Must use for neuroscience research as stipulated in consent forms |
| - Cannot attempt to re-identify participants |
| - Must acknowledge PREVENT-AD in publications |
| |
| --- |
| |
| ## 7. ANMerge (AddNeuroMed) |
| |
| | Field | Details | |
| |-------|---------| |
| | **Subjects** | 1,702 participants | |
| | **Access URL** | https://doi.org/10.7303/syn22252881 (via Synapse) | |
| | **Publication** | https://doi.org/10.3233/JAD-200948 | |
| |
| ### Application Requirements |
| - Create a free Synapse account (accounts.synapse.org) |
| - Accept data use conditions on Synapse |
| - No complex approval process -- this is an open-access dataset |
| |
| ### Approval Timeline |
| - **Immediate to days** -- relatively straightforward once Synapse account is set up |
| |
| ### What You Get |
| - **Clinical assessments**: Longitudinal observational cohort data |
| - **MRI**: Magnetic resonance imaging |
| - **Genotyping**: Genetic variants |
| - **Transcriptomics**: Gene expression profiling (whole-blood RNA) |
| - **Proteomics**: Blood plasma proteomics |
| - **Formats**: CSV, processed data tables |
| - Fully interoperable between modalities with rigorous data curation |
| |
| ### Restrictions |
| - Must cite the dataset and primary publication |
| - Standard Synapse data use conditions apply |
| |
| --- |
| |
| ## 8. ROSMAP |
| **Religious Orders Study / Memory and Aging Project** |
| |
| | Field | Details | |
| |-------|---------| |
| | **Subjects** | 3,600+ participants (longitudinal since early 1990s) | |
| | **AD Knowledge Portal** | https://adknowledgeportal.synapse.org/Explore/Studies/DetailsPage/StudyDetails?Study=syn3219045 | |
| | **RADC Hub** | https://www.radc.rush.edu/ | |
| | **Molecular Networks** | https://www.radc.rush.edu/molecular_networks/datasets.html | |
|
|
| ### Application Requirements |
| - **Omics data on Synapse**: Register for free Synapse account; some datasets require a signed Data Use Certificate (DUC) |
| - **Clinical/demographic data**: Request through Rush Alzheimer's Disease Center (RADC) Research Resource Sharing Hub |
| - **Additional phenotypes**: Separate request through RADC |
|
|
| ### Approval Timeline |
| - **Synapse open data**: Days (account registration) |
| - **DUC-protected data**: 2-4 weeks for approval |
| - **RADC data**: Variable, typically weeks |
|
|
| ### What You Get |
| - **Genomics**: Whole-genome sequencing |
| - **Transcriptomics**: RNA-seq, single-nucleus RNA-seq |
| - **Epigenomics**: DNA methylation, histone modifications, chromatin accessibility |
| - **Proteomics**: Mass spectrometry-based proteomics |
| - **Metabolomics**: Metabolite profiling |
| - **Clinical**: Longitudinal clinical assessments (since 1990s) |
| - **Neuropathology**: Post-mortem brain tissue analyses |
| - **Formats**: FASTQ, BAM, VCF, CSV, H5AD |
|
|
| ### Restrictions |
| - Must acknowledge data source in publications |
| - DUC-protected datasets (esp. Mayo/Broad samples from deceased individuals) have additional consent requirements |
| - Must use for research purposes consistent with informed consent |
|
|
| --- |
|
|
| ## 9. Mayo Clinic Study of Aging (MCSA) |
|
|
| | Field | Details | |
| |-------|---------| |
| | **Subjects** | 5,925 unique participants (clinical); 1,802 (imaging); ages 30-90 | |
| | **GAAIN Access** | https://www.gaaindata.org/partner/MCSA | |
| | **LONI IDA** | https://ida.loni.usc.edu/collaboration/access/appApply.jsp?project=MCSA | |
| | **Synapse** | https://adknowledgeportal.synapse.org/Explore/Studies/DetailsPage?Study=syn22024536 | |
|
|
| ### Application Requirements |
| - Apply via LONI IDA or GAAIN |
| - Agree to Data Use Agreement |
| - Describe research project and collaborators |
| - Must be a qualified academic or industry researcher |
|
|
| ### Approval Timeline |
| - **2-4 weeks** for LONI IDA review |
| - GAAIN requests reviewed individually |
|
|
| ### What You Get |
| - **Clinical data**: Longitudinal data, 1-12 visits at ~15-month intervals |
| - **MRI**: T1w, T2w-FLAIR, diffusion MRI from 1,802 participants |
| - **Future releases**: De-faced amyloid PET images planned |
| - **Molecular data**: Whole-genome genotype + gene expression from 2,655 individuals (842M+ datapoints) |
| - **Interactive tool**: Multiomic Atlas of AD Brain Endophenotypes (free web app) |
| - **Formats**: NIfTI, CSV |
|
|
| ### Restrictions |
| - Standard DUA restrictions apply |
| - Cannot re-identify participants |
| - Must acknowledge Mayo Clinic in publications |
|
|
| --- |
|
|
| ## 10. AIBL |
| **Australian Imaging, Biomarkers and Lifestyle** |
|
|
| | Field | Details | |
| |-------|---------| |
| | **Subjects** | 1,112 inception cohort (expanded to 2,359+) | |
| | **Apply URL** | https://ida.loni.usc.edu/collaboration/access/appApply.jsp?project=AIBL | |
| | **Joint AIBL+ADNI** | https://ida.loni.usc.edu/collaboration/access/appApply.jsp?project=AIBL&project=ADNI | |
| | **Website** | https://aibl.org.au/ | |
|
|
| ### Application Requirements |
| - Apply via LONI IDA online form |
| - Read and agree to AIBL project Terms of Use (carefully) |
| - Describe your research and list collaborators |
| - Can apply jointly for AIBL + ADNI data |
|
|
| ### Approval Timeline |
| - **2-4 weeks** (similar to ADNI process via LONI) |
|
|
| ### What You Get |
| - **Neuroimaging**: Amyloid PET (PiB, flutemetamol), FDG PET, structural MRI |
| - **Blood biomarkers**: Amyloid-beta, tau, inflammatory markers |
| - **CSF biomarkers**: Cerebrospinal fluid analytes |
| - **Cognitive assessments**: Battery of neuropsychological tests |
| - **Genetics**: APOE genotyping and broader genetic data |
| - **Lifestyle**: Diet, exercise, sleep, social engagement data |
| - **Longitudinal**: Assessments every 18 months |
| - **Formats**: DICOM/NIfTI (imaging), CSV (clinical) |
|
|
| ### Restrictions |
| - Must comply with AIBL Terms of Use |
| - Acknowledge AIBL in publications |
| - Cannot re-identify participants |
|
|
| --- |
|
|
| ## 11. DIAN |
| **Dominantly Inherited Alzheimer Network** |
|
|
| | Field | Details | |
| |-------|---------| |
| | **Subjects** | 533 individuals across 206 families (autosomal dominant AD) | |
| | **Data Request Form** | https://dian.wustl.edu/dian-observational-data-request-form/ | |
| | **Website** | https://dian.wustl.edu | |
| | **Investigator Resources** | https://dian.wustl.edu/for-investigators/ | |
|
|
| ### Application Requirements |
| - Submit DIAN Observational Data Request Form |
| - Research proposal reviewed by DIAN Obs Resource Committee |
| - Must be a qualified researcher |
| - Must accept and comply with DIAN data sharing/publication policies |
| - Strict publication policy: all publications using DIAN data must follow their guidelines |
|
|
| ### Approval Timeline |
| - **Weeks to months** -- committee review required |
| - More restrictive than most datasets |
|
|
| ### What You Get |
| - **MRI**: Structural and functional brain imaging |
| - **PET**: Amyloid and tau PET scans |
| - **Clinical**: Longitudinal cognitive/clinical assessments |
| - **Biofluid**: CSF and blood biomarkers |
| - **Genetics**: Deep genetic phenotyping (PSEN1, PSEN2, APP mutations) |
| - **15+ years** of longitudinal data on autosomal dominant AD |
| - **Unique value**: Only large-scale dataset on dominantly inherited (genetic) AD |
|
|
| ### Restrictions |
| - Strict publication and authorship policies |
| - Violations can result in being barred from future data/biospecimen requests |
| - Potential institutional involvement or legal action for policy deviations |
| - Must comply with DIAN-TU Data and Biospecimen Sharing Policy |
|
|
| --- |
|
|
| ## 12. AD Workbench / AD Discovery Portal |
|
|
| | Field | Details | |
| |-------|---------| |
| | **Datasets** | 100+ novel datasets across multiple modalities | |
| | **Portal URL** | https://discover.alzheimersdata.org | |
| | **Workbench URL** | https://www.alzheimersdata.org/ad-workbench | |
| | **How-To Guide** | https://www.alzheimersdata.org/how-to-use-the-ad-workbench | |
|
|
| ### Application Requirements |
| - Create a free user account on AD Workbench |
| - Account creation is automatic; data permissions require review |
| - Use FAIR Search for data discovery |
| - Request workspace for analysis |
| - Accept Terms of Use |
|
|
| ### Approval Timeline |
| - **Account creation**: Immediate |
| - **Data access permissions**: Variable per dataset (days to weeks) |
|
|
| ### What You Get |
| - **100+ datasets**: Imaging, omics, clinical, multi-modal |
| - **Cloud workspaces**: Secure, private analysis environments |
| - **Tools**: Data visualization, curation, combination tools |
| - **Bio-Hermes-001**: Available through this portal |
| - **Free**: No cost for any tool or data access |
|
|
| ### Restrictions |
| - Must be approved by ADDI |
| - Data use per individual dataset terms |
| - Research purposes only |
|
|
| --- |
|
|
| ## 13. GAAIN |
| **Global Alzheimer's Association Interactive Network** |
|
|
| | Field | Details | |
| |-------|---------| |
| | **Subjects** | ~500,000 from nearly 50 institutions worldwide | |
| | **Portal URL** | https://www.gaaindata.org | |
| | **Website** | https://gaain.org | |
|
|
| ### Application Requirements |
| - Register on gaaindata.org |
| - Free access for researchers worldwide |
| - Use the Interrogator tool for cohort discovery and analysis |
|
|
| ### Approval Timeline |
| - **Immediate to days** for most federated queries |
| - Individual partner datasets may have their own access requirements |
|
|
| ### What You Get |
| - **Federated data platform**: Query across multiple cohorts simultaneously |
| - **Clinical data**: Cognitive scores, demographics, diagnoses |
| - **Imaging data**: Various imaging modalities from partner studies |
| - **Genomics**: Genetic data from contributing cohorts |
| - **Analytics tools**: Built-in analytics and visualization |
| - **Partner datasets**: MCSA, ADNI, and many international cohorts |
|
|
| ### Restrictions |
| - Individual partner datasets retain their own data use policies |
| - Cannot download all raw data -- federated query model |
| - Must acknowledge GAAIN and contributing studies |
|
|
| --- |
|
|
| ## 14. AMP-AD (via Synapse) |
| **Accelerating Medicines Partnership - Alzheimer's Disease** |
|
|
| | Field | Details | |
| |-------|---------| |
| | **Portal URL** | https://adknowledgeportal.synapse.org | |
| | **Data Access Instructions** | https://adknowledgeportal.synapse.org/DataAccess/Instructions | |
| | **Account Registration** | https://accounts.synapse.org/ | |
|
|
| ### Application Requirements |
| - Register for free Synapse account |
| - Browse public content freely |
| - Download data requires Synapse login |
| - Some datasets require signed Data Use Certificate (DUC) |
| - DUC datasets (esp. from Mayo Clinic/Broad Institute with deceased donor samples) have additional review |
|
|
| ### Approval Timeline |
| - **Open data**: Immediate after registration |
| - **DUC-protected data**: 1-4 weeks |
|
|
| ### What You Get |
| - **Multi-omics**: Genomics, transcriptomics, epigenomics, proteomics, metabolomics |
| - **Studies include**: ROSMAP, MayoRNAseq, MSBB (Mount Sinai Brain Bank), many more |
| - **Clinical**: Longitudinal clinical and neuropathological data |
| - **Tools**: Analysis pipelines, pre-computed results |
| - **Formats**: FASTQ, BAM, VCF, CSV, H5AD, AnnData |
|
|
| ### Restrictions |
| - Data use conditions per informed consent of each study |
| - Must acknowledge AMP-AD and NIA |
| - Some datasets have embargo periods for new data |
|
|
| --- |
|
|
| ## 15. ADNI - BONUS ESSENTIAL DATASET |
| **Alzheimer's Disease Neuroimaging Initiative** |
|
|
| | Field | Details | |
| |-------|---------| |
| | **Subjects** | 2,000+ across ADNI-1, ADNI-2, ADNI-GO, ADNI-3, ADNI-4 | |
| | **Apply URL** | https://ida.loni.usc.edu/collaboration/access/appApply.jsp | |
| | **Website** | https://adni.loni.usc.edu/ | |
|
|
| ### Application Requirements |
| - Review and agree to ADNI Data Use Agreement |
| - Application reviewed by Data Sharing and Publications Committee (DPC) |
| - Must be affiliated with a scientific or educational institution |
| - Describe proposed research or data use |
|
|
| ### Approval Timeline |
| - **~2 weeks** for DPC review |
|
|
| ### What You Get |
| - **MRI**: Structural, functional, DTI |
| - **PET**: Amyloid (AV45/PiB), Tau (AV1451), FDG |
| - **Biomarkers**: CSF (amyloid-beta, tau, p-tau), blood biomarkers |
| - **Genomics**: GWAS, WGS, WES |
| - **Clinical**: Longitudinal cognitive/clinical assessments |
| - **Formats**: DICOM, NIfTI, CSV |
|
|
| ### Restrictions (IMPORTANT - 2025 UPDATE) |
| - **New AI restriction**: ADNI DUA now **explicitly forbids** use of external AI tools on the data |
| - AI tools restricted to within university/company (no external release allowed) |
| - Cannot share data with others |
| - Must acknowledge ADNI in all publications |
| - Must follow ADNI publication policies |
|
|
| --- |
|
|
| ## 16. Allen SEA-AD |
| **Seattle Alzheimer's Disease Brain Cell Atlas** |
|
|
| | Field | Details | |
| |-------|---------| |
| | **Subjects** | 84 donors spanning AD pathology spectrum | |
| | **Open Data** | https://portal.brain-map.org/explore/seattle-alzheimers-disease/seattle-alzheimers-disease-brain-cell-atlas-download | |
| | **AWS Registry** | https://registry.opendata.aws/allen-sea-ad-atlas/ | |
| | **Controlled Data** | Via AD Knowledge Portal (Synapse) | |
|
|
| ### Application Requirements |
| - **Open/processed data**: No application needed -- freely downloadable |
| - **Raw sequencing data**: Apply through Synapse AD Knowledge Portal |
|
|
| ### Approval Timeline |
| - **Open data**: Immediate |
| - **Controlled raw data**: 1-4 weeks via Synapse |
|
|
| ### What You Get |
| - **snRNA-seq**: Single-nucleus RNA sequencing |
| - **snATAC-seq**: Single-nucleus chromatin accessibility |
| - **Multiome**: Combined RNA + ATAC from same nuclei |
| - **Neuropathology**: Quantitative pathology data |
| - **Spatial transcriptomics**: MERFISH data |
| - **Formats**: H5AD, AnnData, CSV, FASTQ (raw) |
| - **Already in your project**: You have SEA-AD metadata in `/data/allen_sea_ad/` |
|
|
| ### Restrictions |
| - Must cite per Allen Institute Citation Policy |
| - Cite both primary publication and specific dataset |
|
|
| --- |
|
|
| ## 17. UK Biobank |
|
|
| | Field | Details | |
| |-------|---------| |
| | **Subjects** | 500,000+ (26,000+ with brain MRI) | |
| | **Apply URL** | https://www.ukbiobank.ac.uk/enable-your-research/register | |
| | **Website** | https://www.ukbiobank.ac.uk | |
|
|
| ### Application Requirements |
| - Register as a researcher |
| - Submit research application describing study |
| - Institutional affiliation required |
| - Application fee: ~2,000-5,000 GBP (varies by data type) |
| - Ethics approval may be required |
|
|
| ### Approval Timeline |
| - **Several weeks to months** for full approval |
|
|
| ### What You Get |
| - **Brain MRI**: 26,000+ participants with structural/functional imaging |
| - **4,000+ imaging-derived phenotypes**: Pre-computed brain measures |
| - **Genetics**: Genome-wide genotyping, exome sequencing, WGS |
| - **Clinical**: GP records, hospital admissions, cognitive tests |
| - **Lifestyle**: Diet, exercise, socioeconomic data |
| - **Longitudinal**: Repeat visits over 15+ years |
| - **Formats**: Various (bulk data downloads) |
|
|
| ### Restrictions |
| - Application fee required |
| - Strict data security requirements |
| - Must return results/findings |
| - UK-based ethics oversight |
| - Not AD-specific but massive AD-relevant subset |
|
|
| --- |
|
|
| ## 18. EEG/MEG Datasets for AD |
|
|
| ### 18a. OpenNeuro EEG AD Dataset (ds004504) |
| | Field | Details | |
| |-------|---------| |
| | **Subjects** | 88 (36 AD, 23 FTD, 29 healthy) | |
| | **URL** | https://openneuro.org/datasets/ds004504 | |
| | **Access** | Free, immediate download | |
| | **Format** | BIDS-compliant EEG (EDF) | |
| | **License** | CC0 (public domain) | |
| | **Content** | Resting state EEG (eyes closed), raw + preprocessed | |
|
|
| ### 18b. Complementary Photic Stimulation EEG Dataset (2025) |
| | Field | Details | |
| |-------|---------| |
| | **Subjects** | Same 88 participants as ds004504 | |
| | **Content** | Eyes-open photic stimulation recordings | |
| | **Format** | BIDS-compliant | |
| | **Published** | April 2025 | |
|
|
| ### 18c. PEARL-Neuro Database |
| | Field | Details | |
| |-------|---------| |
| | **Subjects** | 192 middle-aged (50-63) at-risk participants | |
| | **URL** | https://openneuro.org/datasets/ds004796 | |
| | **Content** | EEG + fMRI + APOE/PICALM genetics + psychometric tests + blood tests | |
| | **Access** | Open access via OpenNeuro | |
| | **Format** | BIDS-compliant | |
| | **Unique value** | Multi-modal (EEG + fMRI) with genetic risk factors | |
|
|
| ### 18d. LEAD Corpus (Research Reference) |
| | Field | Details | |
| |-------|---------| |
| | **Subjects** | 813 across 9 combined datasets (330 public, 483 private) | |
| | **Publication** | https://arxiv.org/html/2502.01678v1 | |
| | **Note** | World's largest EEG-AD corpus; public subset downloadable, private portion restricted | |
|
|
| --- |
|
|
| ## 19. Kaggle AD Datasets |
|
|
| ### 19a. Alzheimer MRI 4-Class Dataset |
| | Field | Details | |
| |-------|---------| |
| | **URL** | https://www.kaggle.com/datasets/sachinkumar413/alzheimer-mri-dataset | |
| | **Classes** | Non-Demented, Very Mild, Mild, Moderate Demented | |
| | **Size** | ~6,400 images | |
| | **Format** | JPG/PNG | |
| | **Access** | Free, immediate download | |
|
|
| ### 19b. Augmented Alzheimer MRI Dataset |
| | Field | Details | |
| |-------|---------| |
| | **URL** | https://www.kaggle.com/datasets/uraninjo/augmented-alzheimer-mri-dataset | |
| | **Content** | Augmented MRI slices for classification | |
| | **Access** | Free | |
|
|
| ### 19c. Alzheimer's Disease Clinical Dataset |
| | Field | Details | |
| |-------|---------| |
| | **URL** | https://www.kaggle.com/datasets/rabieelkharoua/alzheimers-disease-dataset | |
| | **Content** | Clinical features, demographics, lifestyle, cognitive scores | |
| | **Access** | Free | |
|
|
| ### 19d. OASIS-derived Kaggle Dataset |
| | Field | Details | |
| |-------|---------| |
| | **URL** | https://www.kaggle.com/datasets/jboysen/mri-and-alzheimers | |
| | **Content** | OASIS cross-sectional and longitudinal MRI data | |
| | **Access** | Free | |
|
|
| **Note**: Kaggle datasets are great for prototyping and model development but are NOT suitable for clinical validation or publications requiring primary data. |
|
|
| --- |
|
|
| ## 20. Additional & Recent Datasets (2025-2026) |
|
|
| ### 20a. OASIS-4 (NEW) |
| - Latest release in the OASIS series |
| - MR, clinical, cognitive, and biomarker data for individuals with memory complaints |
| - Access: Same as OASIS-3 via sites.wustl.edu/oasisbrains |
|
|
| ### 20b. CLARiTI (via NACC - NEW) |
| - URL: https://clariti.naccdata.org/for-researchers/access-data |
| - New collaborative data sharing initiative through NACC |
| - Focused on clinical trials integration |
|
|
| ### 20c. Allen Brain Cell Atlas (Broader) |
| - URL: https://portal.brain-map.org/atlases-and-data/bkp/abc-atlas |
| - Broader brain cell atlas including AD-relevant cell types |
| - Python API: https://alleninstitute.github.io/abc_atlas_access/intro.html |
|
|
| ### 20d. ADNI-4 (Latest Phase) |
| - URL: https://adni.loni.usc.edu/ |
| - Newest ADNI phase with updated protocols |
| - Blood-based biomarker focus |
| - NOTE: New AI restrictions in DUA |
|
|
| ### 20e. Bio-Hermes-002 (Upcoming) |
| - GAP Foundation + Alamar Biosciences collaboration announced January 2026 |
| - Next-generation biomarker study building on Bio-Hermes-001 |
| - Watch: https://globalalzplatform.org/ |
|
|
| --- |
|
|
| ## SYNTHETIC / AUGMENTED AD DATA APPROACHES |
|
|
| For when real data is insufficient or for pre-training: |
|
|
| | Approach | Description | Reference | |
| |----------|-------------|-----------| |
| | **CycleGAN MRI augmentation** | Generate synthetic MRI scans; achieved 95% F1 (vs 89% without) | Frontiers in Medicine 2025 | |
| | **SMOTE for tabular data** | Oversample minority AD classes in clinical datasets | Multiple papers 2025 | |
| | **Diffusion model MRI generation** | Generate 2D slice projections of 3D MRI scans | ScienceDirect 2026 | |
| | **3D CNN with data augmentation** | Standard augmentation (flip, rotate, scale) on 3D MRI | arxiv 2505.04097 | |
| | **AdaBoost synthetic generation** | Boost training data diversity for clinical features | PMC 2025 | |
|
|
| --- |
|
|
| ## PRIORITY APPLICATION ORDER (Recommended) |
|
|
| Based on ease of access, data richness, and relevance to multi-modal AD research: |
|
|
| ### Tier 1 -- Apply Immediately (Fast Approval, High Value) |
| 1. **NACC** -- 48hr turnaround, 54K subjects, multi-modal |
| 2. **AD Workbench / Bio-Hermes-001** -- Free account, 80K+ results, diverse cohort |
| 3. **ANMerge** -- Open on Synapse, 1,702 subjects, multi-modal |
| 4. **Allen SEA-AD** -- Open download, single-cell multi-omics (you already have metadata) |
| 5. **OpenNeuro EEG datasets** -- Immediate free download, BIDS format |
| 6. **Kaggle datasets** -- Immediate, good for prototyping |
|
|
| ### Tier 2 -- Apply This Week (1-4 Week Approval) |
| 7. **OASIS-3** -- NITRC registration, rich imaging data |
| 8. **AMP-AD / Synapse** -- Free account, massive multi-omics |
| 9. **ROSMAP** -- Via Synapse + RADC, deep longitudinal omics |
| 10. **AIBL** -- Via LONI IDA, good imaging + lifestyle data |
| 11. **ADNI** -- ~2 week review, gold standard (watch AI restrictions) |
| 12. **MCSA** -- Via GAAIN/LONI, large clinical + imaging release |
|
|
| ### Tier 3 -- Apply When Ready (Longer Approval, More Requirements) |
| 13. **HCP-Aging** -- NDA DUC required, 22+ TB connectome data |
| 14. **ADSP/NIAGADS** -- IRB required, massive genomics |
| 15. **DIAN** -- Committee review, unique genetic AD data |
| 16. **UK Biobank** -- Fee required, months for approval, massive scale |
| 17. **PREVENT-AD registered** -- For pre-symptomatic biomarkers |
| 18. **GAAIN** -- Federated queries across 500K subjects |
|
|
| --- |
|
|
| ## QUICK REFERENCE: ALL APPLICATION URLS |
|
|
| | Dataset | Application URL | |
| |---------|----------------| |
| | NACC | https://naccdata.org/requesting-data/nacc-data/ | |
| | OASIS-3 | https://www.nitrc.org/projects/oasis3/ | |
| | Bio-Hermes-001 | https://www.alzheimersdata.org/ad-workbench | |
| | ADSP/NIAGADS | https://dss.niagads.org/ | |
| | HCP-Aging | https://nda.nih.gov/ | |
| | PREVENT-AD (open) | https://openpreventad.loris.ca | |
| | PREVENT-AD (registered) | https://registeredpreventad.loris.ca | |
| | ANMerge | https://doi.org/10.7303/syn22252881 | |
| | ROSMAP | https://adknowledgeportal.synapse.org | |
| | MCSA | https://ida.loni.usc.edu/collaboration/access/appApply.jsp?project=MCSA | |
| | AIBL | https://ida.loni.usc.edu/collaboration/access/appApply.jsp?project=AIBL | |
| | DIAN | https://dian.wustl.edu/dian-observational-data-request-form/ | |
| | AD Workbench | https://discover.alzheimersdata.org | |
| | GAAIN | https://www.gaaindata.org | |
| | AMP-AD | https://adknowledgeportal.synapse.org | |
| | ADNI | https://ida.loni.usc.edu/collaboration/access/appApply.jsp | |
| | Allen SEA-AD | https://portal.brain-map.org/explore/seattle-alzheimers-disease | |
| | UK Biobank | https://www.ukbiobank.ac.uk/enable-your-research/register | |
| | OpenNeuro EEG | https://openneuro.org/datasets/ds004504 | |
| | PEARL-Neuro | https://openneuro.org/datasets/ds004796 | |
|
|
| --- |
|
|
| ## ESTIMATED TOTAL DATA AVAILABLE |
|
|
| | Category | Approximate Scale | |
| |----------|-------------------| |
| | **Total unique subjects across all datasets** | ~700,000+ | |
| | **Genomics subjects** | ~150,000+ (ADSP, ROSMAP, UK Biobank) | |
| | **Neuroimaging subjects** | ~50,000+ (ADNI, OASIS, HCP, NACC, AIBL, UK Biobank) | |
| | **Clinical/cognitive subjects** | ~600,000+ (NACC, UK Biobank, GAAIN) | |
| | **Single-cell omics** | ~84 donors, millions of cells (SEA-AD) | |
| | **EEG subjects** | ~1,000+ (OpenNeuro, PEARL-Neuro, LEAD corpus) | |
| | **Blood biomarkers** | ~80,000+ results (Bio-Hermes-001) | |
| | **Pre-symptomatic/at-risk** | ~2,000+ (PREVENT-AD, DIAN, HCP-Aging) | |
|
|