YAML Metadata Warning:empty or missing yaml metadata in repo card
Check out the documentation for more information.
𧬠Pan-Genomic Saudi Edge-VCF
A universal, 100% offline, mobile-native clinical variant interpreter for the Saudi population.
Built by BABA AI Solutions using Teacher-Student Distillation with the PAVS 2026 Framework.
Architecture
βββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββ
β NTv3_650M_post (Teacher, 650M params) β
β InstaDeepAI Nucleotide Transformer v3 β
β β Universal Genomic Embeddings (1536-d) β
ββββββββββββββββββββ¬βββββββββββββββββββββββββββββββββββββββββββ
β TimeKD Distillation
β (KL-Divergence + SmoothL1 Feature Alignment)
βΌ
βββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββ
β Edge-RGF-Pan (Student, 76,141 params < 100k budget) β
β βββββββββββββββ βββββββββββ ββββββββββββββββββββββββ β
β β Input Proj βββ 1D-CNN βββ Bi-GRU (bidirect.) β β
β β 1536β32 β β 24ch β β hidden=28, out=56 β β
β βββββββββββββββ βββββββββββ ββββββββββββ¬ββββββββββββ β
β β β
β βββββββββββββββββββββββββββββββββββββββββββ β
β β FiLM Conditioning Layer ββ β
β β context = [HPO_Emb(8), Saudi_AF(1), βββ Classifier β
β β Consanguinity_F(1)] ββ β 5 classes β
β β output = (1 + Ξ³) Β· h + Ξ² ββ β
β βββββββββββββββββββββββββββββββββββββββββββ β
βββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββ
Parameter Budget
| Component | Parameters | Share |
|---|---|---|
| Input Projection (1536β32) | 49,184 | 64.6% |
| 1D-CNN (24 channels) | 4,664 | 6.1% |
| Bi-GRU (hidden=28) | 10,416 | 13.7% |
| HPO Embedder | 512 | 0.7% |
| FiLM Generator | 2,080 | 2.7% |
| Classifier | 1,989 | 2.6% |
| Feature Projection | 7,296 | 9.6% |
| TOTAL | 76,141 | < 100k β |
Package Structure
edge_vcf_pan/
βββ __init__.py # Package metadata
βββ config.py # Saudi allele frequencies, HPO ontology, PAVS 2026 KG
βββ teacher.py # NTv3_650M_post wrapper (+ simulation mode)
βββ student.py # Edge-RGF-Pan: 1D-CNN + Bi-GRU + FiLM (<100k params)
βββ distillation.py # TimeKD loss + training loop
βββ agent.py # smolagents v1.24.0 CodeAgent + 3 tools
βββ export.py # ONNX + INT8 TFLite export
βββ main.py # End-to-end demo
exported_models/
βββ edge_rgf_pan.onnx # ONNX model (77 KB)
βββ edge_rgf_pan_int8.tflite # INT8 quantized TFLite (108 KB)
Clinical Agent Tools
| Tool | Type | Function |
|---|---|---|
query_pan_saudi_db |
Tool subclass |
Lazy-loads Edge-RGF-Pan, queries PAVS 2026 knowledge graph |
hpo_reasoner |
@tool |
Maps raw symptoms β HPO IDs β ranked candidate genes |
clinical_reporter |
@tool |
ACMG Class 1-5 pathogenicity classification with Saudi context |
Verified Clinical Cases
Case A: Alport Syndrome (Renal)
- Symptoms: "blood in urine, hearing loss"
- Consanguinity: first_cousin (F=0.0625)
- HPO Mapping: HP:0000790 (Hematuria), HP:0012587 (Macroscopic hematuria), HP:0000407 (Hearing impairment)
- Top Gene: COL4A3 β c.765delC (50x Saudi enrichment)
- Classification: Differential renal panel triggered
Case B: Retinal Dystrophy (Vision)
- Symptoms: "retinal spots, night blindness"
- Consanguinity: second_cousin (F=0.015625)
- HPO Mapping: HP:0000580 (Pigmentary retinopathy), HP:0000662 (Nyctalopia)
- Top Gene: RPE65 β c.271C>T (15x Saudi enrichment)
- Classification: Differential retinal panel triggered
β Same model, different phenotype contexts β different clinical classifications
Quick Start
pip install torch smolagents onnx onnxscript
python -m edge_vcf_pan.main
License & Disclaimer
Research/clinical decision support only. All findings must be validated by a certified clinical geneticist. Not a medical device.
Built by BABA AI Solutions β Edge-VCF-Pan v1.0
- Downloads last month
- 5
Inference Providers NEW
This model isn't deployed by any Inference Provider. π Ask for provider support