Spaces:

tech-doc
/

SkinProAI

Sleeping

App Files Files Community

SkinProAI / models

Commit History

Speed up CPU inference: halve token limits, pre-download models, fix OMP threads

4af4003

cgoodmaker Claude Opus 4.6 commited on Mar 2

Use bfloat16 on CPU to halve memory (8GB vs 16GB float32)

0989643

cgoodmaker Claude Opus 4.6 commited on Feb 26

Fix MCP subprocess deadlock: use stderr=None instead of PIPE

da343a7

cgoodmaker Claude Opus 4.6 commited on Feb 23

Add timeout and stderr logging to MCP subprocess to debug tool hangs

c376e14

cgoodmaker Claude Opus 4.6 commited on Feb 23

Remove unused files: old Gradio frontend, dead model code, test artifacts

672ed11

cgoodmaker Claude Opus 4.6 commited on Feb 23

Force MCP tool models to CPU to avoid GPU VRAM contention with MedGemma

1a97904

cgoodmaker Claude Opus 4.6 commited on Feb 23

Add RAG Phase 4 management guidance, rebuild guidelines index (286 chunks), post-analysis hint UI

5241b71

cgoodmaker Claude Opus 4.6 commited on Feb 23

Use dtype instead of deprecated torch_dtype in model_kwargs

82f82ac

cgoodmaker Claude Opus 4.6 commited on Feb 23

Redesign chat UI and fix MedGemma generation config issues

58a4476

cgoodmaker Claude Opus 4.6 commited on Feb 23

Pass HF_TOKEN explicitly to pipeline() for gated model auth

b08f876

cgoodmaker commited on Feb 21

Use HF_TOKEN env var to authenticate for gated MedGemma model

bb7e939

cgoodmaker commited on Feb 21

Add HF Spaces Dockerfile, Git LFS for model weights

72b1012

cgoodmaker commited on Feb 21

Initial commit — SkinProAI dermoscopic analysis platform

86f402d

cgoodmaker commited on Feb 21

Commit History

Speed up CPU inference: halve token limits, pre-download models, fix OMP threads 4af4003

Use bfloat16 on CPU to halve memory (8GB vs 16GB float32) 0989643

Fix MCP subprocess deadlock: use stderr=None instead of PIPE da343a7

Add timeout and stderr logging to MCP subprocess to debug tool hangs c376e14

Remove unused files: old Gradio frontend, dead model code, test artifacts 672ed11

Force MCP tool models to CPU to avoid GPU VRAM contention with MedGemma 1a97904

Add RAG Phase 4 management guidance, rebuild guidelines index (286 chunks), post-analysis hint UI 5241b71

Use dtype instead of deprecated torch_dtype in model_kwargs 82f82ac

Redesign chat UI and fix MedGemma generation config issues 58a4476

Pass HF_TOKEN explicitly to pipeline() for gated model auth b08f876

Use HF_TOKEN env var to authenticate for gated MedGemma model bb7e939

Add HF Spaces Dockerfile, Git LFS for model weights 72b1012

Initial commit — SkinProAI dermoscopic analysis platform 86f402d

Speed up CPU inference: halve token limits, pre-download models, fix OMP threads

4af4003

Use bfloat16 on CPU to halve memory (8GB vs 16GB float32)

0989643

Fix MCP subprocess deadlock: use stderr=None instead of PIPE

da343a7

Add timeout and stderr logging to MCP subprocess to debug tool hangs

c376e14

Remove unused files: old Gradio frontend, dead model code, test artifacts

672ed11

Force MCP tool models to CPU to avoid GPU VRAM contention with MedGemma

1a97904

Add RAG Phase 4 management guidance, rebuild guidelines index (286 chunks), post-analysis hint UI

5241b71

Use dtype instead of deprecated torch_dtype in model_kwargs

82f82ac

Redesign chat UI and fix MedGemma generation config issues

58a4476

Pass HF_TOKEN explicitly to pipeline() for gated model auth

b08f876

Use HF_TOKEN env var to authenticate for gated MedGemma model

bb7e939

Add HF Spaces Dockerfile, Git LFS for model weights

72b1012

Initial commit — SkinProAI dermoscopic analysis platform

86f402d