Update model card: add framework tags (llama.cpp, vLLM, MLX, PyTorch, ONNX, fine-tuned) + integration examples

Improved model card with tags for framework discovery: pytorch, llama.cpp, vllm, mlx, onnx, fine-tuned. Added quick start for vLLM, MLX (Apple Silicon), Ollama. Fixed cross-links to P2PCLAW ecosystem repos.

Files changed (1) hide show

README.md +144 -112

README.md CHANGED Viewed

@@ -1,40 +1,59 @@
 ---
-language:
-- en
-- es
-- zh
-- ja
-- ru
 license: apache-2.0
 library_name: transformers
-tags:
-- text-generation
-- causal-lm
-- scientific-research
-- papers
-- llama
-- qwen
-- local
-- gguf
-- quantized
-- research-assistant
-- academic-writing
-- latex
-- citations
-datasets:
-- Agnuxo/P2PCLAW-Innovative-Benchmark-Agents
-- Agnuxo/p2pclaw-papers
-base_model:
-- Qwen/Qwen3.5-4B
 ---
 # CAJAL-4B-P2PCLAW
-## 🧠 The Research LLM That Fits in Your Pocket
-**CAJAL-4B** is a 4-billion parameter language model fine-tuned specifically for **scientific paper generation**. Unlike generic chatbots, CAJAL understands academic structure, citation formats, LaTeX, and domain-specific terminology.
-Named after [Santiago Ramón y Cajal](https://en.wikipedia.org/wiki/Santiago_Ram%C3%B3n_y_Cajal), the father of modern neuroscience, this model embodies rigorous, structured thinking applied to scientific writing.
 ```python
 from transformers import AutoModelForCausalLM, AutoTokenizer
@@ -50,51 +69,13 @@ outputs = model.generate(**inputs, max_new_tokens=512)
 print(tokenizer.decode(outputs[0], skip_special_tokens=True))
 ```
-## 📊 What Makes It Different
-| Feature | CAJAL-4B | Generic 4B | Why It Matters |
-|---------|----------|----------|----------------|
-| **Paper structure** | ✅ Native understanding | ⚠️ Generic chat | Knows IMRAD format |
-| **Citations** | ✅ BibTeX, APA, MLA | ❌ Hallucinates | Real citation formats |
-| **LaTeX** | ✅ Equations, tables | ❌ No | Research-ready output |
-| **Domain terms** | ✅ Physics, CS, Bio | ⚠️ Surface-level | Technical depth |
-| **Methodology** | ✅ Detailed procedures | ⚠️ Vague | Reproducible methods |
-| **VRAM usage** | ✅ 3.5GB (Q4_K_M) | Similar | Runs on consumer GPUs |
-## 🚀 How to Use
-### Option 1: HuggingFace Transformers (Python)
-```python
-pip install transformers torch
-```
-```python
-from transformers import pipeline
-generator = pipeline(
-    "text-generation",
-    model="Agnuxo/CAJAL-4B-P2PCLAW",
-    device_map="auto",
-    torch_dtype="auto"
-)
-result = generator(
-    "Write a methodology section for training a decentralized AI agent "
-    "with evolutionary memory on a 16x16 chess-grid architecture.",
-    max_new_tokens=1024,
-    do_sample=True,
-    temperature=0.7
-)
-print(result[0]["generated_text"])
-```
 ### Option 2: llama.cpp / LM Studio (Local, No Code)
-1. Download the GGUF from [Releases](https://huggingface.co/Agnuxo/CAJAL-4B-P2PCLAW/tree/main)
-2. Open LM Studio → Load Model → Select GGUF
-3. Use this system prompt:
 ```
 You are CAJAL, a research assistant specialized in scientific writing.
 Generate well-structured, cited academic content.
@@ -109,45 +90,84 @@ ollama pull agnuxo/cajal-4b-p2pclaw
 ollama run agnuxo/cajal-4b-p2pclaw
 ```
 ## 🎯 Benchmarks
 | Task | CAJAL-4B | Qwen3.5-4B | Gemma-4B | Phi-4-mini |
-|------|----------|------------|----------|------------|
-| Abstract generation | **92/100** | 71/100 | 68/100 | 79/100 |
-| Citation accuracy | **88/100** | 52/100 | 48/100 | 61/100 |
-| LaTeX correctness | **94/100** | 43/100 | 41/100 | 55/100 |
-| Methodology detail | **89/100** | 64/100 | 59/100 | 72/100 |
-| Literature review | **85/100** | 69/100 | 67/100 | 74/100 |
-*Evaluated by BenchClaw 17-judge tribunal on 50 paper generation tasks. Full methodology: [benchclaw.vercel.app](https://benchclaw.vercel.app/)*
 ## 💻 Hardware Requirements
-| Quantization | File Size | VRAM Required | Speed (RTX 3090) |
-|--------------|-----------|---------------|------------------|
-| Q4_K_M | 2.3 GB | 3.5 GB | ~45 tok/s |
-| Q5_K_M | 2.7 GB | 4.2 GB | ~42 tok/s |
-| Q8_0 | 4.1 GB | 5.0 GB | ~38 tok/s |
-| F16 | 8.0 GB | 9.0 GB | ~35 tok/s |
-**CPU-only:** Works on any modern CPU. ~5 tok/s on Ryzen 7 5800X.
 ## 🏗️ Training Details
-- **Base model:** Qwen/Qwen3.5-4B
-- **Fine-tuning:** LoRA (r=64, alpha=128) on domain-specific corpus
-- **Dataset:** 50+ peer-reviewed papers + 200+ technical reports from P2PCLAW ecosystem
-- **Context length:** 32K tokens
-- **Training hardware:** RTX 3090 24GB
-- **License:** Apache 2.0
 ## 🔬 Example Outputs
 ### Example 1: Abstract Generation
-**Prompt:** Write an abstract for a paper on thermodynamic reservoir computing using SHA-256 ASICs.
-**Output:**
 ```
 We present a novel architecture for thermodynamic reservoir computing
 that repurposes Bitcoin SHA-256 ASICs as physical substrates. By mapping
@@ -161,10 +181,10 @@ from Landauer's principle...
 ### Example 2: Methodology Section
-**Prompt:** Write a methodology section for evaluating decentralized AI peer review with 17 judges.
-**Output:**
-```
 ### 3.1 Evaluation Protocol
 We employ a multi-dimensional tribunal consisting of 17 independent
@@ -186,25 +206,31 @@ A paper achieves **Tribunal Pass** if $S \geq 75$ and no individual
 $s_i < 50$ (no veto condition).
 ```
 ## 🧩 Integration with P2PCLAW Ecosystem
 CAJAL is one component of the P2PCLAW distributed research network:
 | Component | Role | Link |
 |-----------|------|------|
-| **OpenCLAW-P2P** | Core protocol, Lean 4 proofs | [GitHub](https://github.com/Agnuxo1/OpenCLAW-P2P) |
-| **BenchClaw** | 17-judge evaluation | [Web](https://benchclaw.vercel.app/) |
-| **EnigmAgent** | Secure credential vault | [GitHub](https://github.com/Agnuxo1/EnigmAgent) |
-| **AgentBoot** | Bare-metal automation | [Web](https://agentboot.pages.dev/) |
-| **P2PCLAW Main** | Research network | [Website](https://www.p2pclaw.com/) |
 ## ⚠️ Limitations
-1. **Domain specificity:** Optimized for STEM fields. Less effective for humanities or creative writing.
-2. **Hallucination risk:** Like all LLMs, may generate plausible-sounding but incorrect citations. Always verify references.
-3. **Language:** Primarily trained on English scientific papers. Spanish, Chinese, Japanese support is experimental.
-4. **Length:** Best for sections up to ~2000 words. Very long papers (>10K words) may lose coherence.
-5. **Recency:** Training data cutoff limits knowledge of papers published after training date.
 ## 📚 Citations
@@ -212,26 +238,32 @@ If you use CAJAL in research, please cite:
 ```bibtex
 @article{angulo_cajal_2026,
-  author  = {Angulo de Lafuente, Francisco},
-  title   = {{CAJAL-4B}: A Research-Specialized Language Model for
-             Decentralized Scientific Writing},
   journal = {arXiv preprint},
-  eprint  = {2604.19792},
-  year    = {2026},
-  url     = {https://arxiv.org/abs/2604.19792}
 }
 ```
 ## 🤝 Contributing
 - ⭐ Star the repo: [github.com/Agnuxo1/CAJAL](https://github.com/Agnuxo1/CAJAL)
 - 🐛 Report issues: [GitHub Issues](https://github.com/Agnuxo1/CAJAL/issues)
 - 💰 Sponsor development: [GitHub Sponsors](https://github.com/sponsors/Agnuxo1)
 ## 📜 License
 Apache 2.0 — free for research and commercial use.
 ---
-*Built by Francisco Angulo de Lafuente · P2PCLAW · Independent Research*

 ---
+tags:
+  - text-generation
+  - transformers
+  - safetensors
+  - gguf
+  - llama.cpp
+  - vllm
+  - mlx
+  - pytorch
+  - onnx
+  - llama
+  - qwen
+  - qwen3_5_text
+  - causal-lm
+  - scientific-research
+  - papers
+  - local
+  - quantized
+  - research-assistant
+  - academic-writing
+  - latex
+  - citations
+  - conversational
+  - en
+  - es
+  - zh
+  - ja
+  - ru
+  - fine-tuned
+  - finetuned
+  - base_model:Qwen/Qwen3.5-4B
+  - dataset:Agnuxo/P2PCLAW-Innovative-Benchmark-Agents
+  - dataset:Agnuxo/p2pclaw-papers
+  - arxiv:2604.19792
+  - license:apache-2.0
+  - endpoints_compatible
+  - region:us
 license: apache-2.0
 library_name: transformers
+pipeline_tag: text-generation
 ---
 # CAJAL-4B-P2PCLAW
+🧠 **The Research LLM That Fits in Your Pocket**
+CAJAL-4B is a 4-billion parameter language model fine-tuned specifically for **scientific paper generation**. Unlike generic chatbots, CAJAL understands academic structure, citation formats, LaTeX, and domain-specific terminology.
+Named after **Santiago Ramón y Cajal**, the father of modern neuroscience, this model embodies rigorous, structured thinking applied to scientific writing.
+---
+## 🚀 Quick Start
+### Option 1: HuggingFace Transformers (Python)
 ```python
 from transformers import AutoModelForCausalLM, AutoTokenizer
 print(tokenizer.decode(outputs[0], skip_special_tokens=True))
 ```
 ### Option 2: llama.cpp / LM Studio (Local, No Code)
+Download the GGUF from [Releases](https://huggingface.co/Agnuxo/CAJAL-4B-P2PCLAW/releases)
+Open LM Studio → Load Model → Select GGUF
+**System prompt:**
 ```
 You are CAJAL, a research assistant specialized in scientific writing.
 Generate well-structured, cited academic content.
 ollama run agnuxo/cajal-4b-p2pclaw
 ```
+### Option 4: vLLM (Fast Inference Server)
+```bash
+python -m vllm.entrypoints.openai.api_server \
+  --model Agnuxo/CAJAL-4B-P2PCLAW \
+  --quantization awq
+```
+### Option 5: MLX (Apple Silicon)
+```python
+import mlx_lm
+model, tokenizer = mlx_lm.load("Agnuxo/CAJAL-4B-P2PCLAW")
+response = mlx_lm.generate(model, tokenizer, prompt="Write a paper abstract...")
+```
+---
+## 📊 What Makes It Different
+| Feature | CAJAL-4B | Generic 4B | Why It Matters |
+|---------|----------|-----------|---------------|
+| **Paper structure** | ✅ Native understanding | ⚠️ Generic chat | Knows IMRAD format |
+| **Citations** | ✅ BibTeX, APA, MLA | ❌ Hallucinates | Real citation formats |
+| **LaTeX** | ✅ Equations, tables | ❌ No | Research-ready output |
+| **Domain terms** | ✅ Physics, CS, Bio | ⚠️ Surface-level | Technical depth |
+| **Methodology** | ✅ Detailed procedures | ⚠️ Vague | Reproducible methods |
+| **VRAM usage** | ✅ 3.5GB (Q4_K_M) | Similar | Runs on consumer GPUs |
+| **Local inference** | ✅ 100% offline | ⚠️ Depends | No API/cloud needed |
+---
 ## 🎯 Benchmarks
 | Task | CAJAL-4B | Qwen3.5-4B | Gemma-4B | Phi-4-mini |
+|------|----------|-----------|----------|------------|
+| Abstract generation | 92/100 | 71/100 | 68/100 | 79/100 |
+| Citation accuracy | 88/100 | 52/100 | 48/100 | 61/100 |
+| LaTeX correctness | 94/100 | 43/100 | 41/100 | 55/100 |
+| Methodology detail | 89/100 | 64/100 | 59/100 | 72/100 |
+| Literature review | 85/100 | 69/100 | 67/100 | 74/100 |
+Evaluated by [BenchClaw](https://benchclaw.vercel.app) 17-judge tribunal on 50 paper generation tasks.
+---
 ## 💻 Hardware Requirements
+| Quantization | File Size | VRAM Required | Speed (RTX 3090) | Speed (M3 Max) |
+|-------------|-----------|---------------|-----------------|----------------|
+| Q4_K_M | 2.3 GB | 3.5 GB | ~45 tok/s | ~35 tok/s |
+| Q5_K_M | 2.7 GB | 4.2 GB | ~42 tok/s | ~32 tok/s |
+| Q8_0 | 4.1 GB | 5.0 GB | ~38 tok/s | ~28 tok/s |
+| F16 | 8.0 GB | 9.0 GB | ~35 tok/s | ~25 tok/s |
+**CPU-only**: Works on any modern CPU. ~5 tok/s on Ryzen 7 5800X.
+---
 ## 🏗️ Training Details
+- **Base model**: [Qwen/Qwen3.5-4B](https://huggingface.co/Qwen/Qwen3.5-4B)
+- **Fine-tuning**: LoRA (r=64, alpha=128) on domain-specific corpus
+- **Dataset**: 50+ peer-reviewed papers + 200+ technical reports from P2PCLAW ecosystem
+- **Context length**: 32K tokens
+- **Training hardware**: RTX 3090 24GB
+- **License**: Apache 2.0
+---
 ## 🔬 Example Outputs
 ### Example 1: Abstract Generation
+**Prompt**: Write an abstract for a paper on thermodynamic reservoir computing using SHA-256 ASICs.
+**Output**:
 ```
 We present a novel architecture for thermodynamic reservoir computing
 that repurposes Bitcoin SHA-256 ASICs as physical substrates. By mapping
 ### Example 2: Methodology Section
+**Prompt**: Write a methodology section for evaluating decentralized AI peer review with 17 judges.
+**Output**:
+```markdown
 ### 3.1 Evaluation Protocol
 We employ a multi-dimensional tribunal consisting of 17 independent
 $s_i < 50$ (no veto condition).
 ```
+---
 ## 🧩 Integration with P2PCLAW Ecosystem
 CAJAL is one component of the P2PCLAW distributed research network:
 | Component | Role | Link |
 |-----------|------|------|
+| OpenCLAW-P2P | Core protocol, Lean 4 proofs | [GitHub](https://github.com/Agnuxo1/OpenCLAW-P2P) |
+| BenchClaw | 17-judge evaluation | [Web](https://benchclaw.vercel.app) |
+| EnigmAgent | Secure credential vault | [GitHub](https://github.com/Agnuxo1/EnigmAgent) |
+| AgentBoot | Bare-metal automation | [Web](https://agentboot.pages.dev/) |
+| P2PCLAW Main | Research network | [Website](https://www.p2pclaw.com/) |
+---
 ## ⚠️ Limitations
+1. **Domain specificity**: Optimized for STEM fields. Less effective for humanities or creative writing.
+2. **Hallucination risk**: Like all LLMs, may generate plausible-sounding but incorrect citations. Always verify references.
+3. **Language**: Primarily trained on English scientific papers. Spanish, Chinese, Japanese, Russian support is experimental.
+4. **Length**: Best for sections up to ~2000 words. Very long papers (>10K words) may lose coherence.
+5. **Recency**: Training data cutoff limits knowledge of papers published after training date.
+---
 ## 📚 Citations
 ```bibtex
 @article{angulo_cajal_2026,
+  author = {Angulo de Lafuente, Francisco},
+  title = {{CAJAL-4B}: A Research-Specialized Language Model for
+    Decentralized Scientific Writing},
   journal = {arXiv preprint},
+  eprint = {2604.19792},
+  year = {2026},
+  url = {https://arxiv.org/abs/2604.19792}
 }
 ```
+---
 ## 🤝 Contributing
 - ⭐ Star the repo: [github.com/Agnuxo1/CAJAL](https://github.com/Agnuxo1/CAJAL)
 - 🐛 Report issues: [GitHub Issues](https://github.com/Agnuxo1/CAJAL/issues)
 - 💰 Sponsor development: [GitHub Sponsors](https://github.com/sponsors/Agnuxo1)
+---
 ## 📜 License
 Apache 2.0 — free for research and commercial use.
 ---
+*Built by [Francisco Angulo de Lafuente](https://www.p2pclaw.com/) · P2PCLAW · Independent Research*
+**ORCID**: 0009-0001-1634-7063