---
language:
- en
license: mit
tags:
- codette
- multi-perspective-reasoning
- ethical-ai
- lora
- qlora
- llama-3.1
- recursive-cognition
- rc-xi
library_name: peft
base_model: meta-llama/Llama-3.1-8B-Instruct
model-index:
- name: Codette RC+xi Reasoning Adapters
  results:
  - task:
      type: text-generation
      name: Multi-Perspective Reasoning
    metrics:
    - name: Phase Coherence (Gamma)
      type: custom
      value: 0.9835
    - name: AEGIS Ethical Alignment (Eta)
      type: custom
      value: 0.961
    - name: Cocoon Coherence
      type: custom
      value: 0.994
    - name: Memory Phase Stability
      type: custom
      value: 0.969
---

# Codette Adapter Training Lab

Codette is an experimental AI research system for **recursive reasoning, multi-perspective cognition, and ethical AI alignment**, created by **Jonathan Harrison**.

This repository contains the complete training pipeline, inference server, and 8 trained LoRA adapters for the Codette cognitive architecture running on Llama 3.1 8B.

## 🚀 Latest Status (Session 2026-03-19) — LIVE & TESTED

### ✅ Agent LLM Integration Complete
All 6 reasoning agents now use **real LLM inference** via trained LoRA adapters:
- **Newton** (physics reasoning) → newton adapter
- **Quantum** (probabilistic thinking) → quantum adapter
- **DaVinci** (creative invention) → davinci adapter
- **Philosophy** (conceptual reasoning) → philosophy adapter
- **Empathy** (emotional intelligence) → empathy adapter
- **Ethics** (moral reasoning) → philosophy adapter

**Result**: Agents generate domain-specific, LLM-backed reasoning instead of templates.

### ✅ GPU Acceleration Active
- Model load: ~8-10 seconds (GPU vs 40s CPU)
- Inference: 2-4 sec/query (GPU vs 15-20s CPU)
- Full eval: ~2-3 minutes (GPU vs 7-10 minutes CPU)
- **35/35 layers offloaded** to GPU via llama.cpp

### ✅ Phase 6 Stability Verified
All control mechanism patches tested and working:
- **Patch 2**: Conflict capping (23 → 10 conflicts/round)
- **Patch 4**: Gamma authority (threshold 0.3, prevents collapse)
- **Patch 5**: Domain-aware gating (2-3 agents/domain, not all 6)

### ✅ First Eval Results
```
Q1: "What is the speed of light in vacuum?"
  Agent modes: ✓ LLM ✓ LLM ✓ LLM ✓ LLM ✓ LLM ✓ LLM (all agents using GPU)
  Domain detection: physics → 2 agents active (Newton, Quantum)
  Conflicts: 23 detected → 10 capped (Patch 2)
  Gamma: 0.38 → intervention triggered (Patch 4)
  GPU: ✓ ENABLED (35 layers offloaded)
```

## Model Weights

All 8 adapters are included in two formats:

| Format | Directory | Size | Use Case |
|--------|-----------|------|----------|
| **GGUF (f16)** | `adapters/*.gguf` | ~924 MB | llama.cpp inference with hot-swap |
| **PEFT SafeTensors** | `adapters_peft/*/` | ~79 MB | HuggingFace / transformers fine-tuning |

**Base model required**: `meta-llama/Llama-3.1-8B-Instruct` (or any Llama-3.1-8B variant with hidden_size=4096)

## Key Metrics

| Metric | Value | Context |
|--------|-------|---------|
| Phase Coherence (Gamma) | 0.9835 | 11-agent convergence |
| AEGIS Ethical Alignment (Eta) | 0.961 | 6-framework ethical governance |
| Cocoon Coherence | 0.994 | Memory state stability |
| Memory Phase Stability | 0.969 | Cross-session persistence |
| Tension Decay | 91.2% | 200-agent embodied simulation |

## Cognitive Subsystems (10 active)

| Subsystem | Module | Purpose |
|-----------|--------|---------|
| Reasoning Forge | `reasoning_forge/forge_engine.py` | 6-agent multi-perspective debate + synthesis |
| Epistemic Metrics | `reasoning_forge/epistemic_metrics.py` | RC+xi tension/coherence tracking |
| Quantum Spiderweb | `reasoning_forge/quantum_spiderweb.py` | 5D belief propagation + attractor detection |
| Cocoon Sync | `reasoning_forge/cocoon_sync.py` | Fernet-encrypted federated state sync |
| AEGIS | `reasoning_forge/aegis.py` | 6-framework ethical governance (utilitarian, deontological, virtue, care, ubuntu, indigenous) |
| Nexus Signal Engine | `reasoning_forge/nexus.py` | Pre-corruption detection via entropy + FFT + intent vectors |
| Living Memory | `reasoning_forge/living_memory.py` | Emotionally-tagged memory cocoons with SHA-256 anchors |
| Guardian | `reasoning_forge/guardian.py` | 3-layer protection (sanitizer + ethical anchor + trust calibrator) |
| Resonant Continuity | `reasoning_forge/resonant_continuity.py` | Psi_r wavefunction: emotion x energy x frequency x intent |
| Perspective Registry | `reasoning_forge/perspective_registry.py` | 12 perspectives (8 LoRA-backed + 4 prompt-only with fallback) |

## Architecture

```
codette-training-lab/
├── dataset_engine/          # Dataset generation pipeline
│   ├── template_registry.py # Rich template pools per adapter
│   ├── answer_generator.py  # Structured educational answer generation
│   ├── dataset_generator.py # Main generator with dedup + validation
│   └── templates/           # JSON template definitions
│
├── reasoning_forge/         # Multi-agent reasoning dataset refinement
│   ├── agents/              # Newton, Quantum, Ethics, Philosophy, DaVinci, Empathy
│   ├── critic_agent.py      # Quality evaluation agent
│   ├── synthesis_engine.py  # Multi-perspective synthesis
│   ├── problem_generator.py # Reasoning problem generation
│   └── forge_engine.py      # Orchestrator
│
├── training/                # LoRA training scripts
│   ├── train_adapter.py     # Single adapter training (4-bit LoRA)
│   ├── train_all_adapters.py# Sequential multi-adapter training
│   ├── merge_adapters.py    # Merge LoRA into base model
│   └── configs/             # Training hyperparameters
│
├── evaluation/              # Benchmarks and quality assurance
│   ├── reasoning_metrics.py # Multi-dimensional scoring
│   ├── benchmark_runner.py  # Automated evaluation
│   ├── dataset_validator.py # Dataset quality checks
│   ├── failure_analyzer.py  # Weakness detection
│   └── prompts/             # Benchmark test sets
│
├── observatory/             # Experiment tracking and monitoring
│   ├── metrics_logger.py    # Training run logging
│   ├── performance_tracker.py # Improvement trends
│   ├── dataset_quality_monitor.py
│   └── dashboard.py         # ASCII status dashboard
│
├── research/                # Source research documents
│   ├── papers/              # Published manuscripts
│   ├── frameworks/          # RC+xi, quantum equations, perspectives
│   └── experiments/         # Cocoon simulations, logs
│
├── datasets/                # Generated training datasets (JSONL)
├── adapters/                # Trained LoRA adapters
├── scripts/                 # Pipeline orchestration
│   ├── run_full_pipeline.py # End-to-end pipeline
│   └── hf_job.yaml          # HuggingFace job config
└── configs/                 # System configuration
    ├── adapter_registry.yaml
    └── pipeline_config.yaml
```

## Adapters

| Adapter | Domain | Target Examples | System Prompt |
|---------|--------|----------------|---------------|
| Newton | Analytical physics reasoning | 3000 | Newtonian analytical precision |
| DaVinci | Creative invention thinking | 2500 | Creative inventiveness |
| Empathy | Emotional understanding | 2500 | Deep empathy and EQ |
| Philosophy | Conceptual reasoning | 2000 | Philosophical depth |
| Quantum | Probabilistic thinking | 2000 | Quantum probabilistic thinking |
| RC+xi | Recursive cognition | 3000 | RC+xi framework reasoning |
| Multi-Perspective | Synthesis across lenses | 2500 | Multi-perspective synthesis |
| Systems | AI architecture | 2000 | System architecture design |

## Training Pipeline

```
research documents
      ↓
dataset extraction (template-based generation)
      ↓
synthetic reasoning expansion (counterexamples, variations)
      ↓
dataset validation (dedup, quality filter)
      ↓
reasoning forge (multi-agent critique + refinement)
      ↓
adapter training (4-bit LoRA on Llama 3.1 8B)
      ↓
benchmark evaluation (multi-dimensional reasoning metrics)
      ↓
observatory logging (track improvement over time)
```

## Quick Start

### Install dependencies

```bash
pip install -r requirements.txt
```

### Generate all datasets

```bash
python -m dataset_engine.generate_all
```

### Run full pipeline

```bash
python scripts/run_full_pipeline.py --all
```

### Generate + validate only

```bash
python scripts/run_full_pipeline.py --generate --validate
```

### Train a single adapter

```bash
python -m training.train_adapter \
  --dataset datasets/newton_reasoning.jsonl \
  --adapter-name newton \
  --output-dir adapters/newton
```

### Run benchmarks

```bash
python -m evaluation.benchmark_runner --prompts evaluation/prompts/reasoning_tests.json
```

### View dashboard

```bash
python -m observatory.dashboard
```

## Dataset Format

All datasets use chat-format JSONL:

```json
{
  "messages": [
    {"role": "system", "content": "You are Codette, a recursive multi-perspective reasoning AI."},
    {"role": "user", "content": "Explain the conservation of momentum using a real-world example."},
    {"role": "assistant", "content": "Conservation of momentum states that in a closed system..."}
  ]
}
```

## Reasoning Forge

The Reasoning Forge refines training data through multi-agent debate:

```
concept → problem generator → agent analysis → critic evaluation → synthesis → training example
```

Agents: Newton (physics), Quantum (probability), Ethics (alignment), Philosophy (meaning), DaVinci (creativity), Empathy (emotion)

Each agent analyzes from its perspective, the critic scores quality, and the synthesis engine produces a unified multi-perspective response.

## Base Model

- **Model**: meta-llama/Llama-3.1-8B-Instruct
- **Method**: QLoRA (4-bit quantization)
- **LoRA config**: rank=16, alpha=32, target=q/k/v/o projections

## Research Background

Codette implements the RC+xi (Recursive Convergence + Epistemic Tension) framework for structured multi-perspective reasoning. The system coordinates 11 reasoning perspectives in parallel before synthesizing a final response.

Key research documents in `research/`:
- RC+xi Framework specification
- Quantum Cosmic Multicore experiment
- Codette Research Equations (8 core quantum mathematics)
- Multi-perspective reasoning architecture

## Inference & Evaluation

### Interactive Web UI

Launch the real-time multi-perspective reasoning UI:

```bash
# Launch web interface (default port 5000)
python inference/codette_server.py

# Or use the batch file (Windows)
codette_web.bat
```

Features:
- Real-time adapter hot-swap (0ms switching via llama.cpp LoRA)
- **Real LLM-backed agents** (not templates) generating domain-specific reasoning
- GPU acceleration (35 layers offloaded)
- Quantum spiderweb visualization
- Live AEGIS ethical alignment tracking
- Memory cocoon emotional profiling

### Evaluation & Testing

**Standard Evaluation** (4 conditions × 25 questions):
```bash
python evaluation/run_evaluation_sprint.py --questions 5
```

**Real-Time Agent Thinking** (see agents reasoning in real-time):
```bash
python evaluation/run_evaluation_verbose.py --questions 1
```

Shows:
- Agent mode: ✓ LLM (real inference) or ✗ TEMPLATE (fallback)
- System prompts used
- Token generation
- Domain detection and agent gating
- Conflict detection and capping
- Gamma coherence monitoring
- Final synthesis

**Verbose Logs** with `CODETTE_VERBOSE=1`:
```bash
CODETTE_VERBOSE=1 python evaluation/run_evaluation_verbose.py
```

Shows each agent's thinking step-by-step.

## LoRA Configuration

```yaml
method: QLoRA (4-bit NF4 quantization)
rank: 16
alpha: 32
dropout: 0.05
target_modules: [q_proj, k_proj, v_proj, o_proj]
total_training_examples: 20,500
```

## RC+xi Framework

The core theoretical framework — **Recursive Convergence + Epistemic Tension** — coordinates 11 reasoning perspectives:

1. Newton (analytical physics) → `newton` adapter
2. DaVinci (creative invention) → `davinci` adapter
3. Empathy (emotional intelligence) → `empathy` adapter
4. Philosophy (conceptual reasoning) → `philosophy` adapter
5. Quantum (probabilistic thinking) → `quantum` adapter
6. RC+xi Consciousness → `consciousness` adapter
7. Multi-Perspective Synthesis → `multi_perspective` adapter
8. Systems Architecture → `systems_architecture` adapter
9. Human Intuition → prompt-only (fallback: `empathy`)
10. Resilient Kindness → prompt-only (fallback: `empathy`)
11. AEGIS Ethics → prompt-only (fallback: `consciousness`)

## Requirements

- Python 3.10+
- PyTorch 2.1+ (CUDA, ROCm, or XPU backend)
- 16GB+ RAM (CPU training) or GPU with 8GB+ VRAM
- llama.cpp with GGUF support (for inference server)
- ~1-3 hours per adapter (CPU) or 20-40 min (A10/A100 GPU)

## Hardware Tested

- Intel Arc 140V (8GB) — PyTorch 2.10.0+xpu, native XPU backend
- NVIDIA GPUs via CUDA (A10, A100, RTX series)
- CPU-only mode supported

## License

MIT — Research project by Jonathan Harrison. Experimental AI development.