DJLougen commited on
Commit
4ad5184
Β·
verified Β·
1 Parent(s): 3e491a1

Split SABER and RYS organization card entries

Browse files
Files changed (1) hide show
  1. README.md +7 -3
README.md CHANGED
@@ -25,7 +25,8 @@ GestaltLabs is the research home for the **Nemostein**, **Ornstein**, **Harmonic
25
  - **Nemostein** β€” Nemotron/Nemotron-Omni reasoning and tool-use fine-tunes, including GGUF releases for local inference
26
  - **Ornstein** β€” multimodal and MoE reasoning models (Qwen/Gemma-derived) optimized for local deployment and agentic workflows
27
  - **CRUCIBLE** β€” Conditional Reward-Utility Coreset by Iterative Bayesian Learnability Estimation, a private data pipeline for reasoning and agent-trace curation
28
- - **SABER / RYS** β€” post-training experiments around capability preservation, refusal boundaries, KLD drift, and model behavior editing
 
29
  - **Acta / Talos** β€” curated agentic tool-use and coding-assistant traces for SFT and evaluation
30
  - **Local inference** β€” GGUF, quantized, and deployment-friendly builds for llama.cpp and MLX workflows
31
 
@@ -59,12 +60,15 @@ Multimodal and MoE reasoning line across 27B–35B scales, shipped in formats fo
59
  ### CRUCIBLE
60
  [CRUCIBLE](https://github.com/DJLougen/crucible) is a private data pipeline: Conditional Reward-Utility Coreset by Iterative Bayesian Learnability Estimation. It is used for reasoning-oriented data selection, self-correction supervision, and agent-trace curation.
61
 
62
- ### SABER / RYS
63
- Experimental post-training branches studying refusal boundaries and capability-preserving edits.
64
 
65
  - [SABER code](https://github.com/DJLougen/SABER)
66
  - [DJLougen/gemma-4-E2B-it-saber](https://huggingface.co/DJLougen/gemma-4-E2B-it-saber)
67
 
 
 
 
68
  ## Code & Agent Infrastructure
69
  - [SABER](https://github.com/DJLougen/SABER) β€” refusal-boundary and KLD-drift tuning toolkit for capability-preserving behavior edits
70
  - [RBForge / Ornstein ToolForge](https://github.com/DJLougen/RBForge) β€” runtime tool creation for Ornstein, Hermes, and SABER agents, with forged tools persisted into RBMEM memory
 
25
  - **Nemostein** β€” Nemotron/Nemotron-Omni reasoning and tool-use fine-tunes, including GGUF releases for local inference
26
  - **Ornstein** β€” multimodal and MoE reasoning models (Qwen/Gemma-derived) optimized for local deployment and agentic workflows
27
  - **CRUCIBLE** β€” Conditional Reward-Utility Coreset by Iterative Bayesian Learnability Estimation, a private data pipeline for reasoning and agent-trace curation
28
+ - **SABER** β€” spectral refusal-boundary editing with KLD drift measurement for capability-preserving behavior edits
29
+ - **RYS** β€” a separate post-training method line for refusal/behavior shaping experiments
30
  - **Acta / Talos** β€” curated agentic tool-use and coding-assistant traces for SFT and evaluation
31
  - **Local inference** β€” GGUF, quantized, and deployment-friendly builds for llama.cpp and MLX workflows
32
 
 
60
  ### CRUCIBLE
61
  [CRUCIBLE](https://github.com/DJLougen/crucible) is a private data pipeline: Conditional Reward-Utility Coreset by Iterative Bayesian Learnability Estimation. It is used for reasoning-oriented data selection, self-correction supervision, and agent-trace curation.
62
 
63
+ ### SABER
64
+ SABER is a spectral refusal-boundary editing toolkit that tunes candidates against generated refusal rate and KLD drift to preserve useful capability.
65
 
66
  - [SABER code](https://github.com/DJLougen/SABER)
67
  - [DJLougen/gemma-4-E2B-it-saber](https://huggingface.co/DJLougen/gemma-4-E2B-it-saber)
68
 
69
+ ### RYS
70
+ RYS is a distinct post-training method line for refusal and behavior shaping experiments, tracked separately from SABER.
71
+
72
  ## Code & Agent Infrastructure
73
  - [SABER](https://github.com/DJLougen/SABER) β€” refusal-boundary and KLD-drift tuning toolkit for capability-preserving behavior edits
74
  - [RBForge / Ornstein ToolForge](https://github.com/DJLougen/RBForge) β€” runtime tool creation for Ornstein, Hermes, and SABER agents, with forged tools persisted into RBMEM memory