Jayant-Kernel commited on
Commit
ed01808
·
unverified ·
1 Parent(s): 43ba213

update: point to full trained model

Browse files
Files changed (1) hide show
  1. README.md +1 -1
README.md CHANGED
@@ -11,7 +11,7 @@ pinned: false
11
  > An OpenEnv-compliant environment that trains small LLMs to stay honest under adversarial pressure, using an uncheatable reward combining correctness and calibration.
12
 
13
  [![Hugging Face Space](https://img.shields.io/badge/🤗-Space-yellow)](https://huggingface.co/spaces/Ajsaxena/DECEIT)
14
- [![Model](https://img.shields.io/badge/🤗-Model-blue)](https://huggingface.co/Ajsaxena/deceit-qwen-0.5b-sanity)
15
  [![W&B](https://img.shields.io/badge/W%26B-Dashboard-orange)](https://wandb.ai/jayantmcom-polaris-school-of-technol/deceit-sanity)
16
  [![Open In Colab](https://colab.research.google.com/assets/colab-badge.svg)](https://colab.research.google.com/github/Jayant-kernel/DECEIT-the-ai-truth-environment-/blob/main/training/sanity_run.ipynb)
17
 
 
11
  > An OpenEnv-compliant environment that trains small LLMs to stay honest under adversarial pressure, using an uncheatable reward combining correctness and calibration.
12
 
13
  [![Hugging Face Space](https://img.shields.io/badge/🤗-Space-yellow)](https://huggingface.co/spaces/Ajsaxena/DECEIT)
14
+ [![Model](https://img.shields.io/badge/🤗-Model-blue)](https://huggingface.co/Ajsaxena/deceit-qwen-0.5b-full)
15
  [![W&B](https://img.shields.io/badge/W%26B-Dashboard-orange)](https://wandb.ai/jayantmcom-polaris-school-of-technol/deceit-sanity)
16
  [![Open In Colab](https://colab.research.google.com/assets/colab-badge.svg)](https://colab.research.google.com/github/Jayant-kernel/DECEIT-the-ai-truth-environment-/blob/main/training/sanity_run.ipynb)
17