File size: 4,323 Bytes

---
base_model:
- mistralai/Mistral-7B-Instruct-v0.3
tags:
- llama-cpp
license: mit
datasets:
- SKT-NRS/SKT-OMNI-CORPUS-146T-V1
language:
- en
- hi
pipeline_tag: text-generation
library_name: transformers
---
# 🚀 SKT-OM (TIGER-OM) - Agentic RAG System

**Advanced 13B Agentic RAG with Think Mode + Dynamic Plugins + LangGraph**

Built for **AMD Developer Hackathon 2026** on AMD Developer Cloud.

---

## 🌟 Project Overview

**SKT-OM** (also known as **TIGER-OM**) is a powerful **13B parameter fully agentic Retrieval-Augmented Generation (RAG)** system. It goes far beyond traditional RAG by integrating:

- **Think Mode** — Advanced multi-step reasoning engine
- **Dynamic Plugin Architecture** — Intelligent tool selection & execution
- **LangGraph Multi-Agent Workflow** — Stateful agent collaboration
- **SKT RAG** — High-performance retrieval pipeline

The system takes natural language queries and returns intelligent, reasoned, and accurate responses with tool usage and verification.

---

## 📊 Model Details

- **Model Name**: TIGER-OM (SKT-OM)
- **Parameters**: 13 Billion
- **Base Model**: Custom trained on AMD hardware
- **Quantization**: **Q4_K_M** (Excellent balance between quality and size)
- **GGUF Format**: Optimized for CPU + GPU inference
- **Training Hardware**: AMD Developer Cloud GPUs ($100 credits)
- **Inference**: ROCm 7.0 + vLLM (Full FP16) + GGUF (Q4_K_M)

**Q4_K_M Version** provides near FP16 level reasoning quality while being much more memory efficient and faster on consumer/pro hardware.

---

## ✨ Key Features

- **Think Mode Engine**: Chain-of-Thought, Self-Reflection, Verification Loops, and Self-Critique
- **Plugin Ecosystem**: Code Runner, Math Solver, Web Search, Data Analyzer, Document Parser + Custom Plugins
- **Advanced RAG**: SKT RAG with query rewriting, multi-hop retrieval, reranking & contextual compression
- **Multi-Agent System**: LangGraph powered stateful workflow
- **Memory**: Persistent conversation state
- **Tool Use**: Dynamic plugin routing based on query intent

---

## 🔗 Important Links

- **Live Demo**: [https://huggingface.co/spaces/lablab-ai-amd-developer-hackathon/SKT-OM](https://huggingface.co/spaces/lablab-ai-amd-developer-hackathon/SKT-OM)
- **Main Model Repo**: [Shrijanagain/TIGER-OM](https://huggingface.co/Shrijanagain/TIGER-OM)
- **GGUF Quantized Models (Q4_K_M)**: [Shrijanagain/TIGER-GGUF](https://huggingface.co/Shrijanagain/TIGER-GGUF)
- **GitHub Repository (RAG + ADK)**: [https://github.com/SHRIJANAGAIN/SKT-AMD-FILES](https://github.com/SHRIJANAGAIN/SKT-AMD-FILES)

---

## How It Works

```mermaid
graph TD
    A[User Query] --> B[Think Mode]
    B --> C[Decomposition & Planning]
    C --> D[Plugin Router]
    C --> E[SKT RAG Retrieval]
    D --> F[Execute Plugins]
    E --> G[Context Processing]
    F & G --> H[Verification Loop]
    H --> I[LangGraph Synthesis]
    I --> J[Final Response]
```

---

## 🛠️ Technologies Used

- **LLM**: 13B TIGER-OM (Q4_K_M GGUF)
- **RAG Framework**: SKT RAG + ADK Kit
- **Agent Framework**: LangGraph
- **GPU Stack**: ROCm 7.0 + AMD ADK Kit
- **Inference**: vLLM (FP16) + llama.cpp (GGUF Q4_K_M)
- **Hardware**: AMD MI300X
- **Cloud**: AMD Developer Cloud

---

## 🚀 Quick Start - GGUF Q4_K_M

```bash
# Using llama.cpp
./llama-cli \
  -m tiger-om-q4_k_m.gguf \
  -p "Your complex query here..." \
  -n 1024 \
  -t 8 \
  --temp 0.7
```

**Python Example (llama-cpp-python)**

```python
from llama_cpp import Llama

llm = Llama(
    model_path="tiger-om-q4_k_m.gguf",
    n_gpu_layers=-1,      # Use all GPU layers
    n_ctx=8192,
    verbose=False
)

response = llm.create_chat_completion(
    messages=[{"role": "user", "content": "Explain..."}],
    temperature=0.7,
    max_tokens=1024
)

print(response['choices'][0]['message']['content'])
```

---

## 📁 Repository Structure

- `/skt_ai_labs` — Core ADK + RAG integration
- `/plugins` — Plugin system
- `/agents` — LangGraph workflows
- `/examples` — Ready-to-use examples
- `/docs` — Architecture & guides

---

## 🏆 Hackathon Information

- **Event**: AMD Developer Hackathon 2026
- **Trained on**: AMD Developer Cloud ($100 credits)
- **Built in Public**: Regular technical updates shared
- **Goal**: Showcasing powerful agentic AI on AMD ROCm ecosystem

---

## 📄 License

*MIT*