File size: 4,577 Bytes
169e801
a446150
3fc0665
a446150
 
 
3fc0665
a446150
3fc0665
a446150
169e801
a446150
3fc0665
 
 
 
 
 
 
 
 
 
 
 
 
 
 
a446150
 
 
 
3fc0665
 
 
 
 
a446150
 
 
 
3fc0665
 
 
 
 
 
 
a446150
3fc0665
 
 
 
 
 
f6ca789
a446150
f6ca789
 
 
a446150
f6ca789
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
a446150
 
 
 
 
f6ca789
 
 
8f67ac4
 
a446150
8f67ac4
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
a446150
8f67ac4
f6ca789
 
 
 
a446150
 
 
 
f6ca789
 
 
 
 
 
 
 
 
a446150
 
 
 
 
f6ca789
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
a446150
f6ca789
 
 
 
 
 
 
 
 
 
 
 
 
a446150
f6ca789
 
 
 
 
 
 
 
 
a446150
 
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200
201
202
203
204
205
206
207
208
209
210
211
212
213
214
215
216
217
218
219
220
221
222
223
224
225
226
227
---

license: apache-2.0
base_model: "Qwen/Qwen2.5-0.5B-Instruct"
tags: ["legal-ai", "lora", "peft", "qlora", "indian-law", "legal-simplification", "accessibility", "irac", "transformers", "unsloth"]
language: ["en"]
pipeline_tag: text-generation
datasets: ["joyboseroy/inIRAC"]
library_name: transformers

---
--------------------------

# NyayaSaar-LoRA

NyayaSaar-LoRA is a lightweight LoRA adapter fine-tuned to simplify structured Indian legal reasoning into plain English.

The project focuses on improving accessibility to legal reasoning for non-lawyers interacting with the Indian legal system.

This model was trained using parameter-efficient fine-tuning (LoRA/QLoRA) on structured IRAC-style legal reasoning examples derived from the inIRAC dataset.

---

# Motivation

Indian legal documents are often difficult for ordinary citizens to understand because of:

* archaic terminology
* procedural complexity
* dense sentence structures
* formal legal phrasing

NyayaSaar-LoRA explores whether small language models can learn to preserve legal meaning while simplifying language and improving readability.

The project prioritizes:

* accessibility
* explainability
* low-resource adaptation
* efficient fine-tuning

---

# Example

## Input

```text id="jlwmkm"
Issue: Whether the detention order violates Article 22.

Rule: Preventive detention laws require procedural safeguards.

Application: The petitioner argued safeguards were not followed.

Conclusion: The detention order is quashed.
```

## Output

```text id="4awft0"
The court examined whether the detention was legal under the Constitution.

The law says preventive detention must follow proper safeguards.

The petitioner argued these safeguards were ignored.

The court agreed and cancelled the detention order.
```

---

# Technical Details

## Base Model

* Qwen2.5-0.5B-Instruct (4-bit quantized)

## Fine-Tuning Method

* LoRA / QLoRA
* PEFT
* Unsloth
* Supervised Fine-Tuning (SFT)

## Training Environment

* Google Colab
* Single GPU
* Low-resource fine-tuning setup

---

# Dataset

Training examples were derived from the inIRAC dataset:

https://huggingface.co/datasets/joyboseroy/inIRAC

The dataset contains structured Indian legal reasoning in IRAC format:

* Issue
* Rule
* Application
* Conclusion

---

# Intended Use

This project is intended for:

* legal accessibility research
* plain-English legal explanation
* educational demonstrations
* low-resource legal NLP experimentation
* research into explainable legal AI

---

# Usage

```python id="zj38zt"
from transformers import AutoModelForCausalLM, AutoTokenizer
from peft import PeftModel

base_model = "Qwen/Qwen2.5-0.5B-Instruct"
adapter_model = "joyboseroy/nyayasaar-lora"

tokenizer = AutoTokenizer.from_pretrained(base_model)

model = AutoModelForCausalLM.from_pretrained(
    base_model,
    device_map="auto"
)

model = PeftModel.from_pretrained(model, adapter_model)

prompt = """
Issue: Whether the detention order violates Article 22.

Rule: Preventive detention laws require procedural safeguards.

Application: The petitioner argued safeguards were not followed.

Conclusion: The detention order is quashed.
"""

inputs = tokenizer(prompt, return_tensors="pt").to(model.device)

outputs = model.generate(
    **inputs,
    max_new_tokens=200
)

print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```

---

# Limitations

This model:

* does not provide legal advice
* may oversimplify nuanced legal reasoning
* may omit procedural subtleties
* should not be used in high-stakes legal decision-making

Outputs must always be reviewed by qualified legal professionals before practical use.

---

# Future Work

Potential future directions include:

* multilingual Indian legal simplification
* Hindi and Bengali adaptation
* graph-grounded legal reasoning
* retrieval-augmented legal explanation
* evaluation using readability and semantic-preservation metrics

---

# Related Work

## Dataset

inIRAC Dataset:
https://huggingface.co/datasets/joyboseroy/inIRAC

## Research

Falkor-IRAC: Graph-Constrained Generation for Verified Legal Reasoning in Indian Judicial AI
https://arxiv.org/abs/2605.14665

---

# Citation

```bibtex id="t1xjlwm"
@misc{bose2026nyayasaar,
  title={NyayaSaar-LoRA: Simplifying Indian Legal Reasoning using PEFT},
  author={Joy Bose},
  year={2026},
  howpublished={Hugging Face Model Repository}
}
```

---

# Author

Joy Bose
Senior Data Scientist and Researcher

Research interests:

* Legal AI
* Explainable AI
* Graph Reasoning
* Ethical AI
* Efficient LLM Adaptation

---