---
license: cc-by-nc-4.0
language:
- it
tags:
- llama
- llama-3
- meta
- medical-qa
- italian
- biomedical
- question-answering
- fine-tuning
- unsloth
- bnb
- 4bit
- imb
- ortopedia
datasets:
- praiselab-picuslab/IMB
base_model:
- unsloth/Llama-3.2-1B-Instruct-unsloth-bnb-4bit
---
# 🧠 Llama-3.2-1B-Instruct — IMB Ortopedia Fine-Tuned Model
This model is a fine-tuned version of [`unsloth/Llama-3.2-1B-Instruct-unsloth-bnb-4bit`](https://huggingface.co/unsloth/Llama-3.2-1B-Instruct-unsloth-bnb-4bit), optimized for **Italian medical question answering**, with a specific focus on **Ortopedia**.
The fine-tuning was performed using a **subset of the IMB (Italian Medical Benchmark) dataset**, specifically:
- Ortopedia category only
- ~10,000 training samples
The training was performed using the **Unsloth** library with LoRA fine-tuning, and the adapter weights were later merged into the base model to provide a standalone checkpoint.
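The merge step described above (folding the trained LoRA adapter back into the base weights so no adapter is needed at inference time) can be sketched numerically. This is a generic illustration of the LoRA update `W' = W + (alpha/r) * B @ A`, not the actual Unsloth merge code; all dimensions and values are made up.

```python
import numpy as np

rng = np.random.default_rng(0)

# Hypothetical dimensions for illustration only.
d_out, d_in, r, alpha = 8, 8, 2, 16

W = rng.standard_normal((d_out, d_in))      # frozen base weight
A = rng.standard_normal((r, d_in)) * 0.01   # LoRA down-projection
B = rng.standard_normal((d_out, r)) * 0.01  # LoRA up-projection (non-zero after training)

def lora_forward(x):
    # During fine-tuning: base output plus the scaled low-rank update.
    return W @ x + (alpha / r) * (B @ (A @ x))

def merged_weight():
    # Merging folds the adapter into the base weight once, so inference
    # needs no extra matmuls: W' = W + (alpha/r) * B @ A
    return W + (alpha / r) * (B @ A)

x = rng.standard_normal(d_in)
assert np.allclose(lora_forward(x), merged_weight() @ x)
```

The equivalence checked by the final assertion is why a merged checkpoint behaves identically to base-plus-adapter while being simpler to deploy.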
This model relies on data from the IMB dataset. **If you use this model in research or applications, you must cite the IMB paper (see Citation section below).**
---
## 📚 Training Dataset — IMB (Italian Medical Benchmark)
IMB is an Italian benchmark for medical question answering, designed to evaluate and improve LLM performance in clinical-domain Italian language understanding and reasoning.
The full dataset includes:
- **IMB-QA**: 782,644 doctor-patient conversations collected from Italian online medical forums
- **IMB-MCQA**: 25,862 multiple-choice questions derived from Italian medical specialization exams
⚠️ **Important:**
This model was trained **only on the Ortopedia subset (~10,000 samples)** of IMB, not on the full dataset.
Dataset repository:
πŸ‘‰ https://github.com/PRAISELab-PicusLab/IMB
---
## 🧪 Usage Example
```python
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "praiselab-picuslab/Llama-3.2-1B-Instruct-Ortopedia"
model = AutoModelForCausalLM.from_pretrained(model_id)
tokenizer = AutoTokenizer.from_pretrained(model_id)

# Illustrative Italian question about Ortopedia
prompt = "Quali sono le cause più comuni del dolore al ginocchio?"
inputs = tokenizer(prompt, return_tensors="pt")
outputs = model.generate(**inputs, max_new_tokens=150)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```
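Since this is an Instruct model, results are typically better when the prompt follows the Llama 3 chat format; in practice `tokenizer.apply_chat_template(messages, ...)` applies it for you. As a sketch, the format can be built by hand. The special tokens below follow the standard Llama 3 conventions and should be verified against the tokenizer's own chat template; the Italian question is a hypothetical example.

```python
def build_llama3_prompt(user_message, system_message=None):
    # Hand-rolled version of the Llama 3 chat format for illustration;
    # prefer tokenizer.apply_chat_template in real code.
    parts = ["<|begin_of_text|>"]
    if system_message:
        parts.append(
            f"<|start_header_id|>system<|end_header_id|>\n\n{system_message}<|eot_id|>"
        )
    parts.append(
        f"<|start_header_id|>user<|end_header_id|>\n\n{user_message}<|eot_id|>"
    )
    # Generation continues from the assistant header.
    parts.append("<|start_header_id|>assistant<|end_header_id|>\n\n")
    return "".join(parts)

# Hypothetical Italian question about Ortopedia.
prompt = build_llama3_prompt(
    "Quali sono i sintomi tipici di una frattura del femore?",
    system_message="Sei un assistente medico esperto di ortopedia.",
)
```

The resulting string can then be passed to `tokenizer(...)` exactly as in the usage example above.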
---
## ⚠️ Usage Restrictions
* Allowed use: **Non-commercial research only**
* Redistribution: Not allowed without explicit authorization
* Mandatory citation: The IMB dataset paper must be cited in any publication or derived work
---
## 📄 Citation
If you use this model, the IMB dataset, or derived outputs in research, please cite:
```bibtex
@inproceedings{DBLP:conf/clic-it/RomanoRBPM25,
author = {Antonio Romano and
Giuseppe Riccio and
Mariano Barone and
Marco Postiglione and
Vincenzo Moscato},
editor = {Cristina Bosco and
Elisabetta Jezek and
Marco Polignano and
Manuela Sanguinetti},
title = {{IMB:} An Italian Medical Benchmark for Question Answering},
booktitle = {Proceedings of the Eleventh Italian Conference on Computational Linguistics
(CLiC-it 2025), Cagliari, Italy, September 24-26, 2025},
series = {{CEUR} Workshop Proceedings},
volume = {4112},
publisher = {CEUR-WS.org},
year = {2025},
url = {https://ceur-ws.org/Vol-4112/92_main_long.pdf}
}
```
---
## πŸ— Training Details
* Base model: `unsloth/Llama-3.2-1B-Instruct-unsloth-bnb-4bit`
* Fine-tuning method: LoRA (Unsloth)
* Quantization: 4-bit (BitsAndBytes)
* Adapter merging: Yes (Full merged model)
* Language: Italian
* Domain: Medical — Ortopedia
* Training size: ~10,000 samples
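For reference, 4-bit BitsAndBytes loading of the kind used by the `*-bnb-4bit` base checkpoint is typically configured as below. This is a hypothetical configuration fragment with commonly used NF4 settings, not the exact configuration used for this model.

```python
import torch
from transformers import BitsAndBytesConfig

# Assumed settings (NF4 + double quantization) typical of bnb-4bit checkpoints.
bnb_config = BitsAndBytesConfig(
    load_in_4bit=True,                       # store weights in 4-bit precision
    bnb_4bit_quant_type="nf4",               # NormalFloat4 quantization
    bnb_4bit_compute_dtype=torch.bfloat16,   # dtype used for matmuls
    bnb_4bit_use_double_quant=True,          # quantize the quantization constants
)
```

The config object would be passed as `quantization_config=bnb_config` to `AutoModelForCausalLM.from_pretrained`.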
---
## 📜 License
This work is licensed under a
[Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License][cc-by-nc-nd].
[![CC BY-NC-ND 4.0][cc-by-nc-nd-image]][cc-by-nc-nd]
[cc-by-nc-nd]: http://creativecommons.org/licenses/by-nc-nd/4.0/
[cc-by-nc-nd-image]: https://licensebuttons.net/l/by-nc-nd/4.0/88x31.png
---
## 🤝 Acknowledgements
πŸ‘¨β€πŸ’» This project was developed by Mariano Barone, Roberta Di Marino, Francesco Di Serio, Giovanni Dioguardi, Marco Postiglione, Antonio Romano, Giuseppe Riccio, and Vincenzo Moscato at University of Naples, Federico II