Question about finetuning dataset

by xiahao2 - opened Feb 2

Discussion

xiahao2

Feb 2

Hello, may I ask what dataset you used for fine-tuning, what its license is, and how much data was used?

Best regards

tarasz98

Owner Feb 11

Hi! I used the FrancophonIA/English-French dataset. The license I assume CC0: Public Domain, since the dataset was extracted from Kaggle. And I used the whole thing to fine-tune.
This model was an exercise part of the LLM Course (from HF), in Section 7.3.

xiahao2

Feb 12

I understand. Thank you very much for your explanation.

xiahao2 changed discussion status to closed Feb 12

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment