clemsail commited on
Commit
2aa5b8b
·
verified ·
1 Parent(s): e888b48

docs: PST-verbatim model card v0.4.3 (Commission template 2025-07-24)

Browse files
Files changed (1) hide show
  1. README.md +9 -8
README.md CHANGED
@@ -5,6 +5,7 @@ tags:
5
  - lora
6
  - peft
7
  - mlx
 
8
  - eu-kiki
9
  - eu-ai-act
10
  - art-52
@@ -18,7 +19,7 @@ library_name: peft
18
 
19
  # eu-kiki-devstral-rust-lora
20
 
21
- LoRA adapter for **mistralai/Devstral-Small-2-24B-Instruct-2512**, part of the [eu-kiki](https://github.com/L-electron-Rare/eu-kiki) project. Live demo: https://ml.saillant.cc.
22
 
23
  > **EU AI Act compliance.** This card follows the **European Commission's
24
  > *Template for the Public Summary of Training Content* for general-purpose
@@ -36,7 +37,7 @@ LoRA adapter for **mistralai/Devstral-Small-2-24B-Instruct-2512**, part of the [
36
 
37
  | Field | Value |
38
  |---|---|
39
- | **Provider name and contact details** | L'Électron Rare (Saillant Clément) — `clemsail` on Hugging Face — Issues: https://github.com/L-electron-Rare/eu-kiki/issues |
40
  | **Authorised representative name and contact details** | Not applicable — provider is established within the European Union (France). |
41
 
42
  ## 1.2. Model identification
@@ -57,7 +58,7 @@ LoRA adapter for **mistralai/Devstral-Small-2-24B-Instruct-2512**, part of the [
57
  | **Approximate size in alternative units** | ≈ 0.6 M tokens. |
58
  | **Latest date of data acquisition / collection for model training** | 11/2024 (StarCoder2 Self-Instruct release). The model is **not** continuously trained on new data after this date. |
59
  | **Linguistic characteristics of the overall training data** | English. No other natural languages. |
60
- | **Other relevant characteristics / additional comments** | LoRA fine-tune (rank 16, alpha 32, dropout 0.05); only attention projections (`q_proj`, `k_proj`, `v_proj`, `o_proj`) are trained. Per-record `_provenance` (source, SPDX licence, `record_idx`, `access_date`) attached at the system level (see [`docs/eu-ai-act-transparency.md`](https://github.com/L-electron-Rare/eu-kiki/blob/main/docs/eu-ai-act-transparency.md) §4.4). Tokenizer: inherited from the base model. |
61
 
62
  ---
63
 
@@ -127,8 +128,8 @@ _(N/A — no other data sources used.)_
127
 
128
  - **Public HF datasets (§2.1):** all carry permissive open licences (Apache-2.0, MIT, CC-BY-*, BSD); SPDX matrix verified per-source. The licences explicitly authorise instructional / model-training use for the rows actually selected.
129
  - **Web-scraped sources (§2.3):** prior to collection the provider verified `robots.txt`, `<meta name="robots" content="noai">`, `ai.txt`, and TDM-Reservation HTTP headers. Any source returning a reservation under Article 4(3) of Directive (EU) 2019/790 was excluded from collection. Scraping was limited to authoritative vendor-controlled repositories (ESP-IDF, STM32Cube, Arduino, KiCad symbols/footprints) operating under permissive licences.
130
- - **Vendor PDF datasheets (§2.2.2 where present):** processed under the EU DSM Directive Article 4 TDM exception. SHA-256 manifests and per-source legal-basis records are published in [`docs/pdf-compliance-report.md`](https://github.com/L-electron-Rare/eu-kiki/blob/main/docs/pdf-compliance-report.md).
131
- - **Public copyright policy (Art. 53(1)(c)):** [`docs/eu-ai-act-transparency.md`](https://github.com/L-electron-Rare/eu-kiki/blob/main/docs/eu-ai-act-transparency.md). Removal requests are handled via the issue tracker on the source repository; the provider commits to remove disputed content within 30 days and re-train on the next release cycle.
132
 
133
  ## 3.2. Removal of illegal content
134
 
@@ -152,8 +153,8 @@ _(N/A — no other data sources used.)_
152
  **HumanEval** (custom Studio scorer): base 87.20 → +rust 86.59 = **−0.61 pts**. Best of the three Devstral adapters in this release.
153
 
154
  Full bench results, methodology, env.json, and rerun.sh per measurement:
155
- [`eval/results/SUMMARY.md`](https://github.com/L-electron-Rare/eu-kiki/blob/main/eval/results/SUMMARY.md) ·
156
- [`MODEL_CARD.md`](https://github.com/L-electron-Rare/eu-kiki/blob/main/MODEL_CARD.md).
157
 
158
  ---
159
 
@@ -200,7 +201,7 @@ python -m mlx_lm fuse \
200
  title = {eu-kiki: EU-sovereign multi-model LLM serving with HF-traceable LoRA adapters},
201
  author = {Saillant, Clément},
202
  year = {2026},
203
- url = {https://github.com/L-electron-Rare/eu-kiki},
204
  note = {Live demo: https://ml.saillant.cc}
205
  }
206
  ```
 
5
  - lora
6
  - peft
7
  - mlx
8
+ - ailiance
9
  - eu-kiki
10
  - eu-ai-act
11
  - art-52
 
19
 
20
  # eu-kiki-devstral-rust-lora
21
 
22
+ LoRA adapter for **mistralai/Devstral-Small-2-24B-Instruct-2512**, part of the [ailiance](https://github.com/L-electron-Rare/ailiance) project. Live demo: https://ml.saillant.cc.
23
 
24
  > **EU AI Act compliance.** This card follows the **European Commission's
25
  > *Template for the Public Summary of Training Content* for general-purpose
 
37
 
38
  | Field | Value |
39
  |---|---|
40
+ | **Provider name and contact details** | L'Électron Rare (Saillant Clément) — `clemsail` on Hugging Face — Issues: https://github.com/L-electron-Rare/ailiance/issues |
41
  | **Authorised representative name and contact details** | Not applicable — provider is established within the European Union (France). |
42
 
43
  ## 1.2. Model identification
 
58
  | **Approximate size in alternative units** | ≈ 0.6 M tokens. |
59
  | **Latest date of data acquisition / collection for model training** | 11/2024 (StarCoder2 Self-Instruct release). The model is **not** continuously trained on new data after this date. |
60
  | **Linguistic characteristics of the overall training data** | English. No other natural languages. |
61
+ | **Other relevant characteristics / additional comments** | LoRA fine-tune (rank 16, alpha 32, dropout 0.05); only attention projections (`q_proj`, `k_proj`, `v_proj`, `o_proj`) are trained. Per-record `_provenance` (source, SPDX licence, `record_idx`, `access_date`) attached at the system level (see [`docs/eu-ai-act-transparency.md`](https://github.com/L-electron-Rare/ailiance/blob/main/docs/eu-ai-act-transparency.md) §4.4). Tokenizer: inherited from the base model. |
62
 
63
  ---
64
 
 
128
 
129
  - **Public HF datasets (§2.1):** all carry permissive open licences (Apache-2.0, MIT, CC-BY-*, BSD); SPDX matrix verified per-source. The licences explicitly authorise instructional / model-training use for the rows actually selected.
130
  - **Web-scraped sources (§2.3):** prior to collection the provider verified `robots.txt`, `<meta name="robots" content="noai">`, `ai.txt`, and TDM-Reservation HTTP headers. Any source returning a reservation under Article 4(3) of Directive (EU) 2019/790 was excluded from collection. Scraping was limited to authoritative vendor-controlled repositories (ESP-IDF, STM32Cube, Arduino, KiCad symbols/footprints) operating under permissive licences.
131
+ - **Vendor PDF datasheets (§2.2.2 where present):** processed under the EU DSM Directive Article 4 TDM exception. SHA-256 manifests and per-source legal-basis records are published in [`docs/pdf-compliance-report.md`](https://github.com/L-electron-Rare/ailiance/blob/main/docs/pdf-compliance-report.md).
132
+ - **Public copyright policy (Art. 53(1)(c)):** [`docs/eu-ai-act-transparency.md`](https://github.com/L-electron-Rare/ailiance/blob/main/docs/eu-ai-act-transparency.md). Removal requests are handled via the issue tracker on the source repository; the provider commits to remove disputed content within 30 days and re-train on the next release cycle.
133
 
134
  ## 3.2. Removal of illegal content
135
 
 
153
  **HumanEval** (custom Studio scorer): base 87.20 → +rust 86.59 = **−0.61 pts**. Best of the three Devstral adapters in this release.
154
 
155
  Full bench results, methodology, env.json, and rerun.sh per measurement:
156
+ [`eval/results/SUMMARY.md`](https://github.com/L-electron-Rare/ailiance/blob/main/eval/results/SUMMARY.md) ·
157
+ [`MODEL_CARD.md`](https://github.com/L-electron-Rare/ailiance/blob/main/MODEL_CARD.md).
158
 
159
  ---
160
 
 
201
  title = {eu-kiki: EU-sovereign multi-model LLM serving with HF-traceable LoRA adapters},
202
  author = {Saillant, Clément},
203
  year = {2026},
204
+ url = {https://github.com/L-electron-Rare/ailiance},
205
  note = {Live demo: https://ml.saillant.cc}
206
  }
207
  ```