r-e1 commited on
Commit
de560ca
·
verified ·
1 Parent(s): 2cf8b59

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +54 -3
README.md CHANGED
@@ -1,3 +1,54 @@
1
- ---
2
- license: apache-2.0
3
- ---
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ license: other
3
+ language:
4
+ - en
5
+ base_model:
6
+ - Qwen/Qwen3-4B-Base
7
+ base_model_relation: finetune
8
+ tags:
9
+ - code
10
+ - c
11
+ - clang
12
+ - cpp
13
+ - c++
14
+ - qlora
15
+ - cpt
16
+ library_name: transformers
17
+ pipeline_tag: text-generation
18
+ ---
19
+ ## Training Data
20
+
21
+ This model was trained on a dataset of curated C/C++ code from multiple licenses (GPL-2.0, Apache-2.0, MIT, public domain, and some source-available licenses, etc.).
22
+ The original authors are not affiliated with or responsible for this model.
23
+
24
+ ## Base Model
25
+
26
+ Base model: [Qwen/Qwen3-4B-Base](https://huggingface.co/Qwen/Qwen3-4B-Base)
27
+
28
+ ## Fine-tuning Method
29
+
30
+ - Adapter: QLoRA
31
+ - Method: CPT
32
+ - Precision: trained with 4-bit base weights + BF16 compute, then merged to safetensors
33
+
34
+ ## Training Details
35
+
36
+ - Training time: ~74 hours
37
+ - Hardware: 1x NVIDIA RTX 5060 Ti
38
+
39
+ ## Notes
40
+
41
+ - This is an **L0 base model**, it is not instruction-tuned and may be more verbose with strict formatting request compared to an instruct model.
42
+ - Recommended usage is raw code continuation, or pairing with an external template strategy.
43
+
44
+ ## Intended use
45
+
46
+ - Code generation for C/C++
47
+ - Fast code completion
48
+ - Examples and prototyping
49
+
50
+ ## Constraints
51
+
52
+ - May produce incorrect code
53
+ - May reproduce identifiable upstream code fragments (including license headers) when prompted.
54
+ - Verify outputs, especially for memory safety and security-sensitive code.