robertzty commited on
Commit
218febf
·
verified ·
1 Parent(s): fa1c2aa

Add files using upload-large-folder tool

Browse files
.gitattributes CHANGED
@@ -33,3 +33,5 @@ saved_model/**/* filter=lfs diff=lfs merge=lfs -text
33
  *.zip filter=lfs diff=lfs merge=lfs -text
34
  *.zst filter=lfs diff=lfs merge=lfs -text
35
  *tfevents* filter=lfs diff=lfs merge=lfs -text
 
 
 
33
  *.zip filter=lfs diff=lfs merge=lfs -text
34
  *.zst filter=lfs diff=lfs merge=lfs -text
35
  *tfevents* filter=lfs diff=lfs merge=lfs -text
36
+ mmproj-Cosmos-Reason2-32B-F16.gguf filter=lfs diff=lfs merge=lfs -text
37
+ Cosmos-Reason2-32B-BF16.gguf filter=lfs diff=lfs merge=lfs -text
Cosmos-Reason2-32B-BF16.gguf ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:4cd4c6ffb40ec2c30a729eb31528928c8e95349030387fb8dd0d14a64373940d
3
+ size 65531576896
NOTICE ADDED
@@ -0,0 +1,10 @@
 
 
 
 
 
 
 
 
 
 
 
1
+ Licensed by NVIDIA Corporation under the NVIDIA Open Model License.
2
+
3
+ Built on NVIDIA Cosmos.
4
+
5
+ Original model: nvidia/Cosmos-Reason2-32B
6
+ Original model page: https://huggingface.co/nvidia/Cosmos-Reason2-32B
7
+ License: https://www.nvidia.com/en-us/agreements/enterprise-software/nvidia-open-model-license
8
+
9
+ This repository contains a GGUF format conversion of the original model weights.
10
+ No quantization was applied.
README.md ADDED
@@ -0,0 +1,53 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ base_model: nvidia/Cosmos-Reason2-32B
3
+ library_name: llama.cpp
4
+ pipeline_tag: image-text-to-text
5
+ tags:
6
+ - gguf
7
+ - qwen3-vl
8
+ - cosmos
9
+ - nvidia
10
+ - multimodal
11
+ - image-text-to-text
12
+ - bf16
13
+ license: other
14
+ license_name: nvidia-open-model-license
15
+ license_link: https://www.nvidia.com/en-us/agreements/enterprise-software/nvidia-open-model-license
16
+ ---
17
+
18
+ # Cosmos-Reason2-32B GGUF
19
+
20
+ Pure GGUF conversion of `nvidia/Cosmos-Reason2-32B`.
21
+
22
+ Built on NVIDIA Cosmos.
23
+
24
+ ## Files
25
+
26
+ - `Cosmos-Reason2-32B-BF16.gguf`: BF16 text backbone GGUF.
27
+ - `mmproj-Cosmos-Reason2-32B-F16.gguf`: F16 multimodal projector / vision GGUF.
28
+
29
+ No quantization was applied.
30
+
31
+ ## Source
32
+
33
+ Original model: https://huggingface.co/nvidia/Cosmos-Reason2-32B
34
+
35
+ This GGUF conversion was produced with `llama.cpp` `convert_hf_to_gguf.py` from the original Hugging Face safetensors.
36
+
37
+ ## Usage
38
+
39
+ Use both files together for multimodal inference in `llama.cpp`:
40
+
41
+ ```bash
42
+ llama-server \
43
+ -m Cosmos-Reason2-32B-BF16.gguf \
44
+ --mmproj mmproj-Cosmos-Reason2-32B-F16.gguf
45
+ ```
46
+
47
+ The BF16 model is large and may require CPU offload or a multi-GPU setup.
48
+
49
+ ## License
50
+
51
+ Licensed by NVIDIA Corporation under the NVIDIA Open Model License.
52
+
53
+ See `NOTICE` and the original model card for license terms and usage requirements.
mmproj-Cosmos-Reason2-32B-F16.gguf ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:e9bbec648752a87f8922d20559648f2774ecd8a5af7374ad2a1b32262ca60fca
3
+ size 1196795648