File size: 1,955 Bytes
514d766
 
fb26a60
 
 
 
d4806e0
 
fb26a60
 
 
d4806e0
514d766
fb26a60
d4806e0
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
ffbe3f5
 
 
 
 
 
 
 
 
 
 
 
d4806e0
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
38431f0
d4806e0
 
 
 
 
 
 
67d5b01
 
d4806e0
fb26a60
d4806e0
 
 
 
 
 
 
 
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
---
license: mit
datasets:
- ylecun/mnist
tags:
- harley-ml
- image
- digit-to-image
- mnist
- small
- text-to-image
- supralabs
---

# **SupraMNiST-IMG-200k**

## Sumary

```
Task: Number-To-Image
Dataset: ylecun/mnist
Total training time: ~8 minutes
Inputs: Number (0-9) 
Outputs: 32x32 image
Params: ~201k
Framework: PyTorch, diffusers
Author: SupraLabs
```

## **Description**
MNiST-IMG-200k is an ~**200k parameter model** trained to **generate an image** based on an **input number (0-9)**. 

## Architecture

| Parameter            | Value      |
| -------------------- | ---------- |
| `image_size`         | `32`       |
| `in_channels`        | `1`        |
| `out_channels`       | `1`        |
| `num_classes`        | `10`       |
| `block_out_channels` | `[12, 16]` |
| `layers_per_block`   | `8`        |
| `norm_num_groups`    | `4`        |

## **Training**

### **Hardware**

MNiST-IMG was trained on Google Colaboratory (NVIDA Tesla T4) for ~8 minutes with a batch size of 64 for 10 epochs.

### **Dataset**

[ylecun/mnist](https://huggingface.co/ylecun/mnist)

### **Training Results**

Loss ended at ~**0.40**.

Note: I can't provide the raw training logs as I loss it somehwere after training. Sorry!

## **Generation Examples**

At **1000** decoding steps:

![1000 Decoding Step Digit Image Generation](images/digit_image_samples_1000s.png)

At **200** decoding steps:

![200 Decoding Step Generation Image](images/digit_image_samples_200s.png)

# Inference

Use the script in the repo. [inference.py](https://huggingface.co/Harley-ml/MNIST-IMG-390k/blob/main/inference.py)

### Related Models

1. [MNIST-IMG-390k](https://huggingface.co/Harley-ml/MNIST-IMG-390k)

# Citation

```bibtex
@misc{mnist-img-390k,
  title     = {MNIST-IMG-390k: a Tiny Diffusion Model for Generating Handwritten Digits},
  author    = {Paul Courneya; Harley-ml},
  year      = {2026},
  url       = {https://huggingface.co/Harley-ml/MNIST-IMG-390k}
}
```