Commit History

Novel GPU memory reduction experiments
6de6495
verified

ticketguy commited on

CogMemBench paper for Kaggle + GPU memory reduction experiments
d7a2e27
verified

ticketguy commited on

GPU memory experiment + CogMemBench on real model
ae7a539
verified

ticketguy commited on

Update paper + README with final GPU results, fix Colab, research GPU memory reduction
246f26e
verified

ticketguy commited on

Fix engine compilation errors
3d1f75d
verified

ticketguy commited on

Test engine compilation
30d0fa0
verified

ticketguy commited on

FigQuant GPU training test (with dtype fix)
be68d13
verified

ticketguy commited on

Fix lowram backward dtype bug + AGPL licenses + rerun GPU test
11e27f8
verified

ticketguy commited on

Fix FigQuant GPU benchmark (use figcache mode) + test engine conversion
282001f
verified

ticketguy commited on

Engine Phase 3: Complete format converter + BPE tokenizer + kernel wiring
e75ae96
verified

ticketguy commited on

Engine Phase 2: Full transformer forward pass + tokenizer + attention
bc38a2c
verified

ticketguy commited on

Engine Phase 1b: AVX2 vectorized matmul + RMSNorm kernels
bb40248
verified

ticketguy commited on

Lila Engine Phase 1: Foundation — matmul kernel + model loader + token generation
5a1c190
verified

ticketguy commited on

Full GPU benchmark including FP32 quant quality test
1b1fe45
verified

ticketguy commited on

Training-only GPU benchmark (skip FP32 quant quality — already proven on CPU)
45b9d61
verified

ticketguy commited on

Lila inference engine build plan + fix LilaCore
6e900ca
verified

ticketguy commited on

Lila restructure script
f279a18
verified

ticketguy commited on

GPU benchmark script (fixed for PyTorch 2.11)
5c41b47
verified

ticketguy commited on

Update ML Intern artifact metadata
bda7fa3
verified

ticketguy commited on

initial commit
a8d0035
verified

ticketguy commited on