domainTokenizer / notebooks

Commit History

Update 02_ecommerce notebook: add HF login, memory-free cell, subsample option for <64GB RAM machines
2410b7e
verified

rtferraz commited on

Add 02_ecommerce_pretrain.ipynb β€” REES46 e-commerce pre-training with sequential entropy check, wandb, push to hub
d60868a
verified

rtferraz commited on

Fix notebook: total_mem β†’ total_memory, add hub_model_id push, add wandb logging support
65ecf7e
verified

rtferraz commited on

Add 01_finance_pretrain.ipynb β€” Phase 3.1 notebook for pre-training on 5M Nigerian financial transactions
2c3ddfa
verified

rtferraz commited on