rtferraz/domainTokenizer
arxiv:
9 papers
7edb04f
domainTokenizer
249 kB
1 contributor
History:
33 commits
rtferraz
Update package to v0.4.0 with fine-tuning exports
7edb04f
verified
8 days ago
docs
Add Phase 2A-2C implementation report - technical decisions, architecture summary, test results
8 days ago
src
Update package to v0.4.0 with fine-tuning exports
8 days ago
tests
Add training test suite - 19 tests covering data pipeline, packing, collation, integration, Trainer smoke test
8 days ago
.gitattributes
Safe
1.52 kB
initial commit
8 days ago
README.md
Safe
8.46 kB
Update README v0.3.0 - add usage example, update roadmap status, add implementation report link
8 days ago