rtferraz/domainTokenizer
arxiv: 9 papers
d685c0e
domainTokenizer/src/domain_tokenizer/models
16.3 kB
1 contributor
History: 4 commits
rtferraz
Add PLR embeddings (Gorishniy et al. 2022)
d685c0e
verified
13 days ago
__init__.py
685 Bytes
Phase 2B: Model architecture - DomainTransformerForCausalLM (NoPE, GPT-style), PLR embeddings, DCNv2 + JointFusion, 105 passing tests
13 days ago
configuration.py
4.46 kB
Add DomainTransformerConfig with presets (24M/85M/330M)
13 days ago
modeling.py
9.05 kB
Add DomainTransformerForCausalLM - GPT-style NoPE model with SDPA attention, weight tying, HF Trainer compatible
13 days ago
plr_embeddings.py
2.06 kB
Add PLR embeddings (Gorishniy et al. 2022)
13 days ago