rtferraz/domainTokenizer

domainTokenizer/src/domain_tokenizer/models
16.3 kB
  • 1 contributor
History: 4 commits
rtferraz
Add PLR embeddings (Gorishniy et al. 2022)
d685c0e verified 13 days ago
  • __init__.py
    685 Bytes
    Phase 2B: Model architecture - DomainTransformerForCausalLM (NoPE, GPT-style), PLR embeddings, DCNv2 + JointFusion, 105 passing tests 13 days ago
  • configuration.py
    4.46 kB
    Add DomainTransformerConfig with presets (24M/85M/330M) 13 days ago
  • modeling.py
    9.05 kB
    Add DomainTransformerForCausalLM - GPT-style NoPE model with SDPA attention, weight tying, HF Trainer compatible 13 days ago
  • plr_embeddings.py
    2.06 kB
    Add PLR embeddings (Gorishniy et al. 2022) 13 days ago