Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Buckets new
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
zaydzuhri 's Collections
Token Order Prediction
Softpick

Token Order Prediction

updated Sep 1, 2025

Pretrained models from the paper "Predicting the Order of Upcoming Tokens Improves Language Modeling"

Upvote
-

  • Predicting the Order of Upcoming Tokens Improves Language Modeling

    Paper • 2508.19228 • Published Aug 26, 2025 • 23

  • zaydzuhri/vanilla-340M-4096-model

    0.4B • Updated Sep 1, 2025 • 2

  • zaydzuhri/mtp-340M-4096-model

    0.4B • Updated Sep 1, 2025 • 1

  • zaydzuhri/top-340M-4096-model

    0.4B • Updated Sep 1, 2025 • 3 • 1

  • zaydzuhri/vanilla-1.8B-4096-model

    2B • Updated Sep 1, 2025 • 1

  • zaydzuhri/mtp-1.8B-4096-model

    2B • Updated Sep 1, 2025 • 11

  • zaydzuhri/top-1.8B-4096-model

    2B • Updated Sep 1, 2025 • 1

  • zaydzuhri/vanilla-7B-4096-model

    7B • Updated Sep 1, 2025 • 4

  • zaydzuhri/mtp-7B-4096-model

    7B • Updated Sep 1, 2025 • 1

  • zaydzuhri/top-7B-4096-model

    7B • Updated Sep 1, 2025 • 1
Upvote
-
  • Collection guide
  • Browse collections
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs