GPT Scorer β Pump.fun Token Migration Predictor
XGBoost model trained on 135K+ pump.fun token launches to predict which tokens will migrate to a DEX (PumpSwap/Raydium/Meteora/Jupiter) vs rug/stall/abandon.
Performance
| Metric | Value |
|---|---|
| ROC AUC | 0.9899 |
| Precision @ 80% | 80.2% |
| Recall @ 80% | 78.3% |
| F1 (migrated) | 0.66 |
| Training samples | 135,617 |
| Features | 36 |
| Trained | Daily at 6am UTC |
Top Features
entity_total_migrationsβ how many times entity has migrated beforeentity_migration_rateβ entity success rateinactivity_gap_secondsβ dead time between tradesis_quality_devβ archetype classificationdistinct_sellerstotal_sellsis_bot_clusterβ bot detectioncurve_pct_filledβ bonding curve progressentity_priorβ Bayesian prior scoredistinct_buyers
Usage
import xgboost as xgb
from huggingface_hub import hf_hub_download
# Download model
model_path = hf_hub_download(repo_id="jrhode666/gpt-scorer", filename="xgb_scorer.json")
# Load
model = xgb.XGBClassifier()
model.load_model(model_path)
# Predict (36 features required β see xgb_meta.json)
probability = model.predict_proba(features_df)[:, 1]
Features (36 total)
See xgb_meta.json for the full feature list. Core categories:
- Activity: unique_buyers_15s/30s/60s, buy_count_*, sell_count_*, total_buys, total_sells
- Microstructure: early_sell_pressure, buyer_concentration, repeat_buyer_rate
- Entity history: entity_prior, entity_confidence, entity_migration_rate, entity_rug_rate, entity_total_launches, entity_total_migrations
- Archetype: is_quality_dev, is_moderate_dev, is_whale_dev, is_serial_rugger, is_bot_cluster
- Temporal: launch_hour_utc, inactivity_gap_seconds
- Market: curve_pct_filled
- Derived: buy_sell_ratio, seller_buyer_ratio, buyers_per_minute
Target
Binary classification: 1 if token migrated to DEX (PumpSwap, Raydium, Meteora, Jupiter), 0 otherwise.
License
MIT