hamverbot commited on
Commit
9f3c561
·
verified ·
1 Parent(s): c900336

Upload RESEARCH_RESOURCES.md

Browse files
Files changed (1) hide show
  1. RESEARCH_RESOURCES.md +254 -0
RESEARCH_RESOURCES.md ADDED
@@ -0,0 +1,254 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ # RTB Bidding Algorithm Comparison — Complete Research Resource List
2
+
3
+ > Generated: 2026-05-05 | Repository: https://huggingface.co/hamverbot/bidding_algorithms_benchmark
4
+
5
+ ---
6
+
7
+ ## Table of Contents
8
+
9
+ 1. [Bidding Algorithms](#1-bidding-algorithms)
10
+ 2. [CTR Prediction Models](#2-ctr-prediction-models)
11
+ 3. [Clearing Price / Market Price Prediction](#3-clearing-price--market-price-prediction)
12
+ 4. [Datasets](#4-datasets)
13
+ 5. [Codebases & Implementations](#5-codebases--implementations)
14
+ 6. [Benchmark Leaderboards](#6-benchmark-leaderboards)
15
+ 7. [Recommended Architecture](#7-recommended-architecture)
16
+
17
+ ---
18
+
19
+ ## 1. Bidding Algorithms
20
+
21
+ ### 1.1 Lagrangian Dual + Online Gradient Descent (BEST MATCH)
22
+
23
+ | Property | Detail |
24
+ |----------|--------|
25
+ | **Paper** | "Learning to Bid in Repeated First-Price Auctions with Budgets" |
26
+ | **Authors** | Qian Wang, Zongjun Yang, Xiaotie Deng, Yuqing Kong (2023) |
27
+ | **Venue** | NeurIPS 2023 |
28
+ | **arXiv** | [2304.13477](https://arxiv.org/abs/2304.13477) |
29
+ | **HF Papers** | https://huggingface.co/papers/2304.13477 |
30
+ | **Algorithm** | DualOGD — Lagrangian dual multiplier updated by online error gradient descent |
31
+ | **Auction Type** | First-price (also handles second-price) |
32
+ | **Constraints** | Budget cap: total spend ≤ ρT |
33
+ | **Regret Bound** | Õ(√T) for both full-information and one-sided feedback |
34
+ | **Key Formula** | λ_{t+1} = Proj_{λ>0}(λ_t − ε·(ρ − c̃_t(b_t))) |
35
+ | **Bid Rule** | b_t = argmax_b (r̃_t(v_t, b) − λ_t·c̃_t(b)) |
36
+ | **Prediction Models Needed** | CTR predictor (for v_t), empirical CDF of competing bids (G̃) |
37
+
38
+ ### 1.2 Dual Mirror Descent (Second-Price)
39
+
40
+ | Property | Detail |
41
+ |----------|--------|
42
+ | **Paper** | "The Best of Many Worlds: Dual Mirror Descent for Online Allocation Problems" |
43
+ | **Authors** | Santiago Balseiro, Haihao Lu, Vahab Mirrokni (2020) |
44
+ | **Venue** | Operations Research (2023) |
45
+ | **arXiv** | [2011.10124](https://arxiv.org/abs/2011.10124) |
46
+ | **Citations** | 135+ |
47
+ | **Algorithm** | Dual mirror descent — generalizes OGD with Bregman divergences |
48
+ | **Auction Type** | Second-price (truthful) |
49
+ | **Bid Rule** | b_t = v_t / (1 + μ_t) |
50
+ | **Dual Update** | μ_{t+1} = Proj(μ_t − η·(ρ − payment_t)) |
51
+ | **Key Insight** | No market price model needed for second-price auctions |
52
+ | **Prediction Models** | CTR predictor only |
53
+
54
+ ### 1.3 Dual Descent with RoS + Budget (Multi-Constraint)
55
+
56
+ | Property | Detail |
57
+ |----------|--------|
58
+ | **Paper** | "Online Bidding Algorithms for Return-on-Spend Constrained Advertisers" |
59
+ | **Authors** | Zhe Feng, Swati Padmanabhan, Di Wang (2022) |
60
+ | **Venue** | ICML 2022 |
61
+ | **arXiv** | [2208.13713](https://arxiv.org/abs/2208.13713) |
62
+ | **Algorithm** | Two dual variables: λ for RoS, μ for budget |
63
+ | **Bid Rule** | b_t = ((1+λ_t)/(μ_t+λ_t)) · v_t |
64
+ | **Key Insight** | Adaptable for k% spend floor — second dual variable enforces minimum spend |
65
+
66
+ ### 1.4 RLB — Reinforcement Learning Bidding
67
+
68
+ | Property | Detail |
69
+ |----------|--------|
70
+ | **Paper** | "Real-Time Bidding by Reinforcement Learning in Display Advertising" |
71
+ | **Authors** | Han Cai et al. (2017) |
72
+ | **Venue** | WSDM 2017 |
73
+ | **arXiv** | [1701.02490](https://arxiv.org/abs/1701.02490) |
74
+ | **GitHub** | https://github.com/han-cai/rlb-dp (188 stars) |
75
+ | **Algorithm** | MDP + Dynamic Programming + Neural value function |
76
+ | **Results** | +22% clicks over linear bidding on iPinYou |
77
+ | **Prediction Models** | CTR θ(x) + market price distribution m(δ, x) |
78
+
79
+ ### 1.5 HiBid — Industrial Hierarchical Dual-RL
80
+
81
+ | Property | Detail |
82
+ |----------|--------|
83
+ | **Paper** | "HiBid: A Cross-Channel Constrained Bidding System" |
84
+ | **arXiv** | [2312.17503](https://arxiv.org/abs/2312.17503) |
85
+ | **Scale** | 64K advertisers, 70M requests/day, 4 channels, Meituan |
86
+ | **Algorithm** | High-level RL budget allocation + Low-level λ-parameterized bidding |
87
+
88
+ ### Unified Dual Multiplier Template
89
+
90
+ ```
91
+ For each auction t:
92
+ 1. Observe value v_t (from CTR prediction × click value)
93
+ 2. Compute bid: b_t = f(v_t, dual_multiplier_t)
94
+ 3. Observe outcome: payment c_t (if won) or 0 (if lost)
95
+ 4. Compute gradient: g_t = ρ − c_t
96
+ 5. Update multiplier: λ_{t+1} = Proj_{λ≥0}(λ_t − η·g_t)
97
+ ```
98
+
99
+ | Method | Auction | Bid Function f(v, λ) |
100
+ |--------|---------|----------------------|
101
+ | Wang 2023 | First-price | argmax_b (r̃(v,b) − λ·c̃(b)) |
102
+ | Balseiro 2020 | Second-price | v / (1+λ) |
103
+ | Feng 2022 | Second-price | ((1+λ_RoS)/(λ_RoS+λ_budget)) · v |
104
+
105
+ ---
106
+
107
+ ## 2. CTR Prediction Models
108
+
109
+ ### 2.1 FinalMLP (RECOMMENDED)
110
+
111
+ | Property | Detail |
112
+ |----------|--------|
113
+ | **Paper** | "FinalMLP: An Enhanced Two-Stream MLP Model for CTR Prediction" |
114
+ | **arXiv** | [2304.00902](https://arxiv.org/abs/2304.00902) |
115
+ | **Criteo AUC** | **0.8149** |
116
+ | **Avazu AUC** | **0.7666** |
117
+ | **Architecture** | Two-stream MLP + feature gating + bilinear fusion |
118
+ | **Inference** | <1ms — best for RTB latency constraints |
119
+
120
+ ### 2.2 Other Top Models
121
+
122
+ | Model | Criteo AUC | Architecture | RTB-Suitable |
123
+ |-------|-----------|-------------|--------------|
124
+ | **FinalMLP** | 0.8149 | Two-stream MLP | ✅ Best |
125
+ | **DCNv2** | 0.8142-0.8144 | CrossNetV2 + DNN | ✅ |
126
+ | **GDCN** | 0.8161* | Gated Cross + DNN | ✅ |
127
+ | **DeepFM** | 0.8138 | FM + DNN | ✅ |
128
+ | **FCN** | New | LCN + ECN (no DNN) | ✅ |
129
+ | DIN | — | Attention (user history) | ❌ Slow |
130
+ | DIEN | — | GRU + attention | ❌ Slow |
131
+
132
+ *GDCN uses own data split — not directly comparable.
133
+
134
+ **BARS Meta-Finding (2009.05794):** After 7,000+ experiments, SOTA deep CTR models differ by only 0.1-0.3% AUC. Architecture matters less than data preprocessing, hyperparameter tuning, and feature engineering.
135
+
136
+ ---
137
+
138
+ ## 3. Clearing Price / Market Price Prediction
139
+
140
+ ### 3.1 Non-Parametric Empirical CDF (BASELINE)
141
+
142
+ | Property | Detail |
143
+ |----------|--------|
144
+ | **Source** | Wang et al. (2023), Algorithm 1 |
145
+ | **Method** | G̃_t(b) = (1/(t-1))∑𝟙{b ≥ d_s} |
146
+ | **Pros** | No training, theoretically sound, handles distribution shift |
147
+ | **Cons** | No context, cold-start |
148
+
149
+ ### 3.2 Deep Censored Learning / Survival Analysis
150
+
151
+ | Property | Detail |
152
+ |----------|--------|
153
+ | **Library** | **TorchSurv** (Novartis, 200★) [2404.10761] |
154
+ | **URL** | https://github.com/Novartis/torchsurv |
155
+ | **Method** | Neural net with censored survival loss |
156
+ | **Loss** | Win: -log f(price\|x); Loss: -log S(bid\|x) |
157
+ | **Key Insight** | Proper survival framework handles censoring |
158
+
159
+ ### 3.3 Censored Linear Regression (Wu et al. 2015, KDD)
160
+
161
+ | Property | Detail |
162
+ |----------|--------|
163
+ | **Method** | Tobit-like: log(market_price) = β·x + ε, ε ~ N(0, σ²) |
164
+ | **Pros** | Contextual, simple |
165
+ | **Cons** | Linear — limited capacity |
166
+
167
+ ### Comparison
168
+
169
+ | Method | Contextual? | Handles Censoring? | Training? | Complexity |
170
+ |--------|-------------|-------------------|-----------|------------|
171
+ | Empirical CDF | ❌ | N/A | None | Minimal |
172
+ | Censored Linear | ✅ | ✅ | Light | Low |
173
+ | Deep Survival | ✅ | ✅ | Neural net | Medium |
174
+ | Win Prob NN | ✅ | ❌ | Neural net | Low |
175
+
176
+ ---
177
+
178
+ ## 4. Datasets
179
+
180
+ ### CTR Prediction (Verified on HF Hub)
181
+
182
+ | Dataset | HF Path | Size | Verified |
183
+ |---------|---------|------|----------|
184
+ | Criteo_x4 | reczoo/Criteo_x4 | 45.8M rows, 5.6GB | ✅ |
185
+ | Avazu_x4 | reczoo/Avazu_x4 | 40.4M rows, 1.8GB | ✅ |
186
+
187
+ ### RTB Bidding (External Only)
188
+
189
+ | Dataset | Source | Availability |
190
+ |---------|--------|-------------|
191
+ | iPinYou | data.computational-advertising.org | External download |
192
+ | YOYI | Various mirrors | External download |
193
+
194
+ ---
195
+
196
+ ## 5. Codebases
197
+
198
+ | Library | URL | Purpose |
199
+ |---------|-----|---------|
200
+ | **FuxiCTR** | https://github.com/reczoo/FuxiCTR | 40+ CTR models, config-driven |
201
+ | **DeepCTR-Torch** | https://github.com/shenweichen/DeepCTR-Torch | 20+ CTR models, simple API |
202
+ | **TorchSurv** | https://github.com/Novartis/torchsurv | Deep survival for clearing price |
203
+ | **BARS** | https://github.com/openbenchmark/BARS | Standardized CTR benchmark |
204
+ | **rlb-dp** | https://github.com/han-cai/rlb-dp | RL for RTB |
205
+ | **budget_constrained_bidding** | https://github.com/dingmu365/budget_constrained_bidding | Budget-constrained algorithms |
206
+
207
+ ---
208
+
209
+ ## 6. Benchmark Leaderboards
210
+
211
+ | Leaderboard | URL |
212
+ |-------------|-----|
213
+ | BARS CTR Criteo_x4 | https://openbenchmark.github.io/BARS/CTR/leaderboard/criteo_x4.html |
214
+ | BARS CTR Avazu | https://openbenchmark.github.io/BARS/CTR/leaderboard/avazu_x4.html |
215
+
216
+ ---
217
+
218
+ ## 7. Recommended Architecture
219
+
220
+ ```
221
+ ┌─────────────────────────────────────────────┐
222
+ │ BIDDING ALGORITHM │
223
+ │ Dual OGD: λ_{t+1} = Proj(λ_t - ε·(ρ - c̃)) │
224
+ │ Two-sided: μ (cap) + ν (floor) │
225
+ ├─────────────────────────────────────────────┤
226
+ │ PREDICTION MODELS │
227
+ │ ┌──────────────┐ ┌────────────────────┐ │
228
+ │ │ FinalMLP │ │ Empirical CDF / │ │
229
+ │ │ v_t=pCTR×V │ │ TorchSurv │ │
230
+ │ └──────────────┘ └────────────────────┘ │
231
+ ├─────────────────────────────────────────────┤
232
+ │ DATASETS │
233
+ │ Criteo_x4 + synthetic auction simulation │
234
+ └─────────────────────────────────────────────┘
235
+ ```
236
+
237
+ ## Paper Index
238
+
239
+ | # | Paper | arXiv | Year | Citations |
240
+ |---|-------|-------|------|-----------|
241
+ | 1 | Wang et al. — First-Price Auctions with Budgets | 2304.13477 | 2023 | Growing |
242
+ | 2 | Balseiro et al. — Dual Mirror Descent | 2011.10124 | 2020 | 135+ |
243
+ | 3 | Feng et al. — RoS Constrained Bidding | 2208.13713 | 2022 | 38+ |
244
+ | 4 | Cai et al. — RLB | 1701.02490 | 2017 | 300+ |
245
+ | 5 | Wang et al. — HiBid | 2312.17503 | 2023 | New |
246
+ | 6 | — Contextual First-Price (Quantile) | 2603.07207 | 2026 | New |
247
+ | 7 | Mao et al. — FinalMLP | 2304.00902 | 2023 | Growing |
248
+ | 8 | Wang et al. — GDCN | 2311.04635 | 2023 | Growing |
249
+ | 9 | Wang et al. — DCN V2 | 2008.13535 | 2021 | 500+ |
250
+ | 10 | Guo et al. — DeepFM | — | 2017 | 3000+ |
251
+ | 11 | Zhu et al. — BARS-CTR | 2009.05794 | 2021 | 100+ |
252
+ | 12 | Wu et al. — Censored Price Prediction | — | 2015 | 101 |
253
+ | 13 | — TorchSurv | 2404.10761 | 2024 | New |
254
+ | 14 | — Robust Budget Pacing | 2302.02006 | 2023 | Growing |