-
Cached Transformers: Improving Transformers with Differentiable Memory Cache
Paper • 2312.12742 • Published • 13 -
ProTIP: Progressive Tool Retrieval Improves Planning
Paper • 2312.10332 • Published • 8 -
Paloma: A Benchmark for Evaluating Language Model Fit
Paper • 2312.10523 • Published • 13 -
The FineWeb Datasets: Decanting the Web for the Finest Text Data at Scale
Paper • 2406.17557 • Published • 102
daje kang
daje
AI & ML interests
None yet
Recent Activity
updated a dataset 4 days ago
daje/korean-tts-training published a dataset 4 days ago
daje/korean-tts-training liked a dataset 4 months ago
nvidia/ToolScaleOrganizations
Paper
-
Cached Transformers: Improving Transformers with Differentiable Memory Cache
Paper • 2312.12742 • Published • 13 -
ProTIP: Progressive Tool Retrieval Improves Planning
Paper • 2312.10332 • Published • 8 -
Paloma: A Benchmark for Evaluating Language Model Fit
Paper • 2312.10523 • Published • 13 -
The FineWeb Datasets: Decanting the Web for the Finest Text Data at Scale
Paper • 2406.17557 • Published • 102
models 41
daje/whisper-v3-turbo-address
Automatic Speech Recognition • 0.8B • Updated
daje/Qwen2-VL-7B-Instruct-fashion-product-images-small
8B • Updated • 1
daje/Meta-Llama-3.1-8B-Instruct-de-identification
8B • Updated • 1
daje/Qwen2.5-14B-Instruct-tools
Text Generation • 15B • Updated • 2
daje/model_0.0002_alpha-32_r-64
Updated • 178
daje/model_0.0002_alpha-8_r-16
Updated • 187
daje/model_5e-05_alpha-128_r-256
Updated • 571
daje/model_2e-4_alpha-8_r-16
Updated • 555
daje/model_Lora
Updated • 70
daje/model_2e-4
Updated • 949
datasets 20
daje/korean-tts-training
Viewer • Updated • 120 • 456
daje/korean-address-voice-v2
Viewer • Updated • 3.74k • 28
daje/korean-address-voice
Viewer • Updated • 118 • 6
daje/synthetic-ko-sql-hard-add-llm-result
Viewer • Updated • 1.68k • 5
daje/synthetic-ko-sql-hard
Viewer • Updated • 1.68k • 7 • 1
daje/kotext-to-sql-v1-hard
Viewer • Updated • 2k • 12
daje/kaggle-image-datasets
Viewer • Updated • 44.4k • 14
daje/de-identify-chat-ko
Viewer • Updated • 9.92k • 7
daje/ko-hatefulmemes_train_8500
Viewer • Updated • 8.2k • 24
daje/ko-hatefulmemes_train_8500_kmhas
Viewer • Updated • 95.3k • 42