-
LoFT: Parameter-Efficient Fine-Tuning for Long-tailed Semi-Supervised Learning in Open-World Scenarios
Paper • 2509.09926 • Published • 14 -
What Breaks Knowledge Graph based RAG? Empirical Insights into Reasoning under Incomplete Knowledge
Paper • 2508.08344 • Published -
MemMamba: Rethinking Memory Patterns in State Space Model
Paper • 2510.03279 • Published • 74 -
When Thoughts Meet Facts: Reusable Reasoning for Long-Context LMs
Paper • 2510.07499 • Published • 49
Collections
Discover the best community collections!
Collections including paper arxiv:2403.09029
-
Unlocking the conversion of Web Screenshots into HTML Code with the WebSight Dataset
Paper • 2403.09029 • Published • 57 -
LLMLingua-2: Data Distillation for Efficient and Faithful Task-Agnostic Prompt Compression
Paper • 2403.12968 • Published • 25 -
RAFT: Adapting Language Model to Domain Specific RAG
Paper • 2403.10131 • Published • 72 -
Quiet-STaR: Language Models Can Teach Themselves to Think Before Speaking
Paper • 2403.09629 • Published • 79
-
MM1: Methods, Analysis & Insights from Multimodal LLM Pre-training
Paper • 2403.09611 • Published • 129 -
Unlocking the conversion of Web Screenshots into HTML Code with the WebSight Dataset
Paper • 2403.09029 • Published • 57 -
GiT: Towards Generalist Vision Transformer through Universal Language Interface
Paper • 2403.09394 • Published • 26
-
Unlocking the conversion of Web Screenshots into HTML Code with the WebSight Dataset
Paper • 2403.09029 • Published • 57 -
Cleaner Pretraining Corpus Curation with Neural Web Scraping
Paper • 2402.14652 • Published -
LLaVA-UHD: an LMM Perceiving Any Aspect Ratio and High-Resolution Images
Paper • 2403.11703 • Published • 17
-
Unlocking the conversion of Web Screenshots into HTML Code with the WebSight Dataset
Paper • 2403.09029 • Published • 57 -
HuggingFaceM4/WebSight
Viewer • Updated • 2.75M • 32.4k • 390 -
HuggingFaceM4/VLM_WebSight_finetuned
Text Generation • 8B • Updated • 87 • 192 -
laion/filtered-wit
Viewer • Updated • 2.8M • 5.08k • 11
-
FinTral: A Family of GPT-4 Level Multimodal Financial Large Language Models
Paper • 2402.10986 • Published • 82 -
bigcode/starcoder2-15b
Text Generation • Updated • 8.32k • 669 -
Zephyr: Direct Distillation of LM Alignment
Paper • 2310.16944 • Published • 123 -
mixedbread-ai/mxbai-rerank-large-v1
Text Ranking • Updated • 121k • 142
-
LoFT: Parameter-Efficient Fine-Tuning for Long-tailed Semi-Supervised Learning in Open-World Scenarios
Paper • 2509.09926 • Published • 14 -
What Breaks Knowledge Graph based RAG? Empirical Insights into Reasoning under Incomplete Knowledge
Paper • 2508.08344 • Published -
MemMamba: Rethinking Memory Patterns in State Space Model
Paper • 2510.03279 • Published • 74 -
When Thoughts Meet Facts: Reusable Reasoning for Long-Context LMs
Paper • 2510.07499 • Published • 49
-
MM1: Methods, Analysis & Insights from Multimodal LLM Pre-training
Paper • 2403.09611 • Published • 129 -
Unlocking the conversion of Web Screenshots into HTML Code with the WebSight Dataset
Paper • 2403.09029 • Published • 57 -
GiT: Towards Generalist Vision Transformer through Universal Language Interface
Paper • 2403.09394 • Published • 26
-
Unlocking the conversion of Web Screenshots into HTML Code with the WebSight Dataset
Paper • 2403.09029 • Published • 57 -
Cleaner Pretraining Corpus Curation with Neural Web Scraping
Paper • 2402.14652 • Published -
LLaVA-UHD: an LMM Perceiving Any Aspect Ratio and High-Resolution Images
Paper • 2403.11703 • Published • 17
-
Unlocking the conversion of Web Screenshots into HTML Code with the WebSight Dataset
Paper • 2403.09029 • Published • 57 -
LLMLingua-2: Data Distillation for Efficient and Faithful Task-Agnostic Prompt Compression
Paper • 2403.12968 • Published • 25 -
RAFT: Adapting Language Model to Domain Specific RAG
Paper • 2403.10131 • Published • 72 -
Quiet-STaR: Language Models Can Teach Themselves to Think Before Speaking
Paper • 2403.09629 • Published • 79
-
Unlocking the conversion of Web Screenshots into HTML Code with the WebSight Dataset
Paper • 2403.09029 • Published • 57 -
HuggingFaceM4/WebSight
Viewer • Updated • 2.75M • 32.4k • 390 -
HuggingFaceM4/VLM_WebSight_finetuned
Text Generation • 8B • Updated • 87 • 192 -
laion/filtered-wit
Viewer • Updated • 2.8M • 5.08k • 11
-
FinTral: A Family of GPT-4 Level Multimodal Financial Large Language Models
Paper • 2402.10986 • Published • 82 -
bigcode/starcoder2-15b
Text Generation • Updated • 8.32k • 669 -
Zephyr: Direct Distillation of LM Alignment
Paper • 2310.16944 • Published • 123 -
mixedbread-ai/mxbai-rerank-large-v1
Text Ranking • Updated • 121k • 142