dhruva-sarma 's Collections Natural Language (LLM, NLP etc)
updated
Toward Self-Improvement of LLMs via Imagination, Searching, and
Criticizing
Paper
• 2404.12253
• Published • 55
FlowMind: Automatic Workflow Generation with LLMs
Paper
• 2404.13050
• Published • 34
How Far Can We Go with Practical Function-Level Program Repair?
Paper
• 2404.12833
• Published • 7
Replacing Judges with Juries: Evaluating LLM Generations with a Panel of
Diverse Models
Paper
• 2404.18796
• Published • 71
Prometheus 2: An Open Source Language Model Specialized in Evaluating
Other Language Models
Paper
• 2405.01535
• Published • 124
An Introduction to Vision-Language Modeling
Paper
• 2405.17247
• Published • 90
From RAGs to rich parameters: Probing how language models utilize
external knowledge over parametric information for factual queries
Paper
• 2406.12824
• Published • 21
Tokenization Falling Short: The Curse of Tokenization
Paper
• 2406.11687
• Published • 16
RepLiQA: A Question-Answering Dataset for Benchmarking LLMs on Unseen
Reference Content
Paper
• 2406.11811
• Published • 16
GLiNER multi-task: Generalist Lightweight Model for Various Information
Extraction Tasks
Paper
• 2406.12925
• Published • 25
HARE: HumAn pRiors, a key to small language model Efficiency
Paper
• 2406.11410
• Published • 40
Judging the Judges: Evaluating Alignment and Vulnerabilities in
LLMs-as-Judges
Paper
• 2406.12624
• Published • 37
LongRAG: Enhancing Retrieval-Augmented Generation with Long-context LLMs
Paper
• 2406.15319
• Published • 64
Octo-planner: On-device Language Model for Planner-Action Agents
Paper
• 2406.18082
• Published • 48
WildGuard: Open One-Stop Moderation Tools for Safety Risks, Jailbreaks,
and Refusals of LLMs
Paper
• 2406.18495
• Published • 13
Scaling Synthetic Data Creation with 1,000,000,000 Personas
Paper
• 2406.20094
• Published • 107
SpreadsheetLLM: Encoding Spreadsheets for Large Language Models
Paper
• 2407.09025
• Published • 140
Scaling Laws with Vocabulary: Larger Models Deserve Larger Vocabularies
Paper
• 2407.13623
• Published • 56
Scaling Retrieval-Based Language Models with a Trillion-Token Datastore
Paper
• 2407.12854
• Published • 31
Building and better understanding vision-language models: insights and
future directions
Paper
• 2408.12637
• Published • 133
Text2SQL is Not Enough: Unifying AI and Databases with TAG
Paper
• 2408.14717
• Published • 26
Generative Verifiers: Reward Modeling as Next-Token Prediction
Paper
• 2408.15240
• Published • 13
Towards a Unified View of Preference Learning for Large Language Models:
A Survey
Paper
• 2409.02795
• Published • 72
LLaMA-Omni: Seamless Speech Interaction with Large Language Models
Paper
• 2409.06666
• Published • 60
GroUSE: A Benchmark to Evaluate Evaluators in Grounded Question
Answering
Paper
• 2409.06595
• Published • 38
Not All LLM Reasoners Are Created Equal
Paper
• 2410.01748
• Published • 29
Transformer^2: Self-adaptive LLMs
Paper
• 2501.06252
• Published • 55
MMDocIR: Benchmarking Multi-Modal Retrieval for Long Documents
Paper
• 2501.08828
• Published • 30
MiniMax-01: Scaling Foundation Models with Lightning Attention
Paper
• 2501.08313
• Published • 302
Potential and Perils of Large Language Models as Judges of Unstructured
Textual Data
Paper
• 2501.08167
• Published • 6
MinMo: A Multimodal Large Language Model for Seamless Voice Interaction
Paper
• 2501.06282
• Published • 53
ScoreFlow: Mastering LLM Agent Workflows via Score-based Preference
Optimization
Paper
• 2502.04306
• Published • 20
Matryoshka Representation Learning
Paper
• 2205.13147
• Published • 25