reviewing
TofuEval: Evaluating Hallucinations of LLMs on Topic-Focused Dialogue Summarization
Paper
• 2402.13249
• Published • 15
The FinBen: An Holistic Financial Benchmark for Large Language Models
Paper
• 2402.12659
• Published • 24
Instruction-tuned Language Models are Better Knowledge Learners
Paper
• 2402.12847
• Published • 26
Synthetic Data (Almost) from Scratch: Generalized Instruction Tuning for Language Models
Paper
• 2402.13064
• Published • 51
LongRoPE: Extending LLM Context Window Beyond 2 Million Tokens
Paper
• 2402.13753
• Published • 116
User-LLM: Efficient LLM Contextualization with User Embeddings
Paper
• 2402.13598
• Published • 21
Coercing LLMs to do and reveal (almost) anything
Paper
• 2402.14020
• Published • 13
Ouroboros: Speculative Decoding with Large Model Enhanced Drafting
Paper
• 2402.13720
• Published • 7
FinTral: A Family of GPT-4 Level Multimodal Financial Large Language Models
Paper
• 2402.10986
• Published • 82
Speculative Streaming: Fast LLM Inference without Auxiliary Models
Paper
• 2402.11131
• Published • 42
AnyGPT: Unified Multimodal LLM with Discrete Sequence Modeling
Paper
• 2402.12226
• Published • 45
Paper
• 2402.12219
• Published • 17
OneBit: Towards Extremely Low-bit Large Language Models
Paper
• 2402.11295
• Published • 24
LongAgent: Scaling Language Models to 128k Context through Multi-Agent Collaboration
Paper
• 2402.11550
• Published • 19
CoLLaVO: Crayon Large Language and Vision mOdel
Paper
• 2402.11248
• Published • 22
GLoRe: When, Where, and How to Improve LLM Reasoning via Global and Local Refinements
Paper
• 2402.10963
• Published • 12
Vision-Flan: Scaling Human-Labeled Tasks in Visual Instruction Tuning
Paper
• 2402.11690
• Published • 9
Linear Transformers with Learnable Kernel Functions are Better In-Context Models
Paper
• 2402.10644
• Published • 81
In Search of Needles in a 10M Haystack: Recurrent Memory Finds What LLMs Miss
Paper
• 2402.10790
• Published • 42
SPAR: Personalized Content-Based Recommendation via Long Engagement Attention
Paper
• 2402.10555
• Published • 35
DataDreamer: A Tool for Synthetic Data Generation and Reproducible LLM Workflows
Paper
• 2402.10379
• Published • 31
LAVE: LLM-Powered Agent Assistance and Language Augmentation for Video Editing
Paper
• 2402.10294
• Published • 27
LLM Comparator: Visual Analytics for Side-by-Side Evaluation of Large Language Models
Paper
• 2402.10524
• Published • 23
Large Language Models as Zero-shot Dialogue State Tracker through Function Calling
Paper
• 2402.10466
• Published • 18
OpenCodeInterpreter: Integrating Code Generation with Execution and Refinement
Paper
• 2402.14658
• Published • 84
AgentScope: A Flexible yet Robust Multi-Agent Platform
Paper
• 2402.14034
• Published • 13
OmniPred: Language Models as Universal Regressors
Paper
• 2402.14547
• Published • 14
TinyLLaVA: A Framework of Small-scale Large Multimodal Models
Paper
• 2402.14289
• Published • 20
Scaling Up LLM Reviews for Google Ads Content Moderation
Paper
• 2402.14590
• Published • 9
Copilot Evaluation Harness: Evaluating LLM-Guided Software Programming
Paper
• 2402.14261
• Published • 10
Linear Transformers are Versatile In-Context Learners
Paper
• 2402.14180
• Published • 7