Collections
Collections including paper arxiv:2410.02525
- openai/whisper-large-v3-turbo
  Automatic Speech Recognition • 0.8B • Updated • 6.63M • 2.95k
- jasperai/Flux.1-dev-Controlnet-Upscaler
  Image-to-Image • Updated • 2.83k • 865
- DocGraphLM: Documental Graph Language Model for Information Extraction
  Paper • 2401.02823 • Published • 36
- Contextual Document Embeddings
  Paper • 2410.02525 • Published • 24

- Knowing When to Ask -- Bridging Large Language Models and Data
  Paper • 2409.13741 • Published • 1
- A Benchmark to Understand the Role of Knowledge Graphs on Large Language Model's Accuracy for Question Answering on Enterprise SQL Databases
  Paper • 2311.07509 • Published • 2
- Increasing the LLM Accuracy for Question Answering: Ontologies to the Rescue!
  Paper • 2405.11706 • Published • 2
- Retrieval-Augmented Generation with Knowledge Graphs for Customer Service Question Answering
  Paper • 2404.17723 • Published • 2

- What You Say = What You Want? Teaching Humans to Articulate Requirements for LLMs
  Paper • 2409.08775 • Published
- OmniQuery: Contextually Augmenting Captured Multimodal Memory to Enable Personal Question Answering
  Paper • 2409.08250 • Published • 1
- Synthetic continued pretraining
  Paper • 2409.07431 • Published • 5
- WonderWorld: Interactive 3D Scene Generation from a Single Image
  Paper • 2406.09394 • Published • 3

- MM1: Methods, Analysis & Insights from Multimodal LLM Pre-training
  Paper • 2403.09611 • Published • 129
- Evolutionary Optimization of Model Merging Recipes
  Paper • 2403.13187 • Published • 58
- MobileVLM V2: Faster and Stronger Baseline for Vision Language Model
  Paper • 2402.03766 • Published • 15
- LLM Agent Operating System
  Paper • 2403.16971 • Published • 73