- Human-inspired Perspectives: A Survey on AI Long-term Memory
  Paper • 2411.00489 • Published • 1
- Multimodal Fusion with LLMs for Engagement Prediction in Natural Conversation
  Paper • 2409.09135 • Published • 2
- Reading Recognition in the Wild
  Paper • 2505.24848 • Published • 1
- EgoLife: Towards Egocentric Life Assistant
  Paper • 2503.03803 • Published • 46
Collections
Collections including paper arxiv:2311.09213
- aMUSEd: An Open MUSE Reproduction
  Paper • 2401.01808 • Published • 31
- From Audio to Photoreal Embodiment: Synthesizing Humans in Conversations
  Paper • 2401.01885 • Published • 28
- SteinDreamer: Variance Reduction for Text-to-3D Score Distillation via Stein Identity
  Paper • 2401.00604 • Published • 6
- LARP: Language-Agent Role Play for Open-World Games
  Paper • 2312.17653 • Published • 33

- GRIM: GRaph-based Interactive narrative visualization for gaMes
  Paper • 2311.09213 • Published • 13
- HeadStudio: Text to Animatable Head Avatars with 3D Gaussian Splatting
  Paper • 2402.06149 • Published • 18
- MusicMagus: Zero-Shot Text-to-Music Editing via Diffusion Models
  Paper • 2402.06178 • Published • 13
- proj-persona/PersonaHub
  Viewer • Updated • 375k • 7.45k • 739

- Visual In-Context Prompting
  Paper • 2311.13601 • Published • 18
- Textbooks Are All You Need
  Paper • 2306.11644 • Published • 154
- AutoGen: Enabling Next-Gen LLM Applications via Multi-Agent Conversation Framework
  Paper • 2308.08155 • Published • 11
- LIDA: A Tool for Automatic Generation of Grammar-Agnostic Visualizations and Infographics using Large Language Models
  Paper • 2303.02927 • Published • 3

- Visual In-Context Prompting
  Paper • 2311.13601 • Published • 18
- AutoGen: Enabling Next-Gen LLM Applications via Multi-Agent Conversation Framework
  Paper • 2308.08155 • Published • 11
- LIDA: A Tool for Automatic Generation of Grammar-Agnostic Visualizations and Infographics using Large Language Models
  Paper • 2303.02927 • Published • 3
- The Impact of Large Language Models on Scientific Discovery: a Preliminary Study using GPT-4
  Paper • 2311.07361 • Published • 14

- GRIM: GRaph-based Interactive narrative visualization for gaMes
  Paper • 2311.09213 • Published • 13
- Text-Guided 3D Face Synthesis -- From Generation to Editing
  Paper • 2312.00375 • Published • 11
- StyleCrafter: Enhancing Stylized Text-to-Video Generation with Style Adapter
  Paper • 2312.00330 • Published • 13
- Gaussian Head Avatar: Ultra High-fidelity Head Avatar via Dynamic Gaussians
  Paper • 2312.03029 • Published • 27

- Levels of AGI for Operationalizing Progress on the Path to AGI
  Paper • 2311.02462 • Published • 36
- Ultra-Long Sequence Distributed Transformer
  Paper • 2311.02382 • Published • 6
- Unifying the Perspectives of NLP and Software Engineering: A Survey on Language Models for Code
  Paper • 2311.07989 • Published • 26
- GRIM: GRaph-based Interactive narrative visualization for gaMes
  Paper • 2311.09213 • Published • 13