-
StarVector: Generating Scalable Vector Graphics Code from Images
Paper • 2312.11556 • Published • 38 -
Jack of All Tasks, Master of Many: Designing General-purpose Coarse-to-Fine Vision-Language Model
Paper • 2312.12423 • Published • 13 -
SCEdit: Efficient and Controllable Image Diffusion Generation via Skip Connection Editing
Paper • 2312.11392 • Published • 20 -
stabilityai/stable-video-diffusion-img2vid-xt
Image-to-Video • Updated • 231k • 3.27k
Robson Cassio Ribas
rocari
·
AI & ML interests
None yet
Organizations
CV
Agents, Planning & Tools
LLMs
-
ControlLLM: Augment Language Models with Tools by Searching on Graphs
Paper • 2310.17796 • Published • 18 -
Self-RAG: Learning to Retrieve, Generate, and Critique through Self-Reflection
Paper • 2310.11511 • Published • 80 -
upstage/SOLAR-10.7B-Instruct-v1.0
Text Generation • 11B • Updated • 50.2k • 650 -
openchat/openchat-3.5-1210
Text Generation • 7B • Updated • 2.31k • 278
Audio, Speech & Music
-
facebook/seamless-m4t-v2-large
Automatic Speech Recognition • 2B • Updated • 65.8k • 968 -
openai/whisper-large-v3
Automatic Speech Recognition • 2B • Updated • 4.75M • • 5.57k -
jonatasgrosman/whisper-large-pt-cv11
Automatic Speech Recognition • Updated • 11 • 16 -
openai/whisper-large-v2
Automatic Speech Recognition • 2B • Updated • 67.9k • 1.79k
CodeGen
Image Generation
-
StarVector: Generating Scalable Vector Graphics Code from Images
Paper • 2312.11556 • Published • 38 -
Jack of All Tasks, Master of Many: Designing General-purpose Coarse-to-Fine Vision-Language Model
Paper • 2312.12423 • Published • 13 -
SCEdit: Efficient and Controllable Image Diffusion Generation via Skip Connection Editing
Paper • 2312.11392 • Published • 20 -
stabilityai/stable-video-diffusion-img2vid-xt
Image-to-Video • Updated • 231k • 3.27k
LLMs
-
ControlLLM: Augment Language Models with Tools by Searching on Graphs
Paper • 2310.17796 • Published • 18 -
Self-RAG: Learning to Retrieve, Generate, and Critique through Self-Reflection
Paper • 2310.11511 • Published • 80 -
upstage/SOLAR-10.7B-Instruct-v1.0
Text Generation • 11B • Updated • 50.2k • 650 -
openchat/openchat-3.5-1210
Text Generation • 7B • Updated • 2.31k • 278
CV
Audio, Speech & Music
-
facebook/seamless-m4t-v2-large
Automatic Speech Recognition • 2B • Updated • 65.8k • 968 -
openai/whisper-large-v3
Automatic Speech Recognition • 2B • Updated • 4.75M • • 5.57k -
jonatasgrosman/whisper-large-pt-cv11
Automatic Speech Recognition • Updated • 11 • 16 -
openai/whisper-large-v2
Automatic Speech Recognition • 2B • Updated • 67.9k • 1.79k
Agents, Planning & Tools
CodeGen