Models
Datasets
Spaces
Buckets new
Docs
Enterprise
Pricing
Log In
Sign Up

Robson Cassio Ribas's picture

40

Robson Cassio Ribas

rocari

·

rocari

AI & ML interests

None yet

Organizations

rocari 's collections 6

Image Generation

StarVector: Generating Scalable Vector Graphics Code from Images

Paper • 2312.11556 • Published Dec 17, 2023 • 38
Jack of All Tasks, Master of Many: Designing General-purpose Coarse-to-Fine Vision-Language Model

Paper • 2312.12423 • Published Dec 19, 2023 • 13
SCEdit: Efficient and Controllable Image Diffusion Generation via Skip Connection Editing

Paper • 2312.11392 • Published Dec 18, 2023 • 20
stabilityai/stable-video-diffusion-img2vid-xt

Image-to-Video • Updated Jul 10, 2024 • 231k • 3.27k

Set-of-Mark Prompting Unleashes Extraordinary Visual Grounding in GPT-4V

Paper • 2310.11441 • Published Oct 17, 2023 • 29
Boundary Attention: Learning to Find Faint Boundaries at Any Resolution

Paper • 2401.00935 • Published Jan 1, 2024 • 18

Agents, Planning & Tools

Nexusflow/NexusRaven-V2-13B

Text Generation • 13B • Updated May 1, 2025 • 80 • 470
ShowUI: One Vision-Language-Action Model for GUI Visual Agent

Paper • 2411.17465 • Published Nov 26, 2024 • 90

ControlLLM: Augment Language Models with Tools by Searching on Graphs

Paper • 2310.17796 • Published Oct 26, 2023 • 18
Self-RAG: Learning to Retrieve, Generate, and Critique through Self-Reflection

Paper • 2310.11511 • Published Oct 17, 2023 • 80
upstage/SOLAR-10.7B-Instruct-v1.0

Text Generation • 11B • Updated Sep 10, 2024 • 50.2k • 650
openchat/openchat-3.5-1210

Text Generation • 7B • Updated May 18, 2024 • 2.31k • 278

Audio, Speech & Music

facebook/seamless-m4t-v2-large

Automatic Speech Recognition • 2B • Updated Jan 4, 2024 • 65.8k • 968
openai/whisper-large-v3

Automatic Speech Recognition • 2B • Updated Aug 12, 2024 • 4.75M • • 5.57k
jonatasgrosman/whisper-large-pt-cv11

Automatic Speech Recognition • Updated Dec 22, 2022 • 11 • 16
openai/whisper-large-v2

Automatic Speech Recognition • 2B • Updated Feb 29, 2024 • 67.9k • 1.79k

ise-uiuc/Magicoder-S-DS-6.7B

Text Generation • 7B • Updated Mar 6, 2024 • 1.16k • 205
deepseek-ai/deepseek-coder-33b-instruct

Text Generation • 33B • Updated Mar 7, 2024 • 7.6k • 566
Phind/Phind-CodeLlama-34B-v2

Text Generation • Updated Aug 28, 2023 • 1.88k • 833

Image Generation

StarVector: Generating Scalable Vector Graphics Code from Images

Paper • 2312.11556 • Published Dec 17, 2023 • 38
Jack of All Tasks, Master of Many: Designing General-purpose Coarse-to-Fine Vision-Language Model

Paper • 2312.12423 • Published Dec 19, 2023 • 13
SCEdit: Efficient and Controllable Image Diffusion Generation via Skip Connection Editing

Paper • 2312.11392 • Published Dec 18, 2023 • 20
stabilityai/stable-video-diffusion-img2vid-xt

Image-to-Video • Updated Jul 10, 2024 • 231k • 3.27k

ControlLLM: Augment Language Models with Tools by Searching on Graphs

Paper • 2310.17796 • Published Oct 26, 2023 • 18
Self-RAG: Learning to Retrieve, Generate, and Critique through Self-Reflection

Paper • 2310.11511 • Published Oct 17, 2023 • 80
upstage/SOLAR-10.7B-Instruct-v1.0

Text Generation • 11B • Updated Sep 10, 2024 • 50.2k • 650
openchat/openchat-3.5-1210

Text Generation • 7B • Updated May 18, 2024 • 2.31k • 278

Set-of-Mark Prompting Unleashes Extraordinary Visual Grounding in GPT-4V

Paper • 2310.11441 • Published Oct 17, 2023 • 29
Boundary Attention: Learning to Find Faint Boundaries at Any Resolution

Paper • 2401.00935 • Published Jan 1, 2024 • 18

Audio, Speech & Music

facebook/seamless-m4t-v2-large

Automatic Speech Recognition • 2B • Updated Jan 4, 2024 • 65.8k • 968
openai/whisper-large-v3

Automatic Speech Recognition • 2B • Updated Aug 12, 2024 • 4.75M • • 5.57k
jonatasgrosman/whisper-large-pt-cv11

Automatic Speech Recognition • Updated Dec 22, 2022 • 11 • 16
openai/whisper-large-v2

Automatic Speech Recognition • 2B • Updated Feb 29, 2024 • 67.9k • 1.79k

Agents, Planning & Tools

Nexusflow/NexusRaven-V2-13B

Text Generation • 13B • Updated May 1, 2025 • 80 • 470
ShowUI: One Vision-Language-Action Model for GUI Visual Agent

Paper • 2411.17465 • Published Nov 26, 2024 • 90

ise-uiuc/Magicoder-S-DS-6.7B

Text Generation • 7B • Updated Mar 6, 2024 • 1.16k • 205
deepseek-ai/deepseek-coder-33b-instruct

Text Generation • 33B • Updated Mar 7, 2024 • 7.6k • 566
Phind/Phind-CodeLlama-34B-v2

Text Generation • Updated Aug 28, 2023 • 1.88k • 833

Company

TOS Privacy About Careers

Website

Models Datasets Spaces Pricing Docs