yuhangzang commited on
Commit
ed62caa
Β·
verified Β·
1 Parent(s): f75d711

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +3 -3
README.md CHANGED
@@ -16,14 +16,14 @@ library_name: diffusers
16
  ---
17
 
18
  # ETCHR-FLUX.2-klein-9B
19
- <p align="center" style="font-size: 1.2em; margin-top: 0.5em">
20
- πŸ“–<a href="https://arxiv.org/abs/">Paper</a>
21
  | 🏠<a href="https://github.com/InternLM/ETCHR">Homepage</a >
22
  | πŸ€—<a href="https://huggingface.co/internlm/ETCHR-FLUX.2-klein-9B">ETCHR-FLUX.2-klein-9B Model</a >
23
  | πŸ€—<a href="https://huggingface.co/datasets/internlm/ETCHR-SFT-400K">ETCHR SFT-400K Dataset</a >
24
  | πŸ€—<a href="https://huggingface.co/datasets/internlm/ETCHR-GRPO-10K">ETCHR GRPO-10K Dataset</a >
25
  | πŸ€—<a href="https://huggingface.co/datasets/internlm/DL3DV-2k">DL3DV-2K Benchmark</a >
26
- </p >
27
  ETCHR-FLUX.2-klein-9B is a novel question-conditioned, reasoning-aware image editor designed to serve as a decoupled visual reasoning assistant for Multimodal Large Language Models. By decoupling the specialized image editor from the downstream understanding model, ETCHR bridges the critical bottleneck where a purely textual chain of thought fails in fine-grained focus or complex spatial transformations.
28
 
29
 
 
16
  ---
17
 
18
  # ETCHR-FLUX.2-klein-9B
19
+
20
+ πŸ“–<a href="https://arxiv.org/abs/">Paper</a>
21
  | 🏠<a href="https://github.com/InternLM/ETCHR">Homepage</a >
22
  | πŸ€—<a href="https://huggingface.co/internlm/ETCHR-FLUX.2-klein-9B">ETCHR-FLUX.2-klein-9B Model</a >
23
  | πŸ€—<a href="https://huggingface.co/datasets/internlm/ETCHR-SFT-400K">ETCHR SFT-400K Dataset</a >
24
  | πŸ€—<a href="https://huggingface.co/datasets/internlm/ETCHR-GRPO-10K">ETCHR GRPO-10K Dataset</a >
25
  | πŸ€—<a href="https://huggingface.co/datasets/internlm/DL3DV-2k">DL3DV-2K Benchmark</a >
26
+
27
  ETCHR-FLUX.2-klein-9B is a novel question-conditioned, reasoning-aware image editor designed to serve as a decoupled visual reasoning assistant for Multimodal Large Language Models. By decoupling the specialized image editor from the downstream understanding model, ETCHR bridges the critical bottleneck where a purely textual chain of thought fails in fine-grained focus or complex spatial transformations.
28
 
29