rookiexiong commited on
Commit
2ebb24a
·
verified ·
1 Parent(s): 7986555

Create README.md

Browse files
Files changed (1) hide show
  1. README.md +48 -0
README.md ADDED
@@ -0,0 +1,48 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ license: apache-2.0
3
+ language:
4
+ - en
5
+ library_name: transformers
6
+ pipeline_tag: image-segmentation
7
+ tags:
8
+ - referring-segmentation
9
+ - image-segmentation
10
+ - video-segmentation
11
+ - vision-language
12
+ ---
13
+
14
+ # SetCon-8B
15
+
16
+ SetCon-8B is the model checkpoint for **SetCon: Towards Open-Ended Referring Segmentation via Set-Level Concept Prediction**.
17
+
18
+ ## Usage
19
+
20
+ Please use this checkpoint together with the official codebase:
21
+
22
+ ```bash
23
+ git clone https://github.com/rookiexiong7/SetCon.git
24
+ cd SetCon
25
+ uv sync --extra latest
26
+ source .venv/bin/activate
27
+ ```
28
+
29
+ Single-image inference:
30
+ ```
31
+ python demo.py \
32
+ --image-path assets/room.jpg \
33
+ --query-text "the target objects" \
34
+ --model-path path/to/SetCon-8B
35
+ ```
36
+
37
+ ## Intended Use
38
+
39
+ This model is intended for research on open-ended referring image/video segmentation.
40
+
41
+ ## Limitations
42
+
43
+ The model may produce incomplete or inaccurate masks for ambiguous expressions, small objects, crowded scenes, or out-of-domain visual
44
+ concepts.
45
+
46
+ ## Citation
47
+
48
+