Commit 13963a7 (verified), committed by ivnle
1 Parent(s): 3ea4545

Upload README.md with huggingface_hub

Files changed (1): README.md (+4 -2)
@@ -29,6 +29,8 @@ Naming convention: `{regime}_{config}_h{N}_{objective}[_recon-init]`
 | `vision_base_h0_recon` | Vision base | 3.60 | 1.03 |
 | `meanpool_w4s4_h0_recon` | Meanpool w4s4 | 3.97 | 1.04 |
 | `conv1d_t250_h0_recon` | Conv1D t250 | 3.97 | 1.00 |
+| `vision_tiny_h0_recon` | Vision tiny | 12.82 | 1.14 |
+| `conv1d_t63_h0_recon` | Conv1D t63 | 15.38 | 1.01 |
 
 ### Language Modeling
 
@@ -43,10 +45,10 @@ Naming convention: `{regime}_{config}_h{N}_{objective}[_recon-init]`
 ## Model Details
 
 - **Architecture**: DeepSeek-OCR with vision encoder
-- **Vision checkpoints**: Trained encoder, 768x768 (base)
+- **Vision checkpoints**: Trained encoder (base=768x768, tiny=384x384)
 - **Text checkpoints**: Truncation baseline (no vision encoder), context=277 tokens
 - **Meanpool checkpoints**: Frozen encoder, window=4, stride=4
-- **Conv1D checkpoints**: Trained hierarchical encoder, target=250 tokens
+- **Conv1D checkpoints**: Trained hierarchical encoder (t250=CR 3.97, t63=CR 15.38)
 - **Dataset**: 510k samples from FineWiki
 
 ## Usage
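
The naming convention in the hunk header, `{regime}_{config}_h{N}_{objective}[_recon-init]`, can be split into its fields mechanically. The following is a minimal sketch, not code from the repository: the regex, the function name `parse_checkpoint_name`, and the assumption that `regime`, `config`, and `objective` never contain underscores are all inferred from the five checkpoint names visible in this diff.

```python
import re

# Assumed pattern, inferred only from the naming convention shown in the diff:
#   {regime}_{config}_h{N}_{objective}[_recon-init]
# Assumption: regime, config, and objective contain no underscores.
NAME_RE = re.compile(
    r"^(?P<regime>[a-z0-9]+)"
    r"_(?P<config>[a-z0-9]+)"
    r"_h(?P<h>\d+)"
    r"_(?P<objective>[a-z0-9]+)"
    r"(?P<recon_init>_recon-init)?$"
)


def parse_checkpoint_name(name):
    """Split a checkpoint name into its fields (hypothetical helper)."""
    match = NAME_RE.match(name)
    if match is None:
        raise ValueError(f"name does not follow the assumed convention: {name!r}")
    return {
        "regime": match["regime"],
        "config": match["config"],
        "h": int(match["h"]),
        "objective": match["objective"],
        # True only when the optional `_recon-init` suffix is present.
        "recon_init": match["recon_init"] is not None,
    }


if __name__ == "__main__":
    # The two checkpoint names added by this commit.
    for name in ("vision_tiny_h0_recon", "conv1d_t63_h0_recon"):
        print(name, parse_checkpoint_name(name))
```

Names that fall outside the assumed pattern raise `ValueError`, so a checkpoint whose `config` uses a different shape would need the regex adjusted.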