why print rightarrow

#61
by wangtf-Kevin - opened

always print "$\rightarrow" in my claude code, use vllm v0.19.1rc1.dev64+g62095e82c in A100 80G x2

Hi @wangtf-Kevin -

If you have a specific question, feedback, or issue related to this model, please feel free to share more details. Weโ€™d be happy to help!

Hi @wangtf-Kevin -

If you have a specific question, feedback, or issue related to this model, please feel free to share more details. Weโ€™d be happy to help!

That was a specific question or issue.

He's asking why your model, in it's CoT block, goes "(\rightarrow)" to mark logic chains like this:

User is asking question (\rightarrow) google staff ignores it (\rightarrow) Model is sad.

It's not a problem per say, but it's weird enough to be worth asking why it's the case, as an emoji or a "->" or literally anything else would have made more sense. Is that a special token that was trained / used for logic chains?

hi @sonali-kumari11 @SerialKicked
I think it may be due to the use of Chinese. Please note the printed information : $\rightarrow$

I use gemma4 to analyze MinerU ๏ผŒset default Chinese in Claude code cli use vs code terminal, the print like :
ask:

ๆ‰พๅ‡บ mineru-models-download -s modelscope -m all ้œ€่ฆไธ‹่ฝฝ็š„ๆ‰€ๆœ‰ๆจกๅž‹
โง‰ Selected 1 lines from docker/china/Dockerfile in Visual Studio Code

anwser:
โ— ้€š่ฟ‡ๅˆ†ๆž mineru/cli/models_download.py ็š„ๆบ็ ๏ผŒๆ‰ง่กŒ mineru-models-download -s
modelscope -m all ไผšไธ‹่ฝฝไธค็ฑปๆจกๅž‹๏ผšPipeline ๆจกๅž‹ ๅ’Œ VLM ๆจกๅž‹ใ€‚

ไปฅไธ‹ๆ˜ฏ่ฏฆ็ป†็š„ๆจกๅž‹ๆธ…ๅ•๏ผš

  1. Pipeline ๆจกๅž‹ (Pipeline Models)

ไปฃ็ ไธญ็š„ download_pipeline_models()
ๅ‡ฝๆ•ฐๅฎšไน‰ไบ†้œ€่ฆไธ‹่ฝฝ็š„ๆจกๅž‹ๅˆ—่กจใ€‚่ฟ™ไบ›ๆจกๅž‹ไธป่ฆ่ดŸ่ดฃ็‰ˆ้ขๅˆ†ๆžใ€ๅ…ฌๅผ่ฏ†ๅˆซใ€่กจๆ ผ่ฏ†ๅˆซๅ’Œ
OCRใ€‚

้œ€่ฆไธ‹่ฝฝ็š„ๅ…ทไฝ“ๆจกๅž‹๏ผˆๅฏนๅบ” ModelPath ๆžšไธพ๏ผ‰ๅŒ…ๆ‹ฌ๏ผš

  • pp_doclayout_v2: ๆ–‡ๆกฃ็‰ˆ้ขๅˆ†ๆžๆจกๅž‹ (Layout Analysis)
  • unimernet_small: ๅ…ฌๅผ่ฏ†ๅˆซๆจกๅž‹ (Formula Recognition)
  • pytorch_paddle: OCR ๅŸบ็ก€ๆจกๅž‹ (OCR)
  • slanet_plus: ่กจๆ ผ็ป“ๆž„่ฏ†ๅˆซๆจกๅž‹ (Table Structure Recognition)
  • unet_structure: ่กจๆ ผ็ป“ๆž„่ฏ†ๅˆซๆจกๅž‹ (Table Structure Recognition)
  • paddle_table_cls: ่กจๆ ผๅˆ†็ฑปๆจกๅž‹ (Table Classification)
  • paddle_orientation_classification: ๆ–‡ๆœฌๆ–นๅ‘ๅˆ†็ฑปๆจกๅž‹ (Orientation
    Classification)
  • pp_formulanet_plus_m: ๅ…ฌๅผ่ฏ†ๅˆซๅขžๅผบๆจกๅž‹ (Formula Recognition)
  1. VLM ๆจกๅž‹ (VLM Models)

ไปฃ็ ไธญ็š„ download_vlm_models() ๅ‡ฝๆ•ฐ่ดŸ่ดฃไธ‹่ฝฝ VLM (Vision-Language Model)
็›ธๅ…ณ็š„ๆจกๅž‹ใ€‚

  • VLM ๆƒ้‡ๆ–‡ไปถ: ่ฏฅๅ‡ฝๆ•ฐ่ฐƒ็”จ auto_download_and_get_model_root_path("/",
    repo_mode='vlm')๏ผŒๅฎƒไผšไธ‹่ฝฝไธ€ไธชๅฎŒๆ•ด็š„ VLM
    ๆจกๅž‹ๆƒ้‡ๅบ“๏ผˆ้€šๅธธๆ˜ฏไธ€ไธชๅคงๅž‹็š„ๆจกๅž‹ๆ–‡ไปถๅคน๏ผ‰๏ผŒ็”จไบŽ้ซ˜็ฒพๅบฆ็š„็ซฏๅˆฐ็ซฏ่งฃๆžใ€‚

ๆ€ป็ป“

ๅฝ“ไฝ ่ฟ่กŒ -m all ๆ—ถ๏ผŒ็จ‹ๅบไผšไพๆฌกๆ‰ง่กŒไธŠ่ฟฐไธคไธชไธ‹่ฝฝๅ‡ฝๆ•ฐใ€‚

ไธ‹่ฝฝๆต็จ‹ๆ‘˜่ฆ๏ผš

  1. Pipeline ็ป„ $\rightarrow$ ไธ‹่ฝฝ 8 ไธชไธ“้กนๆจกๅž‹ $\rightarrow$ ๅ†™ๅ…ฅ mineru.json
    ็š„ models-dir.pipelineใ€‚
  2. VLM ็ป„ $\rightarrow$ ไธ‹่ฝฝ VLM ๆƒ้‡ๅบ“ $\rightarrow$ ๅ†™ๅ…ฅ mineru.json ็š„
    models-dir.vlmใ€‚

ๅฆ‚ๆžœไฝ ๆ‰“็ฎ—ๆ‰‹ๅŠจไธ‹่ฝฝ๏ผŒไฝ ้œ€่ฆ็กฎไฟไธŠ่ฟฐๆ‰€ๆœ‰ๆจกๅž‹้ƒฝ่ขซไธ‹่ฝฝๅˆฐๆœฌๅœฐ๏ผŒๅนถไธ”ๅœจ mineru.json
ไธญๆญฃ็กฎ้…็ฝฎ่ฟ™ไธคไธช็›ฎๅฝ•็š„่ทฏๅพ„ใ€‚

ask:
โฏ use English Re-answer

answer๏ผš
โ— Based on the analysis of the source code in mineru/cli/models_download.py, executing the command mineru-models-download -s modelscope -m all triggers the download of two main categories of models: Pipeline Models and VLM Models.

Here is the detailed breakdown of the models that will be downloaded:

  1. Pipeline Models

The download_pipeline_models() function defines a list of specialized models used for layout analysis, formula recognition, table recognition, and OCR.

The specific models downloaded (corresponding to the ModelPath enum) are:

  • pp_doclayout_v2: Document Layout Analysis model.
  • unimernet_small: Formula Recognition model.
  • pytorch_paddle: Base OCR models.
  • slanet_plus: Table Structure Recognition model.
  • unet_structure: Table Structure Recognition model.
  • paddle_table_cls: Table Classification model.
  • paddle_orientation_classification: Text Orientation Classification model.
  • pp_formulanet_plus_m: Enhanced Formula Recognition model.
  1. VLM Models

The download_vlm_models() function handles the download of the Vision-Language Model (VLM) weights.

  • VLM Weight Library: This function calls auto_download_and_get_model_root_path("/", repo_mode='vlm'), which downloads a comprehensive VLM model weights directory used for high-accuracy end-to-end parsing.

Summary Table

โ”Œโ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”ฌโ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”ฌโ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”ฌโ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”
โ”‚ Model Category โ”‚ Scope โ”‚ Primary Purpose โ”‚ Config Key in mineru.json โ”‚
โ”œโ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”ผโ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”ผโ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”ผโ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”ค
โ”‚ Pipeline โ”‚ 8 Specialized Models โ”‚ Layout, OCR, Formula, and Table Recognition โ”‚ models-dir.pipeline โ”‚
โ”œโ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”ผโ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”ผโ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”ผโ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”ค
โ”‚ VLM โ”‚ VLM Weight Library โ”‚ High-precision end-to-end multimodal parsing โ”‚ models-dir.vlm โ”‚
โ””โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”ดโ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”ดโ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”ดโ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”˜

Execution Workflow:
When the -m all flag is used, the program sequentially executes both download_pipeline_models() and download_vlm_models(). Once completed, it automatically updates your mineru.json configuration file with the local paths to these downloaded models.

Interesting how it's removed from your English translated version. I use the model only in English, and while I've seen $\rightarrow$ surface plenty of times in English, it was always in the thinking block, never in the responses.

Now that I think about it, I noticed the rightarrow thing in the early GGUF versions on early llama.cpp implementation, but I don't remember seeing it recently. But I haven't paid that much attention to it either (don't often read the thinking block).

Edit: Correction, I still see it in modern GGUF and backends. Both in response and thinking block. It's just not super common.

Sign up or log in to comment