chanderbalaji commited on
Commit
f800a92
·
verified ·
1 Parent(s): 998dd2b

Upload README.md with huggingface_hub

Browse files
Files changed (1) hide show
  1. README.md +9 -9
README.md CHANGED
@@ -21,17 +21,17 @@ This repository contains an MLX-compatible 4-bit version of [`internlm/Intern-S2
21
 
22
  ```bash
23
  python -m mlx_lm generate \
24
- --model chanderbalaji/Intern-S2-Preview-FP8-MLX-4bit \
25
  --trust-remote-code \
26
  --prompt "Explain superconductivity in one paragraph." \
27
  --max-tokens 4096
28
  ```
29
 
30
- For local paths:
31
 
32
  ```bash
33
- /Users/shikharpant/mlx-vlm-test/bin/python -m mlx_lm generate \
34
- --model /Users/shikharpant/models/Intern-S2-Preview-FP8-MLX-4bit \
35
  --trust-remote-code \
36
  --prompt "Explain superconductivity in one paragraph." \
37
  --max-tokens 4096
@@ -39,15 +39,15 @@ For local paths:
39
 
40
  ## Local Benchmark
41
 
42
- Benchmarks were run locally with `mlx_lm generate` on Apple Silicon using the MLX environment at `/Users/shikharpant/mlx-vlm-test`.
43
 
44
  ### Basic Generation
45
 
46
  Command:
47
 
48
  ```bash
49
- /Users/shikharpant/mlx-vlm-test/bin/python -m mlx_lm generate \
50
- --model /Users/shikharpant/models/Intern-S2-Preview-FP8-MLX-4bit \
51
  --trust-remote-code \
52
  --prompt "Explain superconductivity in one paragraph." \
53
  --max-tokens 4096
@@ -68,8 +68,8 @@ Observed output stats:
68
  Command:
69
 
70
  ```bash
71
- /Users/shikharpant/mlx-vlm-test/bin/python -m mlx_lm generate \
72
- --model /Users/shikharpant/models/Intern-S2-Preview-FP8-MLX-4bit \
73
  --trust-remote-code \
74
  --prompt "Do not show reasoning, analysis, thinking process, scratchpad, or <think> text. Output only the final answer. Explain superconductivity in one paragraph." \
75
  --max-tokens 4096
 
21
 
22
  ```bash
23
  python -m mlx_lm generate \
24
+ --model <namespace>/Intern-S2-Preview-FP8-MLX-4bit \
25
  --trust-remote-code \
26
  --prompt "Explain superconductivity in one paragraph." \
27
  --max-tokens 4096
28
  ```
29
 
30
+ For a local checkout:
31
 
32
  ```bash
33
+ python -m mlx_lm generate \
34
+ --model /path/to/Intern-S2-Preview-FP8-MLX-4bit \
35
  --trust-remote-code \
36
  --prompt "Explain superconductivity in one paragraph." \
37
  --max-tokens 4096
 
39
 
40
  ## Local Benchmark
41
 
42
+ Benchmarks were run locally with `mlx_lm generate` on Apple Silicon.
43
 
44
  ### Basic Generation
45
 
46
  Command:
47
 
48
  ```bash
49
+ python -m mlx_lm generate \
50
+ --model /path/to/Intern-S2-Preview-FP8-MLX-4bit \
51
  --trust-remote-code \
52
  --prompt "Explain superconductivity in one paragraph." \
53
  --max-tokens 4096
 
68
  Command:
69
 
70
  ```bash
71
+ python -m mlx_lm generate \
72
+ --model /path/to/Intern-S2-Preview-FP8-MLX-4bit \
73
  --trust-remote-code \
74
  --prompt "Do not show reasoning, analysis, thinking process, scratchpad, or <think> text. Output only the final answer. Explain superconductivity in one paragraph." \
75
  --max-tokens 4096