Add comprehensive model card with MLX inference examples

#1
by YingxuHe - opened
MERaLiON org

Adds a detailed model card, adapted from the original MERaLiON-2-3B README, that includes:

  • MLX format info (fp16, no quantization)
  • MLX inference code examples
  • No-repeat n-gram blocking documentation
  • Supported task prompts
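
For readers unfamiliar with no-repeat n-gram blocking, here is a minimal sketch of the general technique in plain Python. It is not taken from the model card or the conversion repository; the function names `banned_next_tokens` and `mask_logits` are hypothetical and chosen only for illustration. The idea is that, before sampling each new token, any token that would complete an n-gram already present in the generated sequence has its logit set to negative infinity.

```python
from typing import List, Set


def banned_next_tokens(tokens: List[int], n: int) -> Set[int]:
    """Return token ids that would complete an n-gram already present in `tokens`.

    Hypothetical helper for illustration; not the repository's implementation.
    """
    if n <= 0 or len(tokens) < n:
        # No complete n-gram exists yet, so nothing can be repeated.
        return set()
    # The last n-1 tokens are the prefix that the next token would extend.
    prefix = tuple(tokens[len(tokens) - (n - 1):]) if n > 1 else ()
    banned: Set[int] = set()
    # Scan every n-gram seen so far; if its first n-1 tokens match the
    # current prefix, its final token must not be generated next.
    for i in range(len(tokens) - n + 1):
        if tuple(tokens[i:i + n - 1]) == prefix:
            banned.add(tokens[i + n - 1])
    return banned


def mask_logits(logits: List[float], banned: Set[int]) -> List[float]:
    """Set the logits of banned token ids to -inf so they cannot be selected."""
    return [float("-inf") if i in banned else x for i, x in enumerate(logits)]


# Usage: with history [1, 2, 3, 1, 2] and n = 3, the trigram (1, 2, 3) has
# already occurred, so token 3 is blocked as the next token.
history = [1, 2, 3, 1, 2]
blocked = banned_next_tokens(history, n=3)
masked = mask_logits([0.1, 0.2, 0.3, 0.4], blocked)
```

In a real decoding loop the masking would be applied to the model's logits array at every step; the list-based version above only shows the bookkeeping.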

Source: https://github.com/YingxuH/mlx_conversion

YingxuHe changed pull request status to merged