YAML Metadata Warning:empty or missing yaml metadata in repo card

Check out the documentation for more information.

Generated with:

CUDA_VISIBLE_DEVICES=5 python train_rotation.py \
    --model_dir /models/Qwen_Qwen3-0.6B \
    --quant_scheme mxfp4 \
    --eval_batch_size 16 \
    --num_samples 4000 \
    --learning_rate 1.5 \
    --rotation_algo_config_file ${ROTATION_TRAIN_CFG} \
    --output_dir qwen3-0.6b-mxfp4-tuned-orthogonal \
    --max_steps 5 \
    --loss_type kl_top_1000 \
    --model_attn_implementation sdpa \
    --train_batch_size 8 \
    --model_export hf_format \
    --export_weight_format real_quantized
Downloads last month
11
Safetensors
Model size
0.5B params
Tensor type
F64
BF16
U8
Inference Providers NEW
This model isn't deployed by any Inference Provider. 馃檵 Ask for provider support