Generated with:
CUDA_VISIBLE_DEVICES=5 python train_rotation.py \
--model_dir /models/Qwen_Qwen3-0.6B \
--quant_scheme fp8 \
--eval_batch_size 16 \
--num_samples 4000 \
--learning_rate 1.5 \
--rotation_algo_config_file ${ROTATION_TRAIN_CFG} \
--output_dir qwen3-0.6b-fp8-tuned-orthogonal \
--max_steps 5 \
--loss_type kl_top_1000 \
--model_attn_implementation sdpa \
--train_batch_size 8 \
--model_export hf_format \
--export_weight_format real_quantized
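The command above reads the rotation config path from `${ROTATION_TRAIN_CFG}`, so that variable has to be set in the shell before the run. A minimal sketch of one way to do that is shown here; the fallback path `./rotation_config.json` is a hypothetical placeholder, not the file used for this model, and should be replaced with the actual config.

```shell
# Keep any value already exported; otherwise fall back to a placeholder.
# NOTE: ./rotation_config.json is a hypothetical path, not taken from this repo.
ROTATION_TRAIN_CFG="${ROTATION_TRAIN_CFG:-./rotation_config.json}"
export ROTATION_TRAIN_CFG

# Warn early if the file is missing, before launching the training script.
if [ ! -f "$ROTATION_TRAIN_CFG" ]; then
  echo "warning: rotation config not found: $ROTATION_TRAIN_CFG" >&2
fi
```

Checking the file up front is only a convenience: it surfaces a wrong path before the training script starts loading the model.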