torch transformers datasets gradio>=6.6.0 accelerate bitsandbytes peft trl evaluate rouge_score