Add v3 thinking control patch - task-aware system prompts + think efficiency reward 0f39df7 verified rtferraz commited on 15 days ago