docs: add V4.1 run report — detailed evaluation with per-task analysis and V4.2 roadmap 482efc4 verified rtferraz committed 10 days ago