rtferraz
docs: add V4.1 run report - detailed evaluation with per-task analysis and V4.2 roadmap
482efc4 verified