demo: add 5 quick test cases + grader breakdown panel + Show JSON

#1
by anugrahhu - opened
  • new agents: antioracle, lazy_antioracle, spammer (deliberately bad,
    to demonstrate the grader's penalty components firing)
  • 5 preset Quick Test Cases in Tab 1 (3 positive, 2 clearly negative)
  • Grader breakdown markdown panel showing per-component reward
    (decision_accuracy / evidence_coverage / penalty / etc.)
  • Working 'Show observation JSON' button + gr.JSON output in Tab 2
  • Raw observation JSON accordion in Tab 1
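
As a rough illustration of the grader breakdown panel, here is a minimal sketch of how per-component rewards could be rendered as markdown. The function name `render_breakdown`, the example values, and the assumption that the total is a plain sum of components are all hypothetical and not taken from the actual commit; only the component names come from the list above.

```python
from typing import Dict


def render_breakdown(components: Dict[str, float]) -> str:
    """Render per-component rewards as a markdown table with a total row.

    Assumption: the overall reward is the plain sum of the components.
    The real grader may weight or clip them differently.
    """
    lines = ["| component | reward |", "|---|---|"]
    for name, value in components.items():
        # Signed formatting makes penalty components stand out.
        lines.append(f"| {name} | {value:+.2f} |")
    total = sum(components.values())
    lines.append(f"| **total** | **{total:+.2f}** |")
    return "\n".join(lines)


# Hypothetical values for a deliberately bad agent.
example = {
    "decision_accuracy": 0.5,
    "evidence_coverage": 0.25,
    "penalty": -0.5,
}
panel_md = render_breakdown(example)
print(panel_md)
```

The resulting string could be passed to a markdown display component in the demo UI; how the actual demo wires this up is not shown here.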

Superseded: the same change was pushed directly to main as 505bf67.

anugrahteesdollar changed pull request status to closed