rtferraz/tucano2-commerce

tucano2-commerce
591 kB
  • 2 contributors
History: 35 commits
rtferraz
fix(probe): use TRL 0.24.0 log keys - rewards/commerce_reward_fn/mean, grad_norm (not train/ prefix)
080fd9a verified 27 days ago
  • docs
    Create v4_2-handoff.md 27 days ago
  • notebooks
    fix(probe): use TRL 0.24.0 log keys - rewards/commerce_reward_fn/mean, grad_norm (not train/ prefix) 27 days ago
  • scripts
    tools: add md-to-ipynb converter script about 1 month ago
  • .gitattributes
    1.52 kB
    initial commit about 1 month ago
  • .gitignore
    497 Bytes
    Initial commit: Tucano2-Commerce GRPO v3 training pipeline about 1 month ago
  • grpo_vertex_v2_ipynb.md
    58.3 kB
    Create grpo_vertex_v2_ipynb.md about 1 month ago