Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Buckets new
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
nickhe 's Collections
FIRL-Abalone-REINFORCE++

FIRL-Abalone-REINFORCE++

updated Jan 25

Saved LORA adapter checkpoints from training Qwen2.5-7B to generate decision trees for Abalone age regression dataset, using reinforce++ algorithm.

Upvote
-

  • nickhe/firl-ckpt-20-40-60

    Updated Jan 23

  • nickhe/firl-ckpt-100

    Updated Jan 23

  • nickhe/firl-ckpt-120

    Updated Jan 23

  • nickhe/firl-ckpt-140

    Updated Jan 23

  • nickhe/firl-ckpt-160

    Updated Jan 23

  • nickhe/firl-ckpt-260

    Updated Jan 23

  • nickhe/firl-ckpt-280

    Updated Jan 23

  • nickhe/firl-ckpt-300

    Updated Jan 23

  • nickhe/firl-ckpt-360

    Updated Jan 23

  • nickhe/firl-ckpt-380

    Updated Jan 23

  • nickhe/firl-ckpt-420

    Updated Jan 23

  • nickhe/firl-ckpt-460

    Updated Jan 24

  • nickhe/firl-ckpt-500

    Updated Jan 24

  • nickhe/firl-ckpt-200

    Updated Jan 24

  • nickhe/firl-ckpt-180

    Updated Jan 24

  • nickhe/firl-ckpt-220

    Updated Jan 24

  • nickhe/firl-ckpt-240

    Updated Jan 24

  • nickhe/firl-ckpt-520

    Updated Jan 24

  • nickhe/firl-ckpt-540

    Updated Jan 24

  • nickhe/firl-ckpt-440

    Updated Jan 24

  • nickhe/firl-ckpt-660

    Updated Jan 24

  • nickhe/firl-ckpt-620

    Updated Jan 24

  • nickhe/firl-ckpt-580

    Updated Jan 24

  • nickhe/firl-ckpt-840

    Updated Jan 24

  • nickhe/firl-ckpt-800

    Updated Jan 24

  • nickhe/firl-ckpt-760

    Updated Jan 24

  • nickhe/firl-ckpt-720

    Updated Jan 24
Upvote
-
  • Collection guide
  • Browse collections
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs