will you test on becnhmarks?

by Roman1111111 - opened 28 days ago

Discussion

Roman1111111

28 days ago

could you please evaluate it on benchmarks

armand0e

TeichAI org 28 days ago

Way ahead of you brother. Just wait until you see these numbers ;)

Roman1111111

28 days ago

thanks bro😊❤️

armand0e

TeichAI org 27 days ago

all up. check out those massive relative gains on IFEval and ARC

Bob-the-Koala

27 days ago

1 epoch, did you tweak the learning rate?

CompactAI

TeichAI org 27 days ago

1 epoch, did you tweak the learning rate?

In my experience one epoch is just better than doing 2 or 3. Dont know if he did LR curves

armand0e

TeichAI org 26 days ago

No tweak on LR as of now.

saipangon

24 days ago

Just test on GPQA and MMLU, I wanna see how exactly this model perform

CompactAI

TeichAI org 23 days ago

Just test on GPQA and MMLU, I wanna see how exactly this model perform

When I get home i'll see if I can do that for you. :D

saipangon

22 days ago

Do it. I want to see your real benchmark

CompactAI

TeichAI org 22 days ago

Don't know why the other one was "fake" but ok.

armand0e

TeichAI org 22 days ago

Don't know why the other one was "fake" but ok.

Yea the relative gain/loss on those benchmarks should still hold. I think they just want it done the official way (using generate_until requests for the whole thing, few-shot, etc)

CompactAI

TeichAI org 22 days ago

Just test on GPQA and MMLU, I wanna see how exactly this model perform

So MMLU is 15,908 questions. Pretty sure im going to pass on that benchmark. But im working on others now

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment