Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Log In
Sign Up
Dwootton
/
p2p-stabletoolbench
like
0
tool-use
evaluation
play2prompt
stabletoolbench
arxiv:
2403.07714
License:
mit
Model card
Files
Files and versions
xet
Community
main
p2p-stabletoolbench
/
pipeline
/
run_eval.py
Commit History
Add run_eval.py and virtual_api_server.py
53cd770
verified
Dwootton
commited on
7 days ago