Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Log In
Sign Up
Dwootton
/
p2p-stabletoolbench
like
0
tool-use
evaluation
play2prompt
stabletoolbench
arxiv:
2403.07714
License:
mit
Model card
Files
Files and versions
xet
Community
main
p2p-stabletoolbench
/
pipeline
35.4 kB
Ctrl+K
Ctrl+K
1 contributor
History:
7 commits
Dwootton
Add virtual_api_server.py"
6284167
verified
7 days ago
config.py
1.28 kB
Add pipeline code: config, tool_utils, prompt_builder, llm_client, react_loop, run_eval, virtual_api_server
7 days ago
llm_client.py
2.45 kB
Add llm_client.py, react_loop.py, run_eval.py, virtual_api_server.py
7 days ago
prompt_builder.py
5.03 kB
Add prompt_builder.py
7 days ago
react_loop.py
6.45 kB
Add react_loop.py
7 days ago
run_eval.py
7.08 kB
Add run_eval.py and virtual_api_server.py
7 days ago
tool_utils.py
10.1 kB
Add tool_utils.py
7 days ago
virtual_api_server.py
3.02 kB
Add virtual_api_server.py"
7 days ago