GeoBenchLLM Metrics - a rfr2003 Collection

rfr2003 's Collections

GeoBenchLLM Metrics

GeoBenchLLM Metrics

updated Feb 26

This collection gather all the metrics used for the evaluation of the datasets in GeoBenchLLM.

Sleeping

Agents

MCQ_eval

🚀

Evaluate multiple-choice question predictions
Sleeping

Agents

Coord_eval

🚀

Evaluate coordinate predictions with a simple Gradio UI
Sleeping

Agents

NY_POI_evaluate

🚀

Evaluate your model on New York POI dataset
Sleeping

Agents

Keywords_evaluate

🚀

Evaluate keyword extraction with precision, recall, F1 scores
Sleeping

Agents

regression_evaluate

🚀

Evaluate regression model predictions with key metrics
Sleeping

Agents

Path_Planning_evaluate

🚀

Evaluate path planning results with performance metrics
Sleeping

Agents

Place_gen_evaluate

🚀

Evaluate your generated place images