Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Buckets new
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
rfr2003 's Collections
GeoBenchLLM Metrics
GeoBenchLLM

GeoBenchLLM Metrics

updated Feb 26

This collection gather all the metrics used for the evaluation of the datasets in GeoBenchLLM.

Upvote
-

  • Sleeping
    Agents

    MCQ_eval

    🚀

    Evaluate multiple-choice question predictions


  • Sleeping
    Agents

    Coord_eval

    🚀

    Evaluate coordinate predictions with a simple Gradio UI


  • Sleeping
    Agents

    NY_POI_evaluate

    🚀

    Evaluate your model on New York POI dataset


  • Sleeping
    Agents

    Keywords_evaluate

    🚀

    Evaluate keyword extraction with precision, recall, F1 scores


  • Sleeping
    Agents

    regression_evaluate

    🚀

    Evaluate regression model predictions with key metrics


  • Sleeping
    Agents

    Path_Planning_evaluate

    🚀

    Evaluate path planning results with performance metrics


  • Sleeping
    Agents

    Place_gen_evaluate

    🚀

    Evaluate your generated place images

Upvote
-
  • Collection guide
  • Browse collections
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs