Minibase committed on
Commit 4fcfaab · verified · 1 Parent(s): 4173f9b

Upload benchmarks.txt with huggingface_hub

Files changed (1)
  1. benchmarks.txt +46 -0
benchmarks.txt ADDED
@@ -0,0 +1,46 @@
+ # NER Benchmark Results
+ **Model:** Minibase-NER-Small
+ **Dataset:** ner_benchmark_dataset.jsonl
+ **Sample Size:** 100
+ **Date:** 2025-10-07T13:20:42.785262
+
+ ## Overall Performance
+
+ | Metric | Score | Description |
+ |--------|-------|-------------|
+ | F1 Score | 0.435 | Overall NER performance (harmonic mean of precision and recall) |
+ | Precision | 0.630 | Accuracy of entity predictions |
+ | Recall | 0.343 | Ability to find all entities |
+ | Average Latency | 76.6ms | Response time performance |
+
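Editor's note: the table describes F1 as the harmonic mean of precision and recall. The sketch below applies that formula to the aggregate precision and recall from the table. The function name is hypothetical and not part of the benchmark code. The result (about 0.444) is slightly higher than the reported 0.435, which would be consistent with F1 being averaged per example rather than computed from the aggregate values; that reading is an assumption, since the source does not say how the scores were aggregated.

```python
def f1_from_precision_recall(precision: float, recall: float) -> float:
    """Harmonic mean of precision and recall; 0.0 when both are zero."""
    if precision + recall == 0:
        return 0.0
    return 2 * precision * recall / (precision + recall)


# Aggregate values copied from the Overall Performance table.
pooled_f1 = f1_from_precision_recall(0.630, 0.343)
print(f"{pooled_f1:.3f}")  # prints 0.444
```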
+ ## Entity Type Performance
+
+ | Entity Type | Accuracy | Correct/Total |
+ |-------------|----------|---------------|
+ | ENTITY | 0.936 | 103/110 |
+
+ ## Key Improvements
+
+ - **BIO Tagging**: Model outputs entities in BIO (Beginning-Inside-Outside) format
+ - **Multiple Entity Types**: Supports PERSON, ORG, LOC, and MISC entities
+ - **Entity-Level Evaluation**: Metrics calculated at entity level rather than token level
+ - **Comprehensive Coverage**: Evaluates across different text domains
+
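Editor's note: to illustrate how BIO tags relate to entity-level evaluation, here is a minimal sketch that collapses token-level BIO tags into whole entities. The function `bio_to_entities`, the whitespace tokenization, the `B-PERSON`/`I-PERSON` tag spelling, and the hand-written tags in the usage example are all assumptions for illustration; the source does not show the model's actual output format or the benchmark's decoding code.

```python
from typing import List, Optional, Tuple


def bio_to_entities(tokens: List[str], tags: List[str]) -> List[Tuple[str, str]]:
    """Collapse token-level BIO tags into (entity_text, entity_type) pairs."""
    entities: List[Tuple[str, str]] = []
    current_tokens: List[str] = []
    current_type: Optional[str] = None

    def flush() -> None:
        # Close the entity currently being built, if there is one.
        nonlocal current_tokens, current_type
        if current_tokens:
            entities.append((" ".join(current_tokens), current_type))
        current_tokens, current_type = [], None

    for token, tag in zip(tokens, tags):
        if tag.startswith("B-"):
            flush()
            current_tokens, current_type = [token], tag[2:]
        elif tag.startswith("I-") and current_type == tag[2:]:
            current_tokens.append(token)
        else:
            # "O", or an I- tag that does not continue the open entity
            # (such a stray I- token is dropped in this sketch).
            flush()
    flush()
    return entities


# Hypothetical, hand-written tags (not actual model output).
tokens = ["John", "Smith", "works", "at", "Google", "in", "New", "York"]
tags = ["B-PERSON", "I-PERSON", "O", "O", "B-ORG", "O", "B-LOC", "I-LOC"]
entities = bio_to_entities(tokens, tags)
print(entities)
# prints [('John Smith', 'PERSON'), ('Google', 'ORG'), ('New York', 'LOC')]
```

Scoring these collapsed entities, rather than individual tokens, is what distinguishes entity-level from token-level evaluation: a multi-token entity counts once, and only if it is recovered as a whole under whatever matching rule the benchmark uses.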
+ ## Example Results
+
+ ### Example 1
+ **Input:** John Smith works at Google in New York and uses Python programming language....
+ **Predicted:** PERGON, ORG...
+ **F1 Score:** 0.000
+
+ ### Example 2
+ **Input:** Microsoft Corporation announced that Satya Nadella will visit London next week....
+ **Predicted:** 1. Microsoft Corporation...
+ **F1 Score:** 0.500
+
+ ### Example 3
+ **Input:** The University of Cambridge is located in the United Kingdom and was founded by King Henry III....
+ **Predicted:** 1. The University of Cambridge
+ 2. King Henry III...
+ **F1 Score:** 0.800
+
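Editor's note: the per-example F1 scores can be read as set-based, entity-level scores. The sketch below is one plausible way to compute such a score, not the benchmark's confirmed method. The function `entity_f1` is hypothetical, matching is by exact string, and the gold set for Example 3 is an assumption (the source does not list gold labels, and the predicted output is truncated). Under those assumptions the score comes out to 0.800, which matches the reported value but does not confirm how it was computed.

```python
def entity_f1(predicted: set, gold: set) -> float:
    """Entity-level F1 using exact string match between predicted and gold sets."""
    true_positives = len(predicted & gold)
    if true_positives == 0:
        return 0.0
    precision = true_positives / len(predicted)
    recall = true_positives / len(gold)
    return 2 * precision * recall / (precision + recall)


# Predicted entities as listed for Example 3 (assumes the truncated output
# contains exactly these two entities).
predicted = {"The University of Cambridge", "King Henry III"}
# Hypothetical gold set; the benchmark's actual labels are not shown in the source.
gold = {"The University of Cambridge", "United Kingdom", "King Henry III"}

example_f1 = entity_f1(predicted, gold)
print(f"{example_f1:.3f}")  # prints 0.800
```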