Token Classification
GLiNER
PyTorch
ONNX
English
Hindi
NER
named-entity-recognition
floatbot
conversational-ai
chatbot
customer-support
Rishi2455 commited on
Commit
fd5e469
·
verified ·
1 Parent(s): 1697b38

Add model card

Browse files
Files changed (1) hide show
  1. README.md +86 -0
README.md ADDED
@@ -0,0 +1,86 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ license: apache-2.0
3
+ base_model: knowledgator/gliner-x-large
4
+ tags:
5
+ - gliner
6
+ - NER
7
+ - named-entity-recognition
8
+ - floatbot
9
+ - conversational-ai
10
+ - chatbot
11
+ - customer-support
12
+ - token-classification
13
+ language:
14
+ - en
15
+ - hi
16
+ datasets:
17
+ - Rishi2455/gliner-floatbot-ai-training
18
+ library_name: gliner
19
+ pipeline_tag: token-classification
20
+ ---
21
+
22
+ # GLiNER Fine-Tuned for Floatbot.ai
23
+
24
+ Fine-tuned version of [knowledgator/gliner-x-large](https://huggingface.co/knowledgator/gliner-x-large) for domain-specific NER in the conversational AI / customer support domain.
25
+
26
+ ## Entity Types (30)
27
+
28
+ This model recognizes 30 entity types relevant to Floatbot.ai's platform:
29
+
30
+ `customer_name` · `organization` · `product_name` · `service_type` · `channel` · `date` · `time` · `monetary_amount` · `order_id` · `ticket_id` · `account_number` · `phone_number` · `email_address` · `complaint_category` · `intent_keyword` · `department` · `plan_name` · `feature_name` · `api_endpoint` · `bot_name` · `language` · `platform` · `integration` · `metric_name` · `percentage` · `duration` · `location` · `priority_level` · `status` · `error_type`
31
+
32
+ ## Usage
33
+
34
+ ```python
35
+ from gliner import GLiNER
36
+
37
+ model = GLiNER.from_pretrained("Rishi2455/gliner-floatbot-ai")
38
+
39
+ text = "Rajesh from Infosys wants to integrate Floatbot with Salesforce for their Mumbai call center."
40
+ labels = ["customer_name", "organization", "product_name", "integration", "location", "service_type"]
41
+
42
+ entities = model.predict_entities(text, labels, threshold=0.4)
43
+ for ent in entities:
44
+ print(f" '{ent['text']}' → {ent['label']} (score: {ent['score']:.3f})")
45
+ ```
46
+
47
+ ## Training Details
48
+
49
+ | Parameter | Value |
50
+ |-----------|-------|
51
+ | Base model | knowledgator/gliner-x-large (1.3B params) |
52
+ | Training samples | 86 |
53
+ | Entity types | 30 |
54
+ | Learning rate (encoder) | 5e-6 |
55
+ | Learning rate (others) | 1e-5 |
56
+ | Loss | Focal loss (α=0.75, γ=2) |
57
+ | Epochs | 12 |
58
+ | Effective batch size | 8 |
59
+
60
+ ## Training Recipe
61
+
62
+ Based on published research:
63
+ - [GLiNER-BioMed](https://arxiv.org/abs/2504.00676) — domain adaptation blueprint
64
+ - [NERCat](https://arxiv.org/abs/2503.14173) — small dataset fine-tuning recipe
65
+ - [GLiNER](https://arxiv.org/abs/2311.08526) — original model architecture
66
+
67
+ ## Training Data & Script
68
+
69
+ See [Rishi2455/gliner-floatbot-ai-training](https://huggingface.co/datasets/Rishi2455/gliner-floatbot-ai-training) for the complete training dataset and fine-tuning script.
70
+
71
+ ## How to Run Training
72
+
73
+ ```bash
74
+ pip install gliner torch transformers accelerate trackio huggingface_hub
75
+ huggingface-cli login
76
+
77
+ # Download and run the training script
78
+ wget https://huggingface.co/datasets/Rishi2455/gliner-floatbot-ai-training/resolve/main/train_gliner.py
79
+ python train_gliner.py
80
+ ```
81
+
82
+ **Hardware required**: GPU with ≥24GB VRAM (A10G, RTX 3090, A100, etc.)
83
+
84
+ ## Status
85
+
86
+ ⏳ **Awaiting training** — Run `train_gliner.py` on a GPU to generate model weights.