Zwounds
/

boolean-search-model

Text Generation

language-to-query

text2text-generation

text-generation-inference

4-bit precision

Model card Files Files and versions

Zwounds commited on Mar 18, 2025

Commit

5043f4a

·

verified ·

1 Parent(s): 6100275

Upload MODEL_CARD.md with huggingface_hub

Files changed (1) hide show

MODEL_CARD.md +2 -13

MODEL_CARD.md CHANGED Viewed

@@ -1,4 +1,4 @@
-# Boolean Search Query LLM
 This model is fine-tuned to convert natural language queries into boolean search expressions, optimized for academic and research database searching.
@@ -61,7 +61,7 @@ Fine-tuned: "artificial intelligence" AND (ethics OR regulation OR policy)  # Pr
 The model was trained on a curated dataset of natural language queries paired with their correct boolean translations. Dataset characteristics:
-- Size: 192 examples
 - Format: Natural query → Boolean expression pairs
 - Source: Manually curated academic search examples
 - Validation: Expert-reviewed for accuracy
@@ -69,9 +69,6 @@ The model was trained on a curated dataset of natural language queries paired wi
 ## Training Process
 - **Method**: LoRA fine-tuning
-- **Epochs**: 6
-- **Learning Rate**: 5e-5 with cosine scheduling
-- **Batch Size**: 16 (4 per device × 4 gradient accumulation steps)
 - **Hardware**: NVIDIA GeForce RTX 4070 Ti SUPER
 ## How to Use
@@ -150,14 +147,6 @@ result = tokenizer.decode(outputs[0], skip_special_tokens=True)
 print(result)  # "climate change" AND "renewable energy"
 ```
-## Evaluation Results
-Our test suite demonstrates consistent improvements over the base model in key areas:
-1. Meta-term removal accuracy: 100%
-2. Proper multi-word term quoting: 95%
-3. Logical grouping accuracy: 98%
-4. Minimal formatting adherence: 97%
 ## Citation
 If you use this model in your research, please cite:

+# Boolean Search Query Model
 This model is fine-tuned to convert natural language queries into boolean search expressions, optimized for academic and research database searching.
 The model was trained on a curated dataset of natural language queries paired with their correct boolean translations. Dataset characteristics:
+- Size: 135 examples
 - Format: Natural query → Boolean expression pairs
 - Source: Manually curated academic search examples
 - Validation: Expert-reviewed for accuracy
 ## Training Process
 - **Method**: LoRA fine-tuning
 - **Hardware**: NVIDIA GeForce RTX 4070 Ti SUPER
 ## How to Use
 print(result)  # "climate change" AND "renewable energy"
 ```
 ## Citation
 If you use this model in your research, please cite: