gpt-neo-125M-nutrition
This model was trained using influence-guided dataset selection, a technique that uses influence scores to identify the most impactful training data for specific concepts.
Model Description
- Base Model: EleutherAI/gpt-neo-125m
- Training Concepts: nutrition analysis, dietary assessment, meal description parsing, food classification, macronutrient estimation
- Training Method: Influence-guided data selection
- Compute Budget: 100 steps per condition
- Total Datasets: 3
Training Approach
This model was trained using three different data selection strategies to validate the effectiveness of influence-guided training:
- Positive Influence: Datasets with high positive influence scores (most aligned with target concepts)
- Random Baseline: Randomly sampled datasets
- Negative Influence: Datasets with high negative influence scores (least aligned)
Benchmark Results
| Condition | Perplexity ↓ | Train Loss ↓ | Eval Loss ↓ |
|---|---|---|---|
| Positive | 1.72 | 0.8731 | 0.5442 |
| Random | 1.20 | 0.2664 | 0.1838 |
| Negative | 1.20 | 0.4567 | 0.1804 |
Lower is better for all metrics
Training Datasets
The model was trained on datasets selected through influence scoring:
Lots-of-LoRAs/task1193_food_course_classification(Influence: 7.766)supergoose/flan_combined_task1193_food_course_classification(Influence: 18.294)supergoose/flan_combined_task527_parsinlu_food_overal_classification(Influence: -0.557)
Intended Use
This model demonstrates the effectiveness of influence-guided training for:
- Concept-specific language modeling
- Data-efficient training
- Dataset curation research
Limitations
- Trained on a limited compute budget for benchmarking purposes
- May not generalize well outside the target concepts: nutrition analysis, dietary assessment, meal description parsing, food classification, macronutrient estimation
- Performance depends on the quality of influence score estimation
Citation
If you use this model or the influence-guided training approach, please cite:
@software{influence_guided_training,
title = {Influence-Guided Dataset Selection for Language Models},
author = {Dowser by Durinn},
year = {2025},
url = {https://huggingface.co/vstrandmoe/gpt-neo-125M-nutrition}
}
Model Card Contact
For questions or feedback, visit Durinn
Generated by Dowser - Dataset discovery and training optimization
- Downloads last month
- 1
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support