YAML Metadata Warning:empty or missing yaml metadata in repo card

Check out the documentation for more information.

# CamemBERT Model - NEEDS RETRAINING

Current Status: OUTDATED

This model was trained with CamembertTokenizer (old) but the code now uses CamembertTokenizerFast (required for NER).

Symptoms:

  • Wrong entity predictions (Paris PER instead of DEP)
  • "Departure: Not found" even when cities are mentioned
  • Mixed up entity types (VIA, MISC instead of DEP, ARR)

Fix: Retrain the Model

Option 1: Using Notebook (Recommended) Open: notebooks/Complete_NER_Training_and_Analysis.ipynb Run: Cell 16 (Part 3.2 - Train CamemBERT) Time: ~10-15 min (GPU) or ~45 min (CPU)

Option 2: Python Script python from src.train_camembert import CamemBERTNERTrainer trainer = CamemBERTNERTrainer("camembert-base", "models/camembert", 128) trainer.train("data/train.csv", "data/val.csv", epochs=3, batch_size=16)

After Retraining:

`ash python app.py --model models/camembert --port 5000

Should now correctly predict: "Paris Lyon" DEP: Paris, ARR: Lyon

`


Last Updated: 2026-02-02

Downloads last month
-
Safetensors
Model size
0.1B params
Tensor type
F32
·
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support