# Instructions to Upload to Hugging Face This repository is ready to be pushed to Hugging Face Model Hub! ## Quick Setup (5 minutes) ### Step 1: Create Hugging Face Repository 1. Go to https://huggingface.co/new 2. Fill in: - **Model name**: `nfqa-multilingual-classifier` - **License**: Apache 2.0 (recommended) or your choice - **Visibility**: Public (or Private if you prefer) 3. Click **"Create model"** 4. **Important**: Copy your repository URL from the page ### Step 2: Get Your Access Token 1. Go to https://huggingface.co/settings/tokens 2. Click **"New token"** 3. Name: `model-upload` 4. Type: **Write** (important!) 5. Click **"Generate token"** 6. **Copy the token** (you won't see it again) ### Step 3: Connect This Repository Replace `YOUR_USERNAME` with your actual Hugging Face username: ```bash cd /Users/alisalman/thesis/nfqa-multilingual-classifier # Add Hugging Face as remote git remote add origin https://huggingface.co/YOUR_USERNAME/nfqa-multilingual-classifier # Configure git to use your HF credentials git config credential.helper store # Push to Hugging Face (you'll be prompted for username and token) git push -u origin master ``` When prompted: - **Username**: Your Hugging Face username - **Password**: Paste your access token (not your password!) ### Step 4: Verify Upload 1. Go to `https://huggingface.co/YOUR_USERNAME/nfqa-multilingual-classifier` 2. You should see: - ✅ All model files (11 files) - ✅ README with full documentation - ✅ Training visualizations (confusion matrix, training curves) - ✅ Model card with usage examples 3. Test the **Inference API** widget with a question --- ## Alternative: Use Hugging Face CLI If you prefer using the CLI: ```bash # Install if not already installed pip install --upgrade huggingface_hub # Login huggingface-cli login # Paste your token when prompted # Create repository huggingface-cli repo create nfqa-multilingual-classifier --type model # Upload cd /Users/alisalman/thesis/nfqa-multilingual-classifier huggingface-cli upload nfqa-multilingual-classifier . --repo-type model ``` --- ## What's Included This repository contains: ✅ **Model Files** (1.1 GB total): - `model.safetensors` - Model weights - `config.json` - Model configuration - `tokenizer.json` - Tokenizer - `tokenizer_config.json` - Tokenizer settings - `sentencepiece.bpe.model` - Vocabulary - `special_tokens_map.json` - Special tokens ✅ **Documentation**: - `README.md` - Comprehensive model card - `classification_report.txt` - Per-category performance - `test_results.json` - Detailed evaluation metrics ✅ **Visualizations**: - `confusion_matrix.png` - Test set confusion matrix - `training_curves.png` - Training/validation curves ✅ **Git Configuration**: - `.gitattributes` - LFS tracking for large files - `.gitignore` - Ignore patterns --- ## Before You Push ### Update README Placeholders Edit [README.md](README.md) and replace: - `[Your Name/Organization]` → Your actual name - `[Specify your license]` → Your license choice - `your-username/nfqa-multilingual-classifier` → Your actual repo URL - `[Your email]` → Your contact email - `[Your repository]` → Your GitHub repo (if any) You can edit directly on Hugging Face after uploading, or do it now: ```bash nano README.md # or use your preferred editor ``` --- ## Troubleshooting ### Error: "Repository not found" - Make sure you created the repository on huggingface.co first - Check that the username in the URL matches your HF username ### Error: "Authentication failed" - Make sure you're using your **token** as password, not your account password - Verify the token has **Write** permissions - Try `git credential reject` to clear cached credentials ### Error: "Large file not properly tracked" - LFS is already configured in this repo - Just push normally, git-lfs will handle large files automatically ### Upload is very slow - The model is ~1.1 GB, this is normal - It may take 5-15 minutes depending on your internet speed - Git LFS uploads large files efficiently --- ## After Upload 1. **Test the model**: ```python from transformers import pipeline classifier = pipeline("text-classification", model="YOUR_USERNAME/nfqa-multilingual-classifier") result = classifier("What is the capital of France?") print(result) ``` 2. **Add widget examples** in the README YAML front matter (optional) 3. **Share your model** on social media, papers, etc. 4. **Monitor usage** at `https://huggingface.co/YOUR_USERNAME/nfqa-multilingual-classifier/tree/main` --- ## Quick Reference ```bash # View repository status cd /Users/alisalman/thesis/nfqa-multilingual-classifier git status # View commit history git log --oneline # Check remote URL git remote -v # Push updates (after making changes) git add . git commit -m "Update model card" git push ``` --- **Need help?** - Hugging Face Docs: https://huggingface.co/docs/hub - Git LFS Guide: https://git-lfs.github.com/ **Ready to push?** Follow Step 3 above!