Spaces:

tanmmayyy
/

mcq_generator

Running

App Files Files Community

tanmmayyy commited on 13 days ago

Commit

a50befe

1 Parent(s): c0a212c

for deployment

Browse files

Files changed (11) hide show

.devcontainer/devcontainer.json +0 -33
.gitignore +3 -1
.python_version +0 -1
.streamlit/config.toml +0 -6
README.md +548 -9
app/main.py +83 -126
finetune_t5_file.ipynb +0 -0
requirements.txt +0 -0
runtime.txt +0 -1
setup.sh +0 -1
src/question_generator.py +12 -0

.devcontainer/devcontainer.json DELETED Viewed

@@ -1,33 +0,0 @@
-{
-  "name": "Python 3",
-  // Or use a Dockerfile or Docker Compose file. More info: https://containers.dev/guide/dockerfile
-  "image": "mcr.microsoft.com/devcontainers/python:1-3.11-bookworm",
-  "customizations": {
-    "codespaces": {
-      "openFiles": [
-        "README.md",
-        "app/main.py"
-      ]
-    },
-    "vscode": {
-      "settings": {},
-      "extensions": [
-        "ms-python.python",
-        "ms-python.vscode-pylance"
-      ]
-    }
-  },
-  "updateContentCommand": "[ -f packages.txt ] && sudo apt update && sudo apt upgrade -y && sudo xargs apt install -y <packages.txt; [ -f requirements.txt ] && pip3 install --user -r requirements.txt; pip3 install --user streamlit; echo '✅ Packages installed and Requirements met'",
-  "postAttachCommand": {
-    "server": "streamlit run app/main.py --server.enableCORS false --server.enableXsrfProtection false"
-  },
-  "portsAttributes": {
-    "8501": {
-      "label": "Application",
-      "onAutoForward": "openPreview"
-    }
-  },
-  "forwardPorts": [
-    8501
-  ]
-}

.gitignore CHANGED Viewed

@@ -22,4 +22,6 @@ __pycache__/
 Thumbs.db
 # Jupyter checkpoints
-.ipynb_checkpoints/

 Thumbs.db
 # Jupyter checkpoints
+.ipynb_checkpoints/
+.env

.python_version DELETED Viewed

	@@ -1 +0,0 @@
1	- 3.11

.streamlit/config.toml DELETED Viewed

@@ -1,6 +0,0 @@
-[server]
-headless = true
-port = 8501
-[theme]
-base = "light"

README.md CHANGED Viewed

@@ -1,10 +1,549 @@
 ---
-title: MCQ Generator
-emoji: 📝
-colorFrom: blue
-colorTo: purple
-sdk: streamlit
-sdk_version: 1.33.0
-app_file: app/main.py
-pinned: false
----

+# 📝 MCQ Generator — Automatic Multiple Choice Question Generator
+> **An end-to-end NLP pipeline that reads any text passage and automatically generates a complete multiple-choice quiz with scoring and explanations.**
+Built as a course project for an NLP curriculum covering Modules I–IV: tokenization, word embeddings, transformers, and natural language generation.
 ---
+## 📌 Table of Contents
+1. [What This Project Does](#what-this-project-does)
+2. [Live Demo](#live-demo)
+3. [How It Works — The Full Pipeline](#how-it-works--the-full-pipeline)
+4. [NLP Techniques Used](#nlp-techniques-used)
+5. [Project Structure](#project-structure)
+6. [Each File Explained](#each-file-explained)
+7. [Tech Stack](#tech-stack)
+8. [Setup & Installation](#setup--installation)
+9. [Running the App](#running-the-app)
+10. [Testing Each Module](#testing-each-module)
+11. [Sample Output](#sample-output)
+12. [What Makes a Good Passage](#what-makes-a-good-passage)
+13. [Known Limitations](#known-limitations)
+14. [Future Work](#future-work)
+15. [Related Research](#related-research)
+16. [Course Outcomes Covered](#course-outcomes-covered)
+---
+## What This Project Does
+Given any factual text passage, this system:
+1. **Extracts** the most important sentences using TF-IDF ranking
+2. **Identifies** answer candidates using Named Entity Recognition (NER)
+3. **Generates** natural language questions using a T5 transformer model
+4. **Creates** plausible wrong options (distractors) using WordNet and NER
+5. **Presents** an interactive quiz with scoring and per-question explanations
+**Example:**
+Input passage:
+```
+Albert Einstein was born on March 14, 1879, in Ulm, Germany.
+He was awarded the Nobel Prize in Physics in 1921 for his
+discovery of the photoelectric effect.
+```
+Generated MCQ:
+```
+Q: Where was Albert Einstein born?
+A. France
+B. Germany  ✓
+C. United States
+D. Switzerland
+```
+---
+## Live Demo
+```bash
+streamlit run app/main.py
+```
+Opens at `http://localhost:8501` in your browser.
+---
+## How It Works — The Full Pipeline
+```
+Raw Text Passage
+       │
+       ▼
+┌─────────────────────────────────────────────┐
+│  STEP 1: PREPROCESSING  (preprocessor.py)   │
+│                                             │
+│  • Split into sentences (spaCy)             │
+│  • Rank by TF-IDF score (scikit-learn)      │
+│  • Extract Named Entities (spaCy NER)       │
+│  • Filter answer candidates (blacklist)     │
+└─────────────────┬───────────────────────────┘
+                  │  top sentences + answer candidates
+                  ▼
+┌─────────────────────────────────────────────┐
+│  STEP 2: QUESTION GENERATION                │
+│          (question_generator.py)            │
+│                                             │
+│  • Highlight answer in sentence with <hl>   │
+│  • Feed to T5 transformer model             │
+│  • Generate 3 candidate questions           │
+│  • Validate: reject circular/vague Qs       │
+└─────────────────┬───────────────────────────┘
+                  │  (question, answer) pairs
+                  ▼
+┌─────────────────────────────────────────────┐
+│  STEP 3: DISTRACTOR GENERATION              │
+│          (distractor_generator.py)          │
+│                                             │
+│  Strategy 1: Same-type NER entities         │
+│              from the passage               │
+│  Strategy 2: WordNet hyponym siblings       │
+│  Strategy 3: Cross-label fallback           │
+└─────────────────┬───────────────────────────┘
+                  │  3 wrong options per question
+                  ▼
+┌─────────────────────────────────────────────┐
+│  STEP 4: MCQ ASSEMBLY + VALIDATION          │
+│          (mcq_builder.py)                   │
+│                                             │
+│  • Combine answer + distractors             │
+│  • Shuffle options randomly                 │
+│  • Quality gate: dedup, similarity check    │
+│  • Return list of MCQ objects               │
+└─────────────────┬─────���─────────────────────┘
+                  │  validated MCQ list
+                  ▼
+┌─────────────────────────────────────────────┐
+│  STEP 5: QUIZ UI + SCORING                  │
+│          (app/main.py + evaluator.py)       │
+│                                             │
+│  • Streamlit 3-screen app                   │
+│  • Input → Quiz → Results                   │
+│  • Score, feedback, explanations            │
+└─────────────────────────────────────────────┘
+```
+---
+## NLP Techniques Used
+### Module I — Foundational NLP
+| Technique | Where Used | Purpose |
+|---|---|---|
+| Tokenization | `preprocessor.py` | Split text into sentences and tokens using spaCy |
+| Lemmatization | `preprocessor.py` | Normalize word forms for TF-IDF |
+| Stop word removal | `preprocessor.py` | Filter noise before TF-IDF scoring |
+| Named Entity Recognition (NER) | `preprocessor.py` | Find PERSON, ORG, DATE, GPE as answer candidates |
+| POS Tagging | `preprocessor.py` | Identify nouns and proper nouns |
+| WordNet | `distractor_generator.py` | Find semantically related words as distractors |
+| Synsets / Hyponyms | `distractor_generator.py` | Navigate WordNet hierarchy for same-category words |
+### Module II — Word Representation
+| Technique | Where Used | Purpose |
+|---|---|---|
+| TF-IDF | `preprocessor.py` | Rank sentences by information density |
+| Word Embeddings (GloVe) | `distractor_generator.py` | Optional cosine-similarity based distractor finding |
+**TF-IDF explained:**
+- **TF (Term Frequency)** = how often a word appears in *this* sentence
+- **IDF (Inverse Document Frequency)** = how rare the word is across *all* sentences
+- High TF-IDF score = sentence contains rare, informative words → good question source
+### Module III — Deep Learning for NLP
+| Technique | Where Used | Purpose |
+|---|---|---|
+| Transformers | `question_generator.py` | T5 model for question generation |
+| Transfer Learning | `question_generator.py` | Using pre-trained T5 fine-tuned on SQuAD |
+| Seq2Seq | `question_generator.py` | Encoder-decoder architecture of T5 |
+| Beam Search | `question_generator.py` | Generate multiple question candidates, pick best |
+### Module IV — Advanced NLP
+| Technique | Where Used | Purpose |
+|---|---|---|
+| T5 (Text-to-Text Transfer Transformer) | `question_generator.py` | State-of-the-art QG model |
+| Natural Language Generation (NLG) | `question_generator.py` | Generating grammatical questions |
+| Subword Tokenization (SentencePiece) | `question_generator.py` | T5's tokenizer handles rare/unknown words |
+| Pre-trained Models | `question_generator.py` | `valhalla/t5-small-qg-hl` from HuggingFace |
+---
+## Project Structure
+```
+mcq_generator/
+│
+├── src/                          # Core NLP pipeline modules
+│   ├── __init__.py
+│   ├── preprocessor.py           # Text cleaning, TF-IDF, NER, answer extraction
+│   ├── question_generator.py     # T5-based question generation
+│   ├── distractor_generator.py   # WordNet + NER distractor generation
+│   ├── mcq_builder.py            # Pipeline orchestrator + MCQ dataclass
+│   └── evaluator.py              # Answer checking and scoring
+│
+├── app/                          # Streamlit web application
+│   ├── __init__.py
+│   ├── main.py                   # 3-screen app: input → quiz → results
+│   └── components.py             # Reusable UI components
+│
+├── data/
+│   └── sample_passages.json      # 5 test passages (ISRO, Gandhi, AI, etc.)
+│
+├── models/                       # (gitignored) Downloaded model files
+│   └── README.md
+│
+├── notebooks/                    # Jupyter notebooks for exploration
+│
+├── config.py                     # All settings in one place
+├── requirements.txt              # Python dependencies
+└── README.md                     # This file
+```
+---
+## Each File Explained
+### `config.py`
+Central settings file. Every other module imports from here.
+- Model name, number of questions, sentence count, file paths
+- Change values here to tune the entire system without touching logic files
+### `src/preprocessor.py`
+The NLP foundation of the project.
+**Key functions:**
+- `extract_sentences(text)` — spaCy sentence boundary detection
+- `rank_sentences(sentences)` — TF-IDF scoring, returns top N most informative sentences
+- `extract_answer_candidates(sentence)` — NER-based extraction with strict quality filters
+- `preprocess(text)` — full pipeline, returns structured dict
+**Design decisions:**
+- Only `PERSON`, `ORG`, `GPE`, `DATE`, `EVENT`, `WORK_OF_ART` NER labels are accepted as answers
+- A `BLACKLIST` of 30+ generic words ("annual", "various", "Moon") prevents trivial answers
+- Answers are sorted by priority: PERSON > ORG/GPE > DATE > others
+### `src/question_generator.py`
+Uses the `valhalla/t5-small-qg-hl` model — a T5-small fine-tuned on SQuAD for question generation.
+**How T5 QG works:**
+```
+Input:  "generate question: ISRO was founded in <hl> 1969 <hl> by Vikram Sarabhai."
+Output: "In what year was ISRO founded?"
+```
+**Key functions:**
+- `highlight_answer(sentence, answer)` — wraps answer in `<hl>` tags
+- `generate_question(sentence, answer)` — beam search with 5 beams, 3 candidates
+- `answer_is_addressable(question, answer)` — rejects circular, vague, or short questions
+**Quality filters applied:**
+- Must start with a question word (what/who/when/where/which/how)
+- Answer must NOT appear in the question
+- Abbreviation trap detection (e.g. rejects Q: "What does ISRO stand for?" when A is the full name)
+- Minimum 5 words
+### `src/distractor_generator.py`
+Generates 3 plausible wrong answer options. Uses a priority-based strategy chain.
+**Strategy 1 — Same-label NER (best):**
+Finds other entities of the same NER type from the passage.
+```
+Answer: "1969" (DATE) → Distractors: ["1975", "2008", "2023"]  (other DATEs in passage)
+Answer: "Vikram Sarabhai" (PERSON) → Distractors: ["Kalam", "Dhawan", "Nehru"]
+```
+**Strategy 2 — WordNet hyponyms:**
+Navigates the WordNet hierarchy to find sibling words in the same semantic category.
+```
+Answer: "India" → hypernym: "country" → hyponyms: ["China", "Brazil", "Pakistan"]
+```
+**Strategy 3 — Cross-label fallback:**
+Uses any other named entity from the passage if strategies 1 and 2 fail.
+### `src/mcq_builder.py`
+The single entry point that the UI calls. Orchestrates the entire pipeline.
+**MCQ dataclass:**
+```python
+@dataclass
+class MCQ:
+    question       : str
+    options        : list      # 4 shuffled options
+    correct_index  : int       # index of correct answer (0-3)
+    correct_answer : str
+    explanation    : str       # original sentence
+```
+**Quality gate `is_valid_mcq()`:**
+- No two options can be too similar (catches "WWE" vs "World Wrestling Entertainment")
+- Answer must appear exactly once in options
+- Maximum 1 generic placeholder option allowed
+- Answer must not appear in question text
+### `src/evaluator.py`
+Checks answers and computes scores.
+**Returns:**
+```python
+{
+  "score"     : 7,
+  "total"     : 10,
+  "percentage": 70.0,
+  "feedback"  : "Good effort! Review the explanations...",
+  "results"   : [ {per-question breakdown} ]
+}
+```
+### `app/main.py`
+Streamlit app with 3 screens managed via `st.session_state`:
+- **Screen 1 (input):** Text area + question count slider + Generate button
+- **Screen 2 (quiz):** One question at a time, radio buttons, Previous/Next/Submit
+- **Screen 3 (results):** Score banner + per-question feedback with explanations
+### `app/components.py`
+Reusable display functions:
+- `render_question_card()` — A/B/C/D labelled radio buttons
+- `render_result_card()` — green (correct) / red (wrong) with explanation
+- `render_score_summary()` — score banner + metric cards
+---
+## Tech Stack
+| Library | Version | Purpose |
+|---|---|---|
+| `spaCy` | 3.7.4 | Tokenization, NER, POS tagging, sentence splitting |
+| `transformers` | 4.38.2 | T5 model for question generation |
+| `torch` | 2.2.1 | PyTorch backend for transformers |
+| `nltk` | 3.8.1 | WordNet access for distractor generation |
+| `scikit-learn` | 1.4.1.post1 | TF-IDF vectorizer |
+| `sentencepiece` | latest | T5's subword tokenizer |
+| `streamlit` | 1.33.0 | Web UI framework |
+| `gensim` | 4.3.2 | Word2Vec / GloVe loading (optional) |
+| `numpy` | 1.26.4 | TF-IDF matrix operations |
+**Pre-trained model used:**
+- `valhalla/t5-small-qg-hl` — T5-small fine-tuned on SQuAD 1.0 for answer-aware question generation using highlight format. Hosted on HuggingFace Hub, downloaded automatically on first run (~240MB).
+---
+## Setup & Installation
+### Prerequisites
+- Python 3.11+
+- pip
+- Internet connection (first run downloads the T5 model)
+### Step 1 — Clone the repository
+```bash
+git clone https://github.com/tanmmayyy/mcq-generator.git
+cd mcq-generator
+```
+### Step 2 — Create a virtual environment
+```bash
+python -m venv myenv
+# Windows
+myenv\Scripts\activate
+# Mac/Linux
+source myenv/bin/activate
+```
+### Step 3 — Install dependencies
+```bash
+pip install -r requirements.txt
+pip install sentencepiece   # required for T5 tokenizer
+```
+### Step 4 — Download spaCy language model
+```bash
+# If the default command fails:
+pip install https://github.com/explosion/spacy-models/releases/download/en_core_web_sm-3.7.1/en_core_web_sm-3.7.1-py3-none-any.whl
+```
+### Step 5 — Verify installation
+```bash
+python -c "import spacy; nlp = spacy.load('en_core_web_sm'); print('spaCy OK')"
+python -c "from transformers import pipeline; print('Transformers OK')"
+```
+---
+## Running the App
+```bash
+streamlit run app/main.py
+```
+The app opens at `http://localhost:8501`. On first launch, the T5 model downloads (~240MB) and loads into memory — this takes 1–2 minutes. Subsequent launches are fast.
+---
+## Testing Each Module
+Run these in order to verify each step of the pipeline works independently:
+```bash
+# Step 1 — Test preprocessing (NER, TF-IDF, sentence ranking)
+python src/preprocessor.py
+# Step 2 — Test question generation (T5 model)
+python src/question_generator.py
+# Step 3 — Test distractor generation (WordNet + NER)
+python src/distractor_generator.py
+# Step 4 — Test full pipeline end-to-end
+python src/mcq_builder.py
+# Step 5 — Test scoring
+python src/evaluator.py
+```
+---
+## Sample Output
+**Input passage (ISRO):**
+```
+The Indian Space Research Organisation (ISRO) was founded in 1969 by Vikram Sarabhai.
+ISRO developed India's first satellite, Aryabhata, which was launched in 1975.
+The Chandrayaan-1 mission in 2008 discovered water molecules on the Moon.
+In 2023, Chandrayaan-3 successfully landed near the lunar south pole.
+The Mars Orbiter Mission, also called Mangalyaan, was launched in 2013.
+```
+**Generated questions:**
+```
+Q1: Who founded ISRO?
+    A. Jawaharlal Nehru
+    B. APJ Abdul Kalam
+    C. Vikram Sarabhai  ✓
+    D. Homi Bhabha
+Q2: What was India's first satellite called?
+    A. Chandrayaan
+    B. Mangalyaan
+    C. Rohini
+    D. Aryabhata  ✓
+Q3: When did the Chandrayaan-1 mission take place?
+    A. 1975
+    B. 2013
+    C. 2023
+    D. 2008  ✓
+Q4: What mission made India the first Asian country to reach Mars orbit?
+    A. Chandrayaan-3
+    B. Aryabhata
+    C. Mangalyaan  ✓
+    D. Chandrayaan-1
+```
+---
+## What Makes a Good Passage
+The system performs best on **factual passages** that contain:
+| Works well | Works poorly |
+|---|---|
+| People names (PERSON entities) | Opinion / descriptive text |
+| Specific dates (DATE entities) | Passages with repeated entities |
+| Organisation names (ORG entities) | Very short passages (< 5 sentences) |
+| Place names (GPE entities) | Abstract/philosophical text |
+| One clear fact per sentence | Sentences with multiple facts |
+**Best passage types:** History, science, geography, biographies, Wikipedia-style articles
+**Avoid:** Opinion pieces, marketing content, descriptive narratives without specific facts
+---
+## Known Limitations
+1. **Passage type dependency** — Works best on factual text. Descriptive or opinion text produces poor questions because there are no named entities to use as answers.
+2. **T5-small quality ceiling** — The model used (`t5-small`) has 60M parameters. Larger models like `t5-base` or `t5-large` would produce better questions but require more memory and time.
+3. **Distractor diversity** — When a passage has few named entities, distractors may fall back to generic options. Fine-tuning a separate T5 model on the RACE dataset for distractor generation would fix this.
+4. **English only** — The current pipeline only supports English text. Extending to Hindi or other Indic languages would require multilingual spaCy models and a multilingual QG model.
+5. **No semantic deduplication** — Two questions from the same passage can sometimes be semantically similar even if worded differently.
+---
+## Future Work
+- [ ] Fine-tune a T5 distractor generation model on the RACE dataset (100k exam questions)
+- [ ] Add support for Hindi using IndicNLP + multilingual BERT
+- [ ] Add PDF upload support so users can quiz themselves on any document
+- [ ] BLEU/METEOR/ROUGE automated evaluation of generated questions
+- [ ] Difficulty scoring per question based on distractor plausibility
+- [ ] Export quiz as PDF for offline use
+---
+## Related Research
+Papers that use similar approaches — cited for comparison:
+1. **Automatic Generation of Multiple-Choice Questions (2023)**
+   Zhang et al. — T5 with pre/postprocessing pipelines for MCQ generation
+   https://arxiv.org/abs/2303.14576
+2. **Deep Learning and Linguistic Feature Based Automatic MCQ Generation (Springer, ICDCIT 2022)**
+   Agarwal et al. — DL + linguistic features for MCQ generation (same 3-step pipeline)
+   https://link.springer.com/chapter/10.1007/978-3-030-94876-4_18
+3. **End-to-End MCQ Generation Using T5 (ScienceDirect 2022)**
+   Rodriguez-Torrealba et al. — Full T5-based pipeline with Wikipedia passages
+   https://www.sciencedirect.com/science/article/pii/S0957417422014014
+4. **Leaf — MCQ Generation System (ECIR 2022)**
+   Vachev et al. — Two fine-tuned T5 models: one for QG, one for DG on RACE
+   https://github.com/KristiyanVachev/Leaf-Question-Generation
+5. **Automatic Distractor Generation — Systematic Review (PMC 2024)**
+   Comprehensive review of distractor generation methods including WordNet and T5
+   https://pmc.ncbi.nlm.nih.gov/articles/PMC11623049/
+6. **Automatic Question Generation: A Review (Springer/PMC 2023)**
+   Mulla & Gharpure — Survey of methodologies, datasets, and evaluation metrics
+   https://pmc.ncbi.nlm.nih.gov/articles/PMC9886210/
+**What differentiates this project from the above:**
+- End-to-end pipeline with interactive quiz UI (most papers only generate questions)
+- NER-type-matching distractor strategy (distractors always same entity type as answer)
+- Multi-layer quality filtering at both question and MCQ level
+- Answer circularity detection (rejects questions where answer appears in the question)
+---
+## Course Outcomes Covered
+| CO | Description | How this project covers it |
+|---|---|---|
+| CO1 | Articulate NLP and word representation | TF-IDF, NER, WordNet, word embeddings all implemented and explained |
+| CO2 | Build deep learning models for NLP problems | T5 transformer for QG (seq2seq), beam search decoding, transfer learning |
+| CO3 | Implement ML/DL solutions in real context | End-to-end deployable system with Streamlit UI and interactive demo |
+---
+## Author
+**[Tanmay Jain]**
+[ Bennett University]
+---
+*Built with spaCy, HuggingFace Transformers, NLTK, scikit-learn, and Streamlit.*

app/main.py CHANGED Viewed

@@ -1,36 +1,31 @@
-# ─────────────────────────────────────────────
-#  app/main.py
-#  Streamlit UI — the full interactive quiz app.
-#
-#  Run with:  streamlit run app/main.py
-#
-#  Three screens:
-#    1. INPUT   → user pastes a passage, picks # of questions
-#    2. QUIZ    → one question at a time with radio buttons
-#    3. RESULTS → score + per-question feedback
-# ─────────────────────────────────────────────
 import streamlit as st
 import sys, os
-# Make sure we can import from project root
 sys.path.append(os.path.dirname(os.path.dirname(os.path.abspath(__file__))))
 from config import APP_TITLE, APP_ICON, MAX_QUESTIONS
-from src.mcq_builder import build_quiz
-from src.evaluator   import score_quiz
-from app.components  import render_question_card, render_result_card, render_score_summary
-# ─────────────────────────────────────────────
-#  PAGE CONFIG — must be first Streamlit call
-# ─────────────────────────────────────────────
-#cache
-import streamlit as st
 @st.cache_resource
 def load_pipeline():
@@ -40,214 +35,176 @@ def load_pipeline():
 build_quiz = load_pipeline()
-st.set_page_config(
-    page_title = APP_TITLE,
-    page_icon  = APP_ICON,
-    layout     = "centered",
-)
 # ─────────────────────────────────────────────
-#  SESSION STATE INITIALISATION
-#  st.session_state persists values across reruns.
-#  Think of it as the app's memory.
 # ─────────────────────────────────────────────
 def init_state():
     defaults = {
-        "screen"       : "input",   # "input" | "quiz" | "results"
-        "mcqs"         : [],        # list of MCQ objects
-        "current_q"    : 0,         # index of current question
-        "user_answers" : [],        # user's selected option indices
-        "quiz_result"  : None,      # scored result dict
     }
-    for key, val in defaults.items():
-        if key not in st.session_state:
-            st.session_state[key] = val
 init_state()
 # ─────────────────────────────────────────────
-#  HELPER: reset to start a new quiz
 # ─────────────────────────────────────────────
 def reset():
-    st.session_state.screen       = "input"
-    st.session_state.mcqs         = []
-    st.session_state.current_q    = 0
     st.session_state.user_answers = []
-    st.session_state.quiz_result  = None
 # ─────────────────────────────────────────────
 #  SCREEN 1: INPUT
-#  User pastes a passage and hits "Generate Quiz"
 # ─────────────────────────────────────────────
 def screen_input():
     st.title(f"{APP_ICON} {APP_TITLE}")
-    st.write("Paste any text passage below to automatically generate a quiz from it.")
-    st.info(
-        "**For best results**, use factual passages containing: "
-        "**people names, places, dates, organisations, or events.**  \n"
-        "Try: history, science, geography, biographies.  \n"
-        "Avoid opinion or purely descriptive text — they lack named facts."
-    )
-    st.markdown("---")
     passage = st.text_area(
-        label       = "Your passage",
-        placeholder = "Paste a paragraph or article here...",
-        height      = 250,
-        help        = "Minimum ~5 sentences recommended for best results.",
     )
     num_questions = st.slider(
-        label   = "Number of questions",
-        min_value = 3,
-        max_value = MAX_QUESTIONS,
-        value     = 5,
-        step      = 1,
     )
-    st.markdown("---")
-    if st.button("Generate Quiz", type="primary", use_container_width=True):
         if not passage or len(passage.split()) < 30:
-            st.warning("Please paste a longer passage (at least ~30 words).")
             return
-        with st.spinner("Generating questions... this may take 30–60 seconds on first run."):
             try:
                 mcqs = build_quiz(passage, num_questions=num_questions)
             except Exception as e:
-                st.error(f"Something went wrong: {e}")
                 return
         if not mcqs:
-            st.error("Could not generate questions from this passage. Try a different text.")
             return
-        # Store in session and move to quiz screen
-        st.session_state.mcqs         = mcqs
-        st.session_state.user_answers = [-1] * len(mcqs)  # -1 = unanswered
-        st.session_state.current_q    = 0
-        st.session_state.screen       = "quiz"
         st.rerun()
 # ─────────────────────────────────────────────
 #  SCREEN 2: QUIZ
-#  One question at a time, with navigation.
 # ─────────────────────────────────────────────
 def screen_quiz():
-    mcqs      = st.session_state.mcqs
-    current   = st.session_state.current_q
-    total     = len(mcqs)
-    mcq       = mcqs[current]
-    # Progress bar
-    st.progress((current) / total, text=f"Question {current+1} of {total}")
-    st.markdown("---")
-    # Render the question card (defined in components.py)
-    selected_label = render_question_card(mcq, current)
-    st.markdown("---")
     col1, col2, col3 = st.columns([1, 2, 1])
-    # Previous button
     with col1:
         if current > 0:
-            if st.button("← Previous"):
                 st.session_state.current_q -= 1
                 st.rerun()
-    # Next / Submit button
     with col3:
-        # Convert selected label (A/B/C/D) back to index
         if selected_label:
-            selected_index = ord(selected_label) - ord("A")
-            st.session_state.user_answers[current] = selected_index
         if current < total - 1:
             if st.button("Next →", type="primary"):
                 if selected_label is None:
-                    st.warning("Please select an answer before continuing.")
                 else:
                     st.session_state.current_q += 1
                     st.rerun()
         else:
-            # Last question — show Submit button
-            if st.button("Submit Quiz", type="primary"):
-                if selected_label is None:
-                    st.warning("Please select an answer before submitting.")
-                else:
-                    # Score the quiz
-                    result = score_quiz(
-                        st.session_state.mcqs,
-                        st.session_state.user_answers
-                    )
-                    st.session_state.quiz_result = result
-                    st.session_state.screen      = "results"
-                    st.rerun()
-    # Show quit option
     with col2:
-        if st.button("Quit Quiz", help="Return to the input screen"):
             reset()
             st.rerun()
 # ─────────────────────────────────────────────
 #  SCREEN 3: RESULTS
-#  Score summary + per-question breakdown
 # ─────────────────────────────────────────────
 def screen_results():
     result = st.session_state.quiz_result
-    st.title("Quiz Complete!")
-    st.markdown("---")
-    # Score summary banner
     render_score_summary(result)
-    st.markdown("---")
-    st.subheader("Question-by-question breakdown")
-    # Per-question result cards
     for i, r in enumerate(result["results"]):
         render_result_card(r, i + 1)
-    st.markdown("---")
     col1, col2 = st.columns(2)
     with col1:
-        if st.button("Try Another Passage", use_container_width=True):
             reset()
             st.rerun()
     with col2:
-        if st.button("Retake Same Quiz", type="primary", use_container_width=True):
-            # Reset answers but keep the same MCQs
             st.session_state.user_answers = [-1] * len(st.session_state.mcqs)
-            st.session_state.current_q    = 0
-            st.session_state.screen       = "quiz"
             st.rerun()
 # ─────────────────────────────────────────────
-#  ROUTER — picks which screen to show
 # ─────────────────────────────────────────────
 if st.session_state.screen == "input":
     screen_input()
 elif st.session_state.screen == "quiz":
     screen_quiz()
 elif st.session_state.screen == "results":
     screen_results()

 import streamlit as st
 import sys, os
+# ✅ FIX: Add project root first
 sys.path.append(os.path.dirname(os.path.dirname(os.path.abspath(__file__))))
 from config import APP_TITLE, APP_ICON, MAX_QUESTIONS
+# ✅ FIRST Streamlit call
+st.set_page_config(
+    page_title=APP_TITLE,
+    page_icon=APP_ICON,
+    layout="centered",
+)
+# Add project root to path
+sys.path.append(os.path.dirname(os.path.dirname(os.path.abspath(__file__))))
+from src.evaluator import score_quiz
+from app.components import (
+    render_question_card,
+    render_result_card,
+    render_score_summary
+)
+# ─────────────────────────────────────────────
+#  CACHE MODEL (important for performance)
+# ─────────────────────────────────────────────
 @st.cache_resource
 def load_pipeline():
 build_quiz = load_pipeline()
 # ─────────────────────────────────────────────
+#  SESSION STATE
 # ─────────────────────────────────────────────
 def init_state():
     defaults = {
+        "screen": "input",
+        "mcqs": [],
+        "current_q": 0,
+        "user_answers": [],
+        "quiz_result": None,
     }
+    for k, v in defaults.items():
+        if k not in st.session_state:
+            st.session_state[k] = v
 init_state()
 # ─────────────────────────────────────────────
+#  RESET
 # ─────────────────────────────────────────────
 def reset():
+    st.session_state.screen = "input"
+    st.session_state.mcqs = []
+    st.session_state.current_q = 0
     st.session_state.user_answers = []
+    st.session_state.quiz_result = None
 # ─────────────────────────────────────────────
 #  SCREEN 1: INPUT
 # ─────────────────────────────────────────────
 def screen_input():
     st.title(f"{APP_ICON} {APP_TITLE}")
+    st.write("Paste text to generate MCQs")
     passage = st.text_area(
+        "Your passage",
+        height=250,
+        placeholder="Paste content here..."
     )
     num_questions = st.slider(
+        "Number of questions",
+        3,
+        MAX_QUESTIONS,
+        5
     )
+    if st.button("Generate Quiz", type="primary"):
         if not passage or len(passage.split()) < 30:
+            st.warning("Enter at least 30 words")
             return
+        with st.spinner("Generating questions..."):
             try:
                 mcqs = build_quiz(passage, num_questions=num_questions)
             except Exception as e:
+                st.error(f"Error: {e}")
                 return
         if not mcqs:
+            st.error("Failed to generate questions")
             return
+        st.session_state.mcqs = mcqs
+        st.session_state.user_answers = [-1] * len(mcqs)
+        st.session_state.current_q = 0
+        st.session_state.screen = "quiz"
         st.rerun()
 # ─────────────────────────────────────────────
 #  SCREEN 2: QUIZ
 # ─────────────────────────────────────────────
 def screen_quiz():
+    mcqs = st.session_state.mcqs
+    current = st.session_state.current_q
+    total = len(mcqs)
+    mcq = mcqs[current]
+    st.progress(current / total, text=f"Q {current+1}/{total}")
+    selected_label = render_question_card(mcq, current)
     col1, col2, col3 = st.columns([1, 2, 1])
+    # Previous
     with col1:
         if current > 0:
+            if st.button("← Prev"):
                 st.session_state.current_q -= 1
                 st.rerun()
+    # Next / Submit
     with col3:
         if selected_label:
+            idx = ord(selected_label) - ord("A")
+            st.session_state.user_answers[current] = idx
         if current < total - 1:
             if st.button("Next →", type="primary"):
                 if selected_label is None:
+                    st.warning("Select an answer")
                 else:
                     st.session_state.current_q += 1
                     st.rerun()
         else:
+            if st.button("Submit", type="primary"):
+                result = score_quiz(
+                    st.session_state.mcqs,
+                    st.session_state.user_answers
+                )
+                st.session_state.quiz_result = result
+                st.session_state.screen = "results"
+                st.rerun()
+    # Quit
     with col2:
+        if st.button("Quit"):
             reset()
             st.rerun()
 # ─────────────────────────────────────────────
 #  SCREEN 3: RESULTS
 # ─────────────────────────────────────────────
 def screen_results():
     result = st.session_state.quiz_result
+    st.title("Quiz Complete")
     render_score_summary(result)
     for i, r in enumerate(result["results"]):
         render_result_card(r, i + 1)
     col1, col2 = st.columns(2)
     with col1:
+        if st.button("New Quiz"):
             reset()
             st.rerun()
     with col2:
+        if st.button("Retry"):
             st.session_state.user_answers = [-1] * len(st.session_state.mcqs)
+            st.session_state.current_q = 0
+            st.session_state.screen = "quiz"
             st.rerun()
 # ─────────────────────────────────────────────
+#  ROUTER
 # ─────────────────────────────────────────────
 if st.session_state.screen == "input":
     screen_input()
 elif st.session_state.screen == "quiz":
     screen_quiz()
 elif st.session_state.screen == "results":
     screen_results()

finetune_t5_file.ipynb CHANGED Viewed

The diff for this file is too large to render. See raw diff

requirements.txt CHANGED Viewed

Binary files a/requirements.txt and b/requirements.txt differ

runtime.txt DELETED Viewed

	@@ -1 +0,0 @@
1	- python-3.11

setup.sh DELETED Viewed

	@@ -1 +0,0 @@
1	- python -m spacy download en_core_web_sm

src/question_generator.py CHANGED Viewed

@@ -10,6 +10,18 @@ from transformers import pipeline
 import re
 import sys, os
 sys.path.append(os.path.dirname(os.path.dirname(os.path.abspath(__file__))))
 from config import QG_MODEL_NAME, MAX_QUESTIONS

 import re
 import sys, os
+import streamlit as st
+@st.cache_resource
+def load_model():
+    tokenizer = AutoTokenizer.from_pretrained("valhalla/t5-small-qg-hl", use_fast=False)
+    model     = T5ForConditionalGeneration.from_pretrained("valhalla/t5-small-qg-hl")
+    model.eval()
+    return tokenizer, model
+tokenizer, qg_model = load_model()
 sys.path.append(os.path.dirname(os.path.dirname(os.path.abspath(__file__))))
 from config import QG_MODEL_NAME, MAX_QUESTIONS