ContextFlow: Predictive Doubt Detection in Adaptive Learning Systems

A Research Implementation of RL-Powered Educational Technology

Property	Value
Algorithm	GRPO + Q-Learning
State Dimension	64 features
Action Dimension	10 doubt predictions
Policy Version	50
Training Samples	200
Final Loss	0.2465
Avg Reward	0.75

Overview

ContextFlow predicts student confusion before it occurs using reinforcement learning and behavioral signal analysis. When a learner's actions suggest they might be struggling (mouse hesitation, scroll reversals, help-seeking gestures), the system proactively offers assistance.

Architecture

┌─────────────────────────────────────────────────┐
│           9 Specialized Agents                   │
├─────────────────────────────────────────────────┤
│ • StudyOrchestrator  • DoubtPredictorAgent      │
│ • BehavioralAgent    • HandGestureAgent          │
│ • RecallAgent       • KnowledgeGraphAgent        │
│ • PeerLearningAgent  • LLMOrchestrator          │
│ • GestureActionMapper • PromptAgent              │
└─────────────────────────────────────────────────┘

Quick Start

# Load the model
from huggingface_hub import hf_hub_download
import pickle

path = hf_hub_download(
    repo_id='namish10/contextflow-rl',
    filename='checkpoint.pkl'
)
with open(path, 'rb') as f:
    checkpoint = pickle.load(f)

print(f"Policy version: {checkpoint.policy_version}")
print(f"Training samples: {checkpoint.training_stats['total_samples']}")

State Vector (64 dimensions)

Component	Dims	Description
Topic Embedding	32	TF-IDF of learning topic
Progress	1	Session progress (0.0-1.0)
Confusion Signals	16	Behavioral indicators
Gesture Signals	14	Hand gesture frequencies
Time Spent	1	Normalized session time

Actions (10 doubt predictions)

what_is_backpropagation
why_gradient_descent
how_overfitting_works
explain_regularization
what_loss_function
how_optimization_works
explain_learning_rate
what_regularization
how_batch_norm_works
explain_softmax

Training Results

Epoch	Loss	Epsilon	Avg Reward
1	1.2456	1.000	0.20
2	0.8923	0.995	0.35
3	0.6541	0.990	0.48
4	0.4127	0.985	0.62
5	0.2465	0.980	0.75

Key Features

Predictive Detection: RL-based confusion prediction before it happens
Multi-Agent Orchestration: 9 specialized agents working together
Gesture Recognition: Privacy-first hand gesture detection with MediaPipe
Face Blurring: Real-time face blur for classroom deployment
Browser AI Launch: Direct AI chat interface from predicted doubts
Spaced Repetition: SM-2 based review scheduling
Knowledge Graphs: Concept mapping and learning paths

Files

File	Description
`checkpoint.pkl`	Trained Q-network weights
`train_rl.py`	Training script with GRPO
`feature_extractor.py`	64-dim state extraction
`inference_example.py`	Usage examples
`demo.ipynb`	Interactive notebook
`RESEARCH_PAPER.md`	Full research paper
`evaluation_results.json`	Training metrics
`requirements.txt`	Dependencies
`app/`	Backend agents (Flask API)
`frontend/`	React frontend

Evaluation

See EVALUATION.md for detailed metrics and production readiness assessment.

Citation

@software{contextflow,
  title={ContextFlow: Predictive Doubt Detection in Adaptive Learning Systems},
  author={ContextFlow Team},
  year={2026},
  version={1.0},
  url={https://huggingface.co/namish10/contextflow-rl}
}

Downloads last month: -; Downloads are not tracked for this model. How to track

Video Preview

Reinforcement Learning