Spaces:
Sleeping
Sleeping
A newer version of the Gradio SDK is available: 6.12.0
Project Submission Details
π Project Name
AnatomyBot - MBBS Anatomy Tutor
π What I want to showcase
I want to showcase an intelligent, interactive educational platform designed specifically for MBBS (medical) students. The goal is to bridge the gap between static textbooks and active learning by providing:
- Visual & Contextual Learning: Instantly fetching relevant anatomical diagrams alongside AI-generated educational summaries.
- Active Recall & Simulation: A "VIVA Mode" that simulates an oral anatomy exam with a strict but fair AI professor, complete with voice interaction.
- Resource Augmentation: A "Book Mode" that allows students to upload their own textbooks and get AI-powered analysis and quiz questions for specific pages.
π οΈ What I've built so far
I have built a fully functional Gradio web application that integrates multiple powerful AI services:
Smart Query Engine:
- Uses Hyperbolic AI (Llama 3.3 70B) to validate if questions are strictly anatomy-related.
- Fetches high-quality diagrams using SERPAPI (Google Images), filtering out non-diagrammatic results.
- Generates structured, emoji-rich educational content (Location, Features, Clinical Significance).
VIVA Training Mode:
- Simulates a real oral exam environment.
- Generates progressive questions (from basic relations to clinical scenarios).
- Voice Interaction: Uses ElevenLabs to vocalize questions, making the experience immersive.
- AI Evaluation: Grades student answers (Distinction, Pass, Borderline, Fail) and provides detailed feedback with reference answers.
Book Learning Mode:
- Allows PDF textbook uploads.
- Extracts and displays specific pages.
- Analyzes page content to generate summaries, clinical points, and self-test questions.
Student System:
- Includes a registration system (SQLite backend) to track student details (Name, Medical School, Year).
π» Tech Stack
- Frontend/UI: Gradio (Python)
- LLM Intelligence: Hyperbolic AI API (Meta Llama 3.3 70B Instruct)
- Image Search: SERPAPI (Google Images Engine)
- Voice Synthesis: ElevenLabs API (Text-to-Speech)
- Database: SQLite (for user management)
- Image/PDF Processing: Pillow, PyPDF2, pdf2image
- Environment: Python, Hugging Face Spaces