TensorCat
/

TensorTalk

Model card Files Files and versions

xet

Community

TensorCat commited on Apr 14

Commit

36f82a2

verified ·

1 Parent(s): 052d67e

Update README.md

Browse files

Files changed (1) hide show

README.md +250 -3

README.md CHANGED Viewed

@@ -1,3 +1,250 @@
----
-license: apache-2.0
----

+---
+license: apache-2.0
+---
+# TensorTalk / UM_Handbook
+TensorTalk is a handbook-grounded academic chat assistant built for the **Faculty of Computer Science and Information Technology, Universiti Malaya (UM)**.
+This project focuses on turning UM handbook content into a usable question-answering system through:
+- handbook preprocessing
+- source chunk construction
+- supervised QA dataset building
+- Qwen3-8B LoRA fine-tuning
+- merged-model deployment
+- a browser-style HTML chat demo
+---
+## Project Goal
+The main goal of this project is to build a handbook-based assistant that can answer student questions using information learned from the UM handbook domain.
+The current version is designed around:
+- undergraduate and postgraduate handbook content
+- handbook-faithful answers
+- concise student-facing responses
+- a local/demo deployment workflow on DICC and notebook environments
+This project is also intended to support a broader experimental pipeline:
+- **Baseline 1:** closed-book supervised fine-tuning
+- **Baseline 2:** retrieval-augmented version for later comparison
+---
+## What This Project Contains
+### 1. Dataset Preparation
+The project includes scripts and resources for preparing handbook data before fine-tuning:
+- handbook markdown preprocessing
+- source chunk dataset building
+- SFT QA dataset construction
+- configuration management for the preprocessing and dataset pipeline
+### 2. Fine-Tuning Workflow
+The model training workflow uses a Qwen3-8B base model with LoRA-based fine-tuning on the UM handbook QA dataset.
+The fine-tuning workflow includes:
+- notebook-based training on DICC
+- device-aware loading logic
+- train / validation / test style evaluation workflow
+- merged-model export for direct inference
+- LoRA adapter export for optional PEFT-based reuse
+- metrics and prediction file generation
+### 3. Deployment Demo
+The project includes a notebook-based HTML chat UI called **TensorTalk**.
+The demo provides:
+- a browser-style chat layout
+- a handbook-focused system prompt
+- merged-model loading for direct inference
+- a student-facing question-answer workflow
+- a simple deployment path for demonstration purposes
+---
+## Current Project Structure
+```text
+UM_Handbook/
+├── Dataset/
+│   └── SFT_Dataset/
+│       ├── SFT_QA_Training_Ready.jsonl
+│       ├── SFT_QA_Training_Ready_pretty.json
+│       ├── SFT_QA_Metadata.jsonl
+│       └── SFT_QA_Metadata_pretty.json
+├── assets/
+├── outputs/
+│   └── qwen3_um_handbook_optimized_1/
+│       ├── lora_adapter/
+│       ├── merged_model/
+│       ├── trainer_runs/
+│       ├── test_eval_runs/
+│       ├── dataset_split_summary.json
+│       ├── final_metrics.json
+│       ├── test_predictions.jsonl
+│       └── validation_predictions.jsonl
+├── FineTune_QWEN3_UM_Handbook_optimized_1.ipynb
+├── UM_Handbook_Markdown_Preprocess.py
+├── UM_SFT_QA_Dataset_Builder_from_Index.py
+├── UM_Source_Chunk_Dataset_Builder.py
+└── um_handbook_config.py
+```
+---
+## Key Files
+### Training and Data
+- `Dataset/SFT_Dataset/SFT_QA_Training_Ready.jsonl`
+  Main SFT training dataset used for handbook QA fine-tuning.
+- `UM_Handbook_Markdown_Preprocess.py`
+  Preprocesses handbook markdown / extracted source text.
+- `UM_Source_Chunk_Dataset_Builder.py`
+  Builds source chunks for downstream dataset and retrieval-related use.
+- `UM_SFT_QA_Dataset_Builder_from_Index.py`
+  Builds the supervised QA dataset from curated handbook content.
+- `um_handbook_config.py`
+  Central configuration file for paths and data-processing settings.
+### Training Output
+- `outputs/qwen3_um_handbook_optimized_1/merged_model/`
+  Main inference-ready model directory.
+  This is the directory used by the demo chat UI.
+- `outputs/qwen3_um_handbook_optimized_1/lora_adapter/`
+  LoRA adapter weights.
+  This is useful for PEFT-style loading with a base model, but it is not the primary path used by the current demo UI.
+- `outputs/qwen3_um_handbook_optimized_1/final_metrics.json`
+  Final evaluation summary.
+- `outputs/qwen3_um_handbook_optimized_1/validation_predictions.jsonl`
+  Validation-set generated answers for inspection.
+- `outputs/qwen3_um_handbook_optimized_1/test_predictions.jsonl`
+  Test-set generated answers for inspection.
+### Demo
+- `FineTune_QWEN3_UM_Handbook_optimized_1.ipynb`
+  Main notebook that contains the fine-tuning workflow and the TensorTalk HTML chat demo.
+---
+## Model Artifact Notes
+This project may contain several model-related outputs. They are not all used in the same way.
+### `merged_model/`
+This is the most important deployment artifact for the current demo.
+Use this when:
+- running the current TensorTalk HTML chat UI
+- loading the fine-tuned model directly with Hugging Face `from_pretrained(...)`
+- sharing the main inference-ready model
+### `lora_adapter/`
+This contains LoRA delta weights only.
+Use this when:
+- loading the adapter on top of the original base model
+- reusing the fine-tuning result in a PEFT workflow
+- experimenting with a smaller transferable fine-tuning artifact
+### `.pt` exported model file
+If present, the `.pt` file is mainly a saved full-model artifact / backup export.
+Use this when:
+- archiving the full fine-tuned weights
+- running a custom loading workflow that explicitly expects a `.pt` file
+For the current TensorTalk chat UI, the primary runtime artifact is still **`merged_model/`**.
+---
+## Current Demo Behavior
+The current demo is designed to answer questions such as:
+- dress code and appearance guidance
+- programme core courses / credit requirements
+- undergraduate vs postgraduate handbook information
+- academic rules and handbook-supported policy questions
+The answer style is intended to be:
+- handbook-grounded
+- short and direct
+- student-facing
+- non-speculative
+---
+## Example Demo Output
+The screenshot below shows the current TensorTalk chat interface running with the fine-tuned UM handbook model.
+![TensorTalk Demo](assets/tensortalk_demo_chat.jpg)
+---
+## Repository Preview
+The screenshot below shows the current top-level project layout.
+![Repository Structure](assets/repo_structure.png)
+---
+## Suggested Minimal Deployment Package
+If the goal is only to demonstrate the chat UI to teammates, the minimal useful set is:
+- `merged_model/`
+- the chat notebook / UI code
+- optional avatar image under `assets/`
+The following items are not required for a simple demo run:
+- intermediate training checkpoints
+- test evaluation run directories
+- optional full `.pt` export
+- raw training logs not used by the demo
+---
+## Notes
+- The project is organized so that **Dataset**, **models / outputs**, and **demo code** remain separate.
+- The current demo is notebook-friendly and was prepared around a DICC workflow.
+- The deployment path prioritizes clarity and reproducibility over a heavyweight full-stack application setup.
+---
+## Status
+Current project status:
+- handbook preprocessing pipeline prepared
+- supervised QA dataset prepared
+- LoRA fine-tuning workflow completed
+- merged model exported
+- TensorTalk HTML chat demo running
+- evaluation outputs generated
+---
+## Author / Project Name
+**TensorTalk**
+UM Handbook QA / Fine-Tuned Qwen3-8B LoRA Project