Text Generation
Transformers
Safetensors
English
Chinese
qwen3
qwen3-8b
lora
qlora
sft
rag
faiss
dense-retrieval
agent
ppo
rlhf
rule-reward
harness-engineering
um-handbook
question-answering
chatbot
education
tensor-talk
Instructions to use TensorCat/TensorTalk with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- Transformers
How to use TensorCat/TensorTalk with Transformers:
# Use a pipeline as a high-level helper from transformers import pipeline pipe = pipeline("text-generation", model="TensorCat/TensorTalk")# Load model directly from transformers import AutoModel model = AutoModel.from_pretrained("TensorCat/TensorTalk", dtype="auto") - Notebooks
- Google Colab
- Kaggle
- Local Apps
- vLLM
How to use TensorCat/TensorTalk with vLLM:
Install from pip and serve model
# Install vLLM from pip: pip install vllm # Start the vLLM server: vllm serve "TensorCat/TensorTalk" # Call the server using curl (OpenAI-compatible API): curl -X POST "http://localhost:8000/v1/completions" \ -H "Content-Type: application/json" \ --data '{ "model": "TensorCat/TensorTalk", "prompt": "Once upon a time,", "max_tokens": 512, "temperature": 0.5 }'Use Docker
docker model run hf.co/TensorCat/TensorTalk
- SGLang
How to use TensorCat/TensorTalk with SGLang:
Install from pip and serve model
# Install SGLang from pip: pip install sglang # Start the SGLang server: python3 -m sglang.launch_server \ --model-path "TensorCat/TensorTalk" \ --host 0.0.0.0 \ --port 30000 # Call the server using curl (OpenAI-compatible API): curl -X POST "http://localhost:30000/v1/completions" \ -H "Content-Type: application/json" \ --data '{ "model": "TensorCat/TensorTalk", "prompt": "Once upon a time,", "max_tokens": 512, "temperature": 0.5 }'Use Docker images
docker run --gpus all \ --shm-size 32g \ -p 30000:30000 \ -v ~/.cache/huggingface:/root/.cache/huggingface \ --env "HF_TOKEN=<secret>" \ --ipc=host \ lmsysorg/sglang:latest \ python3 -m sglang.launch_server \ --model-path "TensorCat/TensorTalk" \ --host 0.0.0.0 \ --port 30000 # Call the server using curl (OpenAI-compatible API): curl -X POST "http://localhost:30000/v1/completions" \ -H "Content-Type: application/json" \ --data '{ "model": "TensorCat/TensorTalk", "prompt": "Once upon a time,", "max_tokens": 512, "temperature": 0.5 }' - Docker Model Runner
How to use TensorCat/TensorTalk with Docker Model Runner:
docker model run hf.co/TensorCat/TensorTalk
Upload 50 files
Browse filesThis view is limited to 50 files because it contains too many changes. See raw diff
- .gitattributes +3 -0
- UM_Handbook/outputs/baseline2_rag_harness_agent/dataset_split_summary.json +14 -0
- UM_Handbook/outputs/baseline2_rag_harness_agent/generation_eval/generation_details.jsonl +20 -0
- UM_Handbook/outputs/baseline2_rag_harness_agent/generation_eval/generation_metrics.json +10 -0
- UM_Handbook/outputs/baseline2_rag_harness_agent/lora_adapter/README.md +209 -0
- UM_Handbook/outputs/baseline2_rag_harness_agent/lora_adapter/adapter_config.json +46 -0
- UM_Handbook/outputs/baseline2_rag_harness_agent/lora_adapter/adapter_model.safetensors +3 -0
- UM_Handbook/outputs/baseline2_rag_harness_agent/lora_adapter/chat_template.jinja +89 -0
- UM_Handbook/outputs/baseline2_rag_harness_agent/lora_adapter/tokenizer.json +3 -0
- UM_Handbook/outputs/baseline2_rag_harness_agent/lora_adapter/tokenizer_config.json +29 -0
- UM_Handbook/outputs/baseline2_rag_harness_agent/lora_adapter/training_args.bin +3 -0
- UM_Handbook/outputs/baseline2_rag_harness_agent/rag_augmented_dataset/rag_augmented_sft_dataset.jsonl +0 -0
- UM_Handbook/outputs/baseline2_rag_harness_agent/rag_augmented_dataset/rag_augmented_sft_dataset_preview.json +212 -0
- UM_Handbook/outputs/baseline2_rag_harness_agent/rag_augmented_dataset/rag_text_length_summary.json +11 -0
- UM_Handbook/outputs/baseline2_rag_harness_agent/retrieval_eval/retrieval_details.jsonl +0 -0
- UM_Handbook/outputs/baseline2_rag_harness_agent/retrieval_eval/retrieval_metrics.json +10 -0
- UM_Handbook/outputs/baseline2_rag_harness_agent/trainer_runs/README.md +57 -0
- UM_Handbook/outputs/baseline2_rag_harness_agent/trainer_runs/checkpoint-1100/README.md +209 -0
- UM_Handbook/outputs/baseline2_rag_harness_agent/trainer_runs/checkpoint-1100/adapter_config.json +46 -0
- UM_Handbook/outputs/baseline2_rag_harness_agent/trainer_runs/checkpoint-1100/adapter_model.safetensors +3 -0
- UM_Handbook/outputs/baseline2_rag_harness_agent/trainer_runs/checkpoint-1100/chat_template.jinja +89 -0
- UM_Handbook/outputs/baseline2_rag_harness_agent/trainer_runs/checkpoint-1100/optimizer.pt +3 -0
- UM_Handbook/outputs/baseline2_rag_harness_agent/trainer_runs/checkpoint-1100/rng_state.pth +3 -0
- UM_Handbook/outputs/baseline2_rag_harness_agent/trainer_runs/checkpoint-1100/scheduler.pt +3 -0
- UM_Handbook/outputs/baseline2_rag_harness_agent/trainer_runs/checkpoint-1100/tokenizer.json +3 -0
- UM_Handbook/outputs/baseline2_rag_harness_agent/trainer_runs/checkpoint-1100/tokenizer_config.json +29 -0
- UM_Handbook/outputs/baseline2_rag_harness_agent/trainer_runs/checkpoint-1100/trainer_state.json +604 -0
- UM_Handbook/outputs/baseline2_rag_harness_agent/trainer_runs/checkpoint-1100/training_args.bin +3 -0
- UM_Handbook/outputs/baseline2_rag_harness_agent/trainer_runs/checkpoint-1300/README.md +209 -0
- UM_Handbook/outputs/baseline2_rag_harness_agent/trainer_runs/checkpoint-1300/adapter_config.json +46 -0
- UM_Handbook/outputs/baseline2_rag_harness_agent/trainer_runs/checkpoint-1300/adapter_model.safetensors +3 -0
- UM_Handbook/outputs/baseline2_rag_harness_agent/trainer_runs/checkpoint-1300/chat_template.jinja +89 -0
- UM_Handbook/outputs/baseline2_rag_harness_agent/trainer_runs/checkpoint-1300/optimizer.pt +3 -0
- UM_Handbook/outputs/baseline2_rag_harness_agent/trainer_runs/checkpoint-1300/rng_state.pth +3 -0
- UM_Handbook/outputs/baseline2_rag_harness_agent/trainer_runs/checkpoint-1300/scheduler.pt +3 -0
- UM_Handbook/outputs/baseline2_rag_harness_agent/trainer_runs/checkpoint-1300/tokenizer.json +3 -0
- UM_Handbook/outputs/baseline2_rag_harness_agent/trainer_runs/checkpoint-1300/tokenizer_config.json +29 -0
- UM_Handbook/outputs/baseline2_rag_harness_agent/trainer_runs/checkpoint-1300/trainer_state.json +706 -0
- UM_Handbook/outputs/baseline2_rag_harness_agent/trainer_runs/checkpoint-1300/training_args.bin +3 -0
- UM_Handbook/outputs/baseline2_rag_harness_agent/trainer_runs/hf_dataset_dict/dataset_dict.json +1 -0
- UM_Handbook/outputs/baseline2_rag_harness_agent/trainer_runs/hf_dataset_dict/test/data-00000-of-00001.arrow +3 -0
- UM_Handbook/outputs/baseline2_rag_harness_agent/trainer_runs/hf_dataset_dict/test/dataset_info.json +89 -0
- UM_Handbook/outputs/baseline2_rag_harness_agent/trainer_runs/hf_dataset_dict/test/state.json +13 -0
- UM_Handbook/outputs/baseline2_rag_harness_agent/trainer_runs/hf_dataset_dict/train/data-00000-of-00001.arrow +3 -0
- UM_Handbook/outputs/baseline2_rag_harness_agent/trainer_runs/hf_dataset_dict/train/dataset_info.json +89 -0
- UM_Handbook/outputs/baseline2_rag_harness_agent/trainer_runs/hf_dataset_dict/train/state.json +13 -0
- UM_Handbook/outputs/baseline2_rag_harness_agent/trainer_runs/hf_dataset_dict/validation/data-00000-of-00001.arrow +3 -0
- UM_Handbook/outputs/baseline2_rag_harness_agent/trainer_runs/hf_dataset_dict/validation/dataset_info.json +89 -0
- UM_Handbook/outputs/baseline2_rag_harness_agent/trainer_runs/hf_dataset_dict/validation/state.json +13 -0
- UM_Handbook/outputs/baseline2_rag_harness_agent/trainer_runs/train_metrics.json +7 -0
.gitattributes
CHANGED
|
@@ -40,3 +40,6 @@ UM_Handbook/assets/tensortalk_demo_chat.jpg filter=lfs diff=lfs merge=lfs -text
|
|
| 40 |
UM_Handbook/outputs/qwen3_um_handbook_baseline2_rag_a100/lora_adapter/tokenizer.json filter=lfs diff=lfs merge=lfs -text
|
| 41 |
UM_Handbook/outputs/qwen3_um_handbook_baseline2_rag_a100/trainer_runs/checkpoint-1000/tokenizer.json filter=lfs diff=lfs merge=lfs -text
|
| 42 |
UM_Handbook/outputs/qwen3_um_handbook_baseline2_rag_a100/trainer_runs/checkpoint-900/tokenizer.json filter=lfs diff=lfs merge=lfs -text
|
|
|
|
|
|
|
|
|
|
|
|
| 40 |
UM_Handbook/outputs/qwen3_um_handbook_baseline2_rag_a100/lora_adapter/tokenizer.json filter=lfs diff=lfs merge=lfs -text
|
| 41 |
UM_Handbook/outputs/qwen3_um_handbook_baseline2_rag_a100/trainer_runs/checkpoint-1000/tokenizer.json filter=lfs diff=lfs merge=lfs -text
|
| 42 |
UM_Handbook/outputs/qwen3_um_handbook_baseline2_rag_a100/trainer_runs/checkpoint-900/tokenizer.json filter=lfs diff=lfs merge=lfs -text
|
| 43 |
+
UM_Handbook/outputs/baseline2_rag_harness_agent/lora_adapter/tokenizer.json filter=lfs diff=lfs merge=lfs -text
|
| 44 |
+
UM_Handbook/outputs/baseline2_rag_harness_agent/trainer_runs/checkpoint-1100/tokenizer.json filter=lfs diff=lfs merge=lfs -text
|
| 45 |
+
UM_Handbook/outputs/baseline2_rag_harness_agent/trainer_runs/checkpoint-1300/tokenizer.json filter=lfs diff=lfs merge=lfs -text
|
UM_Handbook/outputs/baseline2_rag_harness_agent/dataset_split_summary.json
ADDED
|
@@ -0,0 +1,14 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
{
|
| 2 |
+
"dataset_path": "/scr/user/kevin2002/TensorCat/NLP/UM_Handbook/Dataset/SFT_Dataset/SFT_QA_Training_Ready.jsonl",
|
| 3 |
+
"metadata_path": "/scr/user/kevin2002/TensorCat/NLP/UM_Handbook/Dataset/SFT_Dataset/SFT_QA_Metadata.jsonl",
|
| 4 |
+
"rag_kb_path": "/scr/user/kevin2002/TensorCat/NLP/UM_Handbook/Dataset/RAG/UM_RAG_Knowledge_Base.jsonl",
|
| 5 |
+
"rag_augmented_dataset_path": "/scr/user/kevin2002/TensorCat/NLP/UM_Handbook/outputs/baseline2_rag_harness_agent/rag_augmented_dataset/rag_augmented_sft_dataset.jsonl",
|
| 6 |
+
"total_examples": 1000,
|
| 7 |
+
"train_examples": 800,
|
| 8 |
+
"validation_examples": 100,
|
| 9 |
+
"test_examples": 100,
|
| 10 |
+
"seed": 42,
|
| 11 |
+
"train_ratio": 0.8,
|
| 12 |
+
"validation_ratio": 0.1,
|
| 13 |
+
"test_ratio": 0.1
|
| 14 |
+
}
|
UM_Handbook/outputs/baseline2_rag_harness_agent/generation_eval/generation_details.jsonl
ADDED
|
@@ -0,0 +1,20 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
{"qa_id": "qa_000031", "index_id": "UMI-0031", "query": "What are the programme goals, objectives, or learning outcomes for Master in Data Science?", "gold_answer": "The programme objectives describe data scientists who can extract meaningful insights, deliver effective data science solutions, sustain lifelong learning with an entrepreneurial mindset, and lead or contribute responsibly in public or private organisations. The learning outcomes highlight mastery of important data science concepts, recommending innovative data science and scientific computing solutions, constructing efficient data science tools, communicating with stakeholders, publishing and presenting solutions, using digital and analytic skills to acquire and interpret knowledge, leadership, lifelong learning, entrepreneurship, and professional ethics.", "plain_answer": "<think>\n\n</think>\n\nThe programme learning outcomes for Master in Data Science include constructing mathematical models for data science problem solving, applying programming and data science techniques to collect, clean, visualise, and analyse data, combining domain knowledge with data analytic techniques to recommend or design data-driven solutions, using communication and teamwork skills to deliver data science solutions through appropriate technical and soft skills, and demonstrating ethical practice in data science.", "rag_answer": "<think>\n\n</think>\n\nThe programme objectives describe data scientists who can extract meaningful insights, deliver effective data science solutions, sustain lifelong learning with an entrepreneurial mindset, and lead or contribute responsibly in public or private organisations. The learning outcomes highlight mastery of important data science concepts, recommending innovative data science and scientific computing solutions, constructing efficient data science tools, communicating with stakeholders, publishing and presenting solutions, using digital and analytic skills to acquire and interpret knowledge, leadership, lifelong learning, entrepreneurship, and professional ethics.", "plain_exact_match": 0.0, "plain_token_f1": 0.4, "rag_exact_match": 0.0, "rag_token_f1": 0.9877, "retrieved_kb_ids": ["RAGKB-0427", "RAGKB-0428", "RAGKB-0429"], "retrieved_sections": ["Master in Data Science", "Master in Data Science", "Master in Data Science"], "retrieved_scopes": ["postgraduate", "postgraduate", "postgraduate"]}
|
| 2 |
+
{"qa_id": "qa_000883", "index_id": "UMI-0883", "query": "Ngl, what distinctive specialization focus is highlighted for Bachelor of Computer Science (Information Systems)?", "gold_answer": "The distinctive specialization focus is Information Systems, with deeper emphasis on Information Systems techniques for analysing information, designing solutions, and developing computer-based systems.", "plain_answer": "<think>\n\n</think>\n\nThe Bachelor of Computer Science (Information Systems) focus is highlighted as a specialization in Information Systems.", "rag_answer": "<think>\n\n</think>\n\nThe distinctive specialization focus is Information Systems, with deeper emphasis on Information Systems techniques for analysing information, designing solutions, and developing computer-based systems.", "plain_exact_match": 0.0, "plain_token_f1": 0.4286, "rag_exact_match": 0.0, "rag_token_f1": 0.96, "retrieved_kb_ids": ["RAGKB-0336", "RAGKB-0338", "RAGKB-0341"], "retrieved_sections": ["Undergraduate Programme Goals and Learning Outcomes", "Undergraduate Programme Goals and Learning Outcomes", "Undergraduate Programme Goals and Learning Outcomes"], "retrieved_scopes": ["undergraduate", "undergraduate", "undergraduate"]}
|
| 3 |
+
{"qa_id": "qa_000387", "index_id": "UMI-0387", "query": "What policy links, regulations, or university rule documents are referenced for undergraduate students?", "gold_answer": "The undergraduate rules section refers students to the University of Malaya Bachelor's Degree rules and regulations, together with related university policy and administrative references.", "plain_answer": "<think>\n\n</think>\n\nThe undergraduate policy references include the Universiti Malaya (Undergraduate) Rules 2019, as well as related university regulations and guidelines linked in the policy section.", "rag_answer": "<think>\n\n</think>\n\nThe undergraduate rules section refers students to the University of Malaya Bachelor's Degree rules and regulations, together with related university policy and administrative references.", "plain_exact_match": 0.0, "plain_token_f1": 0.4706, "rag_exact_match": 0.0, "rag_token_f1": 0.9615, "retrieved_kb_ids": ["RAGKB-0320", "RAGKB-0142", "RAGKB-0049"], "retrieved_sections": ["Undergraduate Rules and Regulations", "Undergraduate Faculty Identity", "Postgraduate General Information"], "retrieved_scopes": ["undergraduate", "undergraduate", "postgraduate"]}
|
| 4 |
+
{"qa_id": "qa_000254", "index_id": "UMI-0254", "query": "What major milestones appear in the Graduate on Time schedule for Master of Computer Science (By Research)?", "gold_answer": "The Graduate on Time schedule for Master of Computer Science (By Research) lays out the expected semester-by-semester milestones for timely completion, including candidature progression, research-related stages, and expected outputs where applicable.", "plain_answer": "<think>\n\n</think>\n\nThe retrieved evidence shows that the Graduate on Time schedule for Master of Computer Science (By Research) includes milestones such as Progress Report I at the end of Semester I, Progress Report II at the end of Semester II, Proposal Presentation and Proposal Review Board for the end of Semester II or the start of Semester III, Examination of Completion for the end of Semester III or the start of Semester IV, and submission of the dissertation.", "rag_answer": "<think>\n\n</think>\n\nThe Graduate on Time schedule for Master of Computer Science (By Research) shows a staged completion path that begins with research confirmation, includes milestones such as proposal development and defence, dissertation editing, feedback incorporation, dissertation submission, progress report submission, and final completion after external examiner assessment.", "plain_exact_match": 0.0, "plain_token_f1": 0.3363, "rag_exact_match": 0.0, "rag_token_f1": 0.3902, "retrieved_kb_ids": ["RAGKB-0509", "RAGKB-0510", "RAGKB-0374"], "retrieved_sections": ["Master of Computer Science (By Research)", "Master of Computer Science (By Research)", "Master of Computer Science (Applied Computing)"], "retrieved_scopes": ["postgraduate", "postgraduate", "postgraduate"]}
|
| 5 |
+
{"qa_id": "qa_000129", "index_id": "UMI-0129", "query": "When was the Faculty of Computer Science and Information Technology officially formed?", "gold_answer": "The Universiti Malaya Council agreed to the formation of FCSIT on 22 September 1994.", "plain_answer": "<think>\n\n[Retrieved Evidence 1]\nSource document: General Handbook\nScope: general\nSection: History of FCSIT\nSubsection: Formation and Early History\nPages: 10, 11\nKnowledge group: KG-0012\nChunk id: SC-00003\nText:\nformed of 53 students from the 1978/1979 Session. The first undergraduate programme offered by the Department of Computer Centre was the Bachelor of Science in Computer Studies. In 1980, the University Senate agreed to the establishment of the Computer Centre as an", "rag_answer": "<think>\n\n</think>\n\nThe Universiti Malaya Council agreed to the formation of FCSIT on 22 September 1994.", "plain_exact_match": 0.0, "plain_token_f1": 0.1591, "rag_exact_match": 0.0, "rag_token_f1": 0.9333, "retrieved_kb_ids": ["RAGKB-0003", "RAGKB-0004", "RAGKB-0002"], "retrieved_sections": ["History of the Faculty", "History of the Faculty", "History of the Faculty"], "retrieved_scopes": ["general", "general", "general"]}
|
| 6 |
+
{"qa_id": "qa_000308", "index_id": "UMI-0308", "query": "What distinctive specialization focus is highlighted for Bachelor of Computer Science (Artificial Intelligence)?", "gold_answer": "The specialization focus highlighted for Bachelor of Computer Science (Artificial Intelligence) is artificial intelligence, as reflected by the programme title and its learning-outcome emphasis.", "plain_answer": "<think>\n\n</think>\n\nThe Bachelor of Computer Science (Artificial Intelligence) programme emphasizes artificial intelligence as its specialization focus, as reflected by the programme title and focus areas.", "rag_answer": "<think>\n\n</think>\n\nThe specialization focus highlighted for Bachelor of Computer Science (Artificial Intelligence) is artificial intelligence, as reflected by the programme title and its learning-outcome emphasis.", "plain_exact_match": 0.0, "plain_token_f1": 0.7451, "rag_exact_match": 0.0, "rag_token_f1": 0.9615, "retrieved_kb_ids": ["RAGKB-0328", "RAGKB-0330", "RAGKB-0333"], "retrieved_sections": ["Undergraduate Programme Goals and Learning Outcomes", "Undergraduate Programme Goals and Learning Outcomes", "Undergraduate Programme Goals and Learning Outcomes"], "retrieved_scopes": ["undergraduate", "undergraduate", "undergraduate"]}
|
| 7 |
+
{"qa_id": "qa_000641", "index_id": "UMI-0641", "query": "Could you list the main lecture periods for bachelor degree students in Semester II of session 2025/2026?", "gold_answer": "For bachelor degree students in Semester II, the main lecture periods are 09.03.2026 to 26.04.2026 and 04.05.2026 to 21.06.2026.", "plain_answer": "<think>\n\n</think>\n\nThe main lecture periods for Bachelor's degree students in Semester II 2025/2026 are Week 1 to Week 6 as the first lecture period, and Week 7 to Week 12 as the second lecture period.", "rag_answer": "<think>\n\n</think>\n\nThe handbook shows the academic calendar for Bachelor Degree Level in session 2025/2026. It breaks down the calendar into orientation or semester preparation, teaching periods divided by scheduled breaks, special revision and examination periods, and the final examination block for each semester.", "plain_exact_match": 0.0, "plain_token_f1": 0.4923, "rag_exact_match": 0.0, "rag_token_f1": 0.25, "retrieved_kb_ids": ["RAGKB-0008", "RAGKB-0009", "RAGKB-0006"], "retrieved_sections": ["Academic Calendar 2025/2026", "Academic Calendar 2025/2026", "Academic Calendar 2025/2026"], "retrieved_scopes": ["undergraduate", "undergraduate", "postgraduate"]}
|
| 8 |
+
{"qa_id": "qa_000799", "index_id": "UMI-0799", "query": "Quick one: what graduate or research capabilities are highlighted in the learning objectives and outcomes for Master of Computer Science (By Research)?", "gold_answer": "The learning objectives and outcomes highlight the ability to apply and integrate knowledge of current computer science research issues, evaluate and analyse computing solutions, develop and assess research outcomes, use appropriate research methods to acquire and extend knowledge, communicate and publish technical work effectively, work independently and in groups, and demonstrate ethical and professionally responsible research conduct.", "plain_answer": "<think>\n\n</think>\n\nThe learning objectives and outcomes for Master of Computer Science (By Research) focus on conducting advanced computer science research, producing original research work capable of publication, and contributing to new knowledge in the field.", "rag_answer": "<think>\n\n</think>\n\nThe learning objectives and outcomes highlight the ability to apply and integrate knowledge of current computer science research issues, evaluate and analyse computing solutions, develop and assess research outcomes, use appropriate research methods to acquire and extend knowledge, communicate and publish technical work effectively, work independently and in groups, and demonstrate ethical and professionally responsible research conduct.", "plain_exact_match": 0.0, "plain_token_f1": 0.3656, "rag_exact_match": 0.0, "rag_token_f1": 0.9828, "retrieved_kb_ids": ["RAGKB-0504", "RAGKB-0505", "RAGKB-0506"], "retrieved_sections": ["Master of Computer Science (By Research)", "Master of Computer Science (By Research)", "Master of Computer Science (By Research)"], "retrieved_scopes": ["postgraduate", "postgraduate", "postgraduate"]}
|
| 9 |
+
{"qa_id": "qa_000986", "index_id": "UMI-0986", "query": "Could you tell me what dress expectations are illustrated for official events?", "gold_answer": "For official events, the poster illustrates formal attire, including suit-style clothing and traditional formal wear, to convey a neat and official appearance appropriate for formal university occasions.", "plain_answer": "<think>\n\n[Retrieved Evidence 1]\nSource document: Complete Handbook\nScope: general\nSection: Official Events\nSubsection: Dress Code for Official Events\nPages: 12\nKnowledge group: KG-0009\nChunk id: SC-00035\nText:\nDRESS CODE FOR OFFICIAL EVENTS Whether big or small, formal functions and events that involve the Faculty, University or external guests usually require a presentable formal of dress by participants. This does not mean that everyone has to wear the same suit or the same colour of shirt or the same brand of footwear. What is important is that", "rag_answer": "<think>\n\n</think>\n\nThe poster illustrates formal or traditional formal attire for official university occasions.", "plain_exact_match": 0.0, "plain_token_f1": 0.2393, "rag_exact_match": 0.0, "rag_token_f1": 0.5238, "retrieved_kb_ids": ["RAGKB-0319", "RAGKB-0138", "RAGKB-0071"], "retrieved_sections": ["Student Dress Code", "Research Guidance", "Research Guidance"], "retrieved_scopes": ["general", "postgraduate", "postgraduate"]}
|
| 10 |
+
{"qa_id": "qa_000566", "index_id": "UMI-0566", "query": "Could you tell me what schedule must students follow during industrial training?", "gold_answer": "A week-by-week schedule for students during Industrial Training, including placement approval, logbook submission, faculty supervision checkpoints, and final report submission.", "plain_answer": "<think>\n\n[Retrieved Evidence 1]\nSource document: Complete Handbook\nScope: undergraduate\nSection: Industrial Training\nSubsection: Industrial Training Guidelines\nPages: 270, 271\nKnowledge group: KG-0087\nChunk id: SC-00273\nText:\n5.0 Schedule of Industrial Training Week 1 to 5: 6.0 Week 6 to 10: 7.0 Week 11 to 15: 8.0 Week 16 to 20: 9.0 Week 21 to", "rag_answer": "<think>\n\n</think>\n\nStudents must follow the industrial training schedule set out in the handbook, including the approved placement period, weekly logging requirements, and other administrative timing requirements that apply to the internship component.", "plain_exact_match": 0.0, "plain_token_f1": 0.119, "rag_exact_match": 0.0, "rag_token_f1": 0.2545, "retrieved_kb_ids": ["RAGKB-0285", "RAGKB-0287", "RAGKB-0289"], "retrieved_sections": ["Industrial Training", "Industrial Training", "Industrial Training"], "retrieved_scopes": ["undergraduate", "undergraduate", "undergraduate"]}
|
| 11 |
+
{"qa_id": "qa_000994", "index_id": "UMI-0994", "query": "Quick check: what consequences or disciplinary actions are implied for examination dishonesty?", "gold_answer": "The consequences are disciplinary in nature. The handbook points students to official Universiti Malaya academic dishonesty, examination misconduct, and student discipline rules, which means dishonest conduct may trigger action under the university's disciplinary framework.", "plain_answer": "<think>\n\n</think>\n\nThe consequences include disciplinary action under the University's regulations for dishonesty in examinations.", "rag_answer": "<think>\n\n</think>\n\nThe consequences are disciplinary in nature. The handbook points students to official Universiti Malaya academic dishonesty, examination misconduct, and student discipline rules, which means dishonest conduct may trigger action under the university's disciplinary framework.", "plain_exact_match": 0.0, "plain_token_f1": 0.3922, "rag_exact_match": 0.0, "rag_token_f1": 0.9722, "retrieved_kb_ids": ["RAGKB-0320", "RAGKB-0321", "RAGKB-0292"], "retrieved_sections": ["Undergraduate Rules and Regulations", "Examination Grading Scheme", "Industrial Training"], "retrieved_scopes": ["undergraduate", "undergraduate", "undergraduate"]}
|
| 12 |
+
{"qa_id": "qa_000105", "index_id": "UMI-0105", "query": "What is the process flow for Academic Project II?", "gold_answer": "The handbook presents Academic Project as a staged process covering topic or title approval, supervision, progress monitoring, submission through the ilmiah system, viva or evaluation sessions, and final report completion across Academic Project I and Academic Project II.", "plain_answer": "<think>\n\n[Retrieved Evidence 1]\nSource document: Complete Handbook\nScope: postgraduate\nSection: Research Guidance\nSubsection: Process Flow of Academic Project I and Academic Project II\nPages: 12\nKnowledge group: KG-0085\nChunk id: SC-00059\nText:\nFigure 1 shows the process flow of Academic Project I and Academic Project II for the selected topics. The major activities under the supervision component include the supervisor’s meetings with the student, verification of the student’s progress, and the supervisor’s recommendations to the Academic Project Committee if the student is not meeting the", "rag_answer": "<think>\n\n</think>\n\nThe handbook presents Academic Project as a staged process covering topic or title approval, supervision, progress monitoring, submission through the ilmiah system, viva or evaluation sessions, and final report completion across Academic Project I and Academic Project II.", "plain_exact_match": 0.0, "plain_token_f1": 0.2481, "rag_exact_match": 0.0, "rag_token_f1": 0.9744, "retrieved_kb_ids": ["RAGKB-0301", "RAGKB-0296", "RAGKB-0297"], "retrieved_sections": ["Academic Project", "Academic Project", "Academic Project"], "retrieved_scopes": ["undergraduate", "undergraduate", "undergraduate"]}
|
| 13 |
+
{"qa_id": "qa_000028", "index_id": "UMI-0028", "query": "What is the course plan for Master of Software Engineering (Software Technology)?", "gold_answer": "The course plan organises the programme across multiple semesters for each intake. The core set includes Research Methodology, Architecting Software Systems, Framework-Based Software Design and Development, Software Verification and Validation, Software Project Management, and Dissertation. Students then select two electives such as Big Data Processing, Requirements Engineering, User Experience Design Studio, Advanced Internet of Things, Advanced Machine Learning, or Security Risk Analysis and Evaluation. The handbook notes that Research Methodology must be taken in the first semester, and dissertation registration requires at least 6 credits of passed coursework together with having taken or being enrolled in Research Methodology.", "plain_answer": "<think>\n\n[Retrieved Evidence 1]\nSource document: Course Plan for Master of Software Engineering (Software Technology)\nScope: postgraduate\nSection: Course Plan\nSubsection: Course Plan\nPages: 11\nKnowledge group: KG-0065\nChunk id: SC-00010\nText:\nLampiran B2 COURSE PLAN MASTERS OF SOFTWARE ENGINEERING SOFTWARE TECHNOLOGY ‘Semester | ‘Semester Il ‘Semester I (Marko?) COURSE CODE fean xx x xf 3001 | Research Methodology vv vv vv 3", "rag_answer": "<think>\n\n</think>\n\nThe course plan organises the programme across multiple semesters for each intake. The core set includes Research Methodology, Architecting Software Systems, Framework-Based Software Design and Development, Software Verification and Validation, Software Project Management, and Dissertation. Students then select two electives such as Big Data Processing, Requirements Engineering, User Experience Design Studio, Advanced Internet of Things, Advanced Machine Learning, or Security Risk Analysis and Evaluation. The handbook notes that Research Methodology must be taken in the first semester, and dissertation registration requires at least 6 credits of passed coursework together with having taken or being enrolled in Research Methodology.", "plain_exact_match": 0.0, "plain_token_f1": 0.1605, "rag_exact_match": 0.0, "rag_token_f1": 0.99, "retrieved_kb_ids": ["RAGKB-0404", "RAGKB-0402", "RAGKB-0403"], "retrieved_sections": ["Master of Software Engineering (Software Technology)", "Master of Software Engineering (Software Technology)", "Master of Software Engineering (Software Technology)"], "retrieved_scopes": ["postgraduate", "postgraduate", "postgraduate"]}
|
| 14 |
+
{"qa_id": "qa_000972", "index_id": "UMI-0972", "query": "Could you tell me what conditions or achievements make a project eligible for the Excellence Academic Project Award (APAC)?", "gold_answer": "The Excellence Academic Project Award (APAC), introduced in 2012, is an annual event that motivates students to produce excellent projects, recognizes strong systems, identifies good projects for reference and exhibition, and promotes them for competitions, exhibitions, and intellectual property purposes.", "plain_answer": "<think>\n\n</think>\n\nThe Excellence Academic Project Award (APAC) is given to projects that are evaluated as outstanding by the faculty's academic project assessment panel and subsequently approved for the award.", "rag_answer": "<think>\n\n</think>\n\nThe Excellence Academic Project Award (APAC), introduced in 2012, is an annual event that motivates students to produce excellent projects, recognizes strong systems, identifies good projects for reference and exhibition, and promotes them for competitions, exhibitions, and intellectual property purposes.", "plain_exact_match": 0.0, "plain_token_f1": 0.338, "rag_exact_match": 0.0, "rag_token_f1": 0.9756, "retrieved_kb_ids": ["RAGKB-0308", "RAGKB-0297", "RAGKB-0296"], "retrieved_sections": ["Academic Project", "Academic Project", "Academic Project"], "retrieved_scopes": ["undergraduate", "undergraduate", "undergraduate"]}
|
| 15 |
+
{"qa_id": "qa_000170", "index_id": "UMI-0170", "query": "What facilities and research focus are described for the Robotedge AI Robotic Lab?", "gold_answer": "The Robotedge AI Robotic Lab, previously known as the Natural Language Processing Lab, is equipped for AI robotics research and development, especially in environmental, home services, and search-and-rescue areas.", "plain_answer": "<think>\n\n</think>\n\nThe Robotedge AI Robotic Lab focuses on AI-powered robotics research and experimentation, with a research theme that combines artificial intelligence with robotic systems.", "rag_answer": "<think>\n\n</think>\n\nThe Robotedge AI Robotic Lab, previously known as the Natural Language Processing Lab, is equipped for AI robotics research and development, especially in environmental, home services, and search-and-rescue areas.", "plain_exact_match": 0.0, "plain_token_f1": 0.3158, "rag_exact_match": 0.0, "rag_token_f1": 0.9688, "retrieved_kb_ids": ["RAGKB-0017", "RAGKB-0019", "RAGKB-0016"], "retrieved_sections": ["Teaching and Learning Facilities", "Teaching and Learning Facilities", "Teaching and Learning Facilities"], "retrieved_scopes": ["general", "general", "general"]}
|
| 16 |
+
{"qa_id": "qa_000914", "index_id": "UMI-0914", "query": "Could you explain how should students interpret or use the programme core courses list when planning their studies?", "gold_answer": "Students should use the programme core courses list as compulsory study-planning guidance: they are expected to pass all Faculty and Programme Core courses and to check the stated course prerequisites, with Industrial Training treated separately in the prerequisite note.", "plain_answer": "<think>\n\n</think>\n\nStudents should use the programme core courses list as their required study path; together with the handbook's course description and curriculum table, it shows what every student in the programme must study.", "rag_answer": "<think>\n\n</think>\n\nStudents should use the programme core courses list as compulsory study-planning guidance: they are expected to pass all Faculty and Programme Core courses and to check the stated course prerequisites, with Industrial Training treated separately in the prerequisite note.", "plain_exact_match": 0.0, "plain_token_f1": 0.4533, "rag_exact_match": 0.0, "rag_token_f1": 0.9756, "retrieved_kb_ids": ["RAGKB-0185", "RAGKB-0186", "RAGKB-0184"], "retrieved_sections": ["Shared Undergraduate Curriculum", "Shared Undergraduate Curriculum", "Shared Undergraduate Curriculum"], "retrieved_scopes": ["undergraduate", "undergraduate", "undergraduate"]}
|
| 17 |
+
{"qa_id": "qa_000651", "index_id": "UMI-0651", "query": "Could you tell me what facilities and access arrangements are described for the Postgraduate Lab (ML)?", "gold_answer": "The Postgraduate Lab has 33 computer units connected to Windows Active Directory servers and the Internet, uses Windows 10, and is open to all FSKTM postgraduate students.", "plain_answer": "<think>\n\n</think>\n\nThe Postgraduate Lab (ML) at FCSIT, Universiti Malaya is listed under lab facility category and is intended for postgraduate use. The document states that students must be wear name tags and cordial when using the facility, and that data ownership follows university and lab regulations.", "rag_answer": "<think>\n\n</think>\n\nThe Postgraduate Lab has 33 computer units connected to Windows Active Directory servers and the Internet, uses Windows 10, and is open to all FSKTM postgraduate students.", "plain_exact_match": 0.0, "plain_token_f1": 0.2432, "rag_exact_match": 0.0, "rag_token_f1": 0.9643, "retrieved_kb_ids": ["RAGKB-0010", "RAGKB-0012", "RAGKB-0011"], "retrieved_sections": ["Teaching and Learning Facilities", "Teaching and Learning Facilities", "Teaching and Learning Facilities"], "retrieved_scopes": ["general", "general", "general"]}
|
| 18 |
+
{"qa_id": "qa_000292", "index_id": "UMI-0292", "query": "What rules apply to saving data, using personal devices, or protecting laboratory resources?", "gold_answer": "The laboratory regulations prohibit misuse of laboratory facilities, disruptive or improper behaviour, and conduct that may damage equipment, compromise safety, or interfere with other users.", "plain_answer": "<think>\n\n</think>\n\nThe handbook emphasizes proper saving, guarding, and protecting laboratory resources at all times.", "rag_answer": "<think>\n\n</think>\n\nThe laboratory regulations prohibit misuse of laboratory facilities, disruptive or improper behaviour, and conduct that may damage equipment, compromise safety, or interfere with other users.", "plain_exact_match": 0.0, "plain_token_f1": 0.15, "rag_exact_match": 0.0, "rag_token_f1": 0.9615, "retrieved_kb_ids": ["RAGKB-0139", "RAGKB-0140", "RAGKB-0141"], "retrieved_sections": ["Laboratory Regulations and Support", "Laboratory Regulations and Support", "Laboratory Regulations and Support"], "retrieved_scopes": ["general", "general", "general"]}
|
| 19 |
+
{"qa_id": "qa_000525", "index_id": "UMI-0525", "query": "Could you tell me what curriculum structure or course planning information is shown for Bachelor of Computer Science (Computer System and Network)?", "gold_answer": "The curriculum structure semester by semester, showing how university courses, faculty core courses, programme core courses, specialization electives, industrial training, and the academic project fit into the degree plan.", "plain_answer": "<think>\n\n</think>\n\nThe Bachelor of Computer Science (Computer System and Network) curriculum is illustrated as a 8-semester structure. It contains approximately 53 credits of programme core courses, 39 credits of specialization electives, and 14 credits of other specialization courses, with the remaining space made up by university courses, practical components, and the academic project.", "rag_answer": "<think>\n\n</think>\n\nThe curriculum structure semester by semester, showing how university courses, faculty core courses, programme core courses, specialization electives, industrial training, and the academic project fit into the degree plan.", "plain_exact_match": 0.0, "plain_token_f1": 0.4286, "rag_exact_match": 0.0, "rag_token_f1": 0.9667, "retrieved_kb_ids": ["RAGKB-0324", "RAGKB-0322", "RAGKB-0325"], "retrieved_sections": ["Undergraduate Programme Goals and Learning Outcomes", "Undergraduate Programme Goals and Learning Outcomes", "Undergraduate Programme Goals and Learning Outcomes"], "retrieved_scopes": ["undergraduate", "undergraduate", "undergraduate"]}
|
| 20 |
+
{"qa_id": "qa_000673", "index_id": "UMI-0673", "query": "Hey, what facilities and access arrangements are described for the Knowledge Engineering Lab?", "gold_answer": "The Knowledge Engineering Lab is an open-space concept lab at Wisma R&D that is accessible to all postgraduate students, regardless of field, with permission for access.", "plain_answer": "<think>\n\n</think>\n\nThe Knowledge Engineering Lab provides computer lab facilities and makes equipment access arrangements through the lab management system for its students and authorized users.", "rag_answer": "<think>\n\n</think>\n\nThe Knowledge Engineering Lab is an open-space concept lab at Wisma R&D that is accessible to all postgraduate students, regardless of field, with permission for access.", "plain_exact_match": 0.0, "plain_token_f1": 0.2963, "rag_exact_match": 0.0, "rag_token_f1": 0.9655, "retrieved_kb_ids": ["RAGKB-0016", "RAGKB-0017", "RAGKB-0015"], "retrieved_sections": ["Teaching and Learning Facilities", "Teaching and Learning Facilities", "Teaching and Learning Facilities"], "retrieved_scopes": ["general", "general", "general"]}
|
UM_Handbook/outputs/baseline2_rag_harness_agent/generation_eval/generation_metrics.json
ADDED
|
@@ -0,0 +1,10 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
{
|
| 2 |
+
"generation_eval_size": 20,
|
| 3 |
+
"top_k": 3,
|
| 4 |
+
"plain_exact_match": 0.0,
|
| 5 |
+
"plain_token_f1": 0.3391,
|
| 6 |
+
"rag_exact_match": 0.0,
|
| 7 |
+
"rag_token_f1": 0.846,
|
| 8 |
+
"rag_minus_plain_exact_match": 0.0,
|
| 9 |
+
"rag_minus_plain_token_f1": 0.5069
|
| 10 |
+
}
|
UM_Handbook/outputs/baseline2_rag_harness_agent/lora_adapter/README.md
ADDED
|
@@ -0,0 +1,209 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
---
|
| 2 |
+
base_model: /scr/user/kevin2002/TensorCat/NLP/UM_Handbook/models/Qwen3-8B
|
| 3 |
+
library_name: peft
|
| 4 |
+
pipeline_tag: text-generation
|
| 5 |
+
tags:
|
| 6 |
+
- base_model:adapter:/scr/user/kevin2002/TensorCat/NLP/UM_Handbook/models/Qwen3-8B
|
| 7 |
+
- lora
|
| 8 |
+
- sft
|
| 9 |
+
- transformers
|
| 10 |
+
- trl
|
| 11 |
+
---
|
| 12 |
+
|
| 13 |
+
# Model Card for Model ID
|
| 14 |
+
|
| 15 |
+
<!-- Provide a quick summary of what the model is/does. -->
|
| 16 |
+
|
| 17 |
+
|
| 18 |
+
|
| 19 |
+
## Model Details
|
| 20 |
+
|
| 21 |
+
### Model Description
|
| 22 |
+
|
| 23 |
+
<!-- Provide a longer summary of what this model is. -->
|
| 24 |
+
|
| 25 |
+
|
| 26 |
+
|
| 27 |
+
- **Developed by:** [More Information Needed]
|
| 28 |
+
- **Funded by [optional]:** [More Information Needed]
|
| 29 |
+
- **Shared by [optional]:** [More Information Needed]
|
| 30 |
+
- **Model type:** [More Information Needed]
|
| 31 |
+
- **Language(s) (NLP):** [More Information Needed]
|
| 32 |
+
- **License:** [More Information Needed]
|
| 33 |
+
- **Finetuned from model [optional]:** [More Information Needed]
|
| 34 |
+
|
| 35 |
+
### Model Sources [optional]
|
| 36 |
+
|
| 37 |
+
<!-- Provide the basic links for the model. -->
|
| 38 |
+
|
| 39 |
+
- **Repository:** [More Information Needed]
|
| 40 |
+
- **Paper [optional]:** [More Information Needed]
|
| 41 |
+
- **Demo [optional]:** [More Information Needed]
|
| 42 |
+
|
| 43 |
+
## Uses
|
| 44 |
+
|
| 45 |
+
<!-- Address questions around how the model is intended to be used, including the foreseeable users of the model and those affected by the model. -->
|
| 46 |
+
|
| 47 |
+
### Direct Use
|
| 48 |
+
|
| 49 |
+
<!-- This section is for the model use without fine-tuning or plugging into a larger ecosystem/app. -->
|
| 50 |
+
|
| 51 |
+
[More Information Needed]
|
| 52 |
+
|
| 53 |
+
### Downstream Use [optional]
|
| 54 |
+
|
| 55 |
+
<!-- This section is for the model use when fine-tuned for a task, or when plugged into a larger ecosystem/app -->
|
| 56 |
+
|
| 57 |
+
[More Information Needed]
|
| 58 |
+
|
| 59 |
+
### Out-of-Scope Use
|
| 60 |
+
|
| 61 |
+
<!-- This section addresses misuse, malicious use, and uses that the model will not work well for. -->
|
| 62 |
+
|
| 63 |
+
[More Information Needed]
|
| 64 |
+
|
| 65 |
+
## Bias, Risks, and Limitations
|
| 66 |
+
|
| 67 |
+
<!-- This section is meant to convey both technical and sociotechnical limitations. -->
|
| 68 |
+
|
| 69 |
+
[More Information Needed]
|
| 70 |
+
|
| 71 |
+
### Recommendations
|
| 72 |
+
|
| 73 |
+
<!-- This section is meant to convey recommendations with respect to the bias, risk, and technical limitations. -->
|
| 74 |
+
|
| 75 |
+
Users (both direct and downstream) should be made aware of the risks, biases and limitations of the model. More information needed for further recommendations.
|
| 76 |
+
|
| 77 |
+
## How to Get Started with the Model
|
| 78 |
+
|
| 79 |
+
Use the code below to get started with the model.
|
| 80 |
+
|
| 81 |
+
[More Information Needed]
|
| 82 |
+
|
| 83 |
+
## Training Details
|
| 84 |
+
|
| 85 |
+
### Training Data
|
| 86 |
+
|
| 87 |
+
<!-- This should link to a Dataset Card, perhaps with a short stub of information on what the training data is all about as well as documentation related to data pre-processing or additional filtering. -->
|
| 88 |
+
|
| 89 |
+
[More Information Needed]
|
| 90 |
+
|
| 91 |
+
### Training Procedure
|
| 92 |
+
|
| 93 |
+
<!-- This relates heavily to the Technical Specifications. Content here should link to that section when it is relevant to the training procedure. -->
|
| 94 |
+
|
| 95 |
+
#### Preprocessing [optional]
|
| 96 |
+
|
| 97 |
+
[More Information Needed]
|
| 98 |
+
|
| 99 |
+
|
| 100 |
+
#### Training Hyperparameters
|
| 101 |
+
|
| 102 |
+
- **Training regime:** [More Information Needed] <!--fp32, fp16 mixed precision, bf16 mixed precision, bf16 non-mixed precision, fp16 non-mixed precision, fp8 mixed precision -->
|
| 103 |
+
|
| 104 |
+
#### Speeds, Sizes, Times [optional]
|
| 105 |
+
|
| 106 |
+
<!-- This section provides information about throughput, start/end time, checkpoint size if relevant, etc. -->
|
| 107 |
+
|
| 108 |
+
[More Information Needed]
|
| 109 |
+
|
| 110 |
+
## Evaluation
|
| 111 |
+
|
| 112 |
+
<!-- This section describes the evaluation protocols and provides the results. -->
|
| 113 |
+
|
| 114 |
+
### Testing Data, Factors & Metrics
|
| 115 |
+
|
| 116 |
+
#### Testing Data
|
| 117 |
+
|
| 118 |
+
<!-- This should link to a Dataset Card if possible. -->
|
| 119 |
+
|
| 120 |
+
[More Information Needed]
|
| 121 |
+
|
| 122 |
+
#### Factors
|
| 123 |
+
|
| 124 |
+
<!-- These are the things the evaluation is disaggregating by, e.g., subpopulations or domains. -->
|
| 125 |
+
|
| 126 |
+
[More Information Needed]
|
| 127 |
+
|
| 128 |
+
#### Metrics
|
| 129 |
+
|
| 130 |
+
<!-- These are the evaluation metrics being used, ideally with a description of why. -->
|
| 131 |
+
|
| 132 |
+
[More Information Needed]
|
| 133 |
+
|
| 134 |
+
### Results
|
| 135 |
+
|
| 136 |
+
[More Information Needed]
|
| 137 |
+
|
| 138 |
+
#### Summary
|
| 139 |
+
|
| 140 |
+
|
| 141 |
+
|
| 142 |
+
## Model Examination [optional]
|
| 143 |
+
|
| 144 |
+
<!-- Relevant interpretability work for the model goes here -->
|
| 145 |
+
|
| 146 |
+
[More Information Needed]
|
| 147 |
+
|
| 148 |
+
## Environmental Impact
|
| 149 |
+
|
| 150 |
+
<!-- Total emissions (in grams of CO2eq) and additional considerations, such as electricity usage, go here. Edit the suggested text below accordingly -->
|
| 151 |
+
|
| 152 |
+
Carbon emissions can be estimated using the [Machine Learning Impact calculator](https://mlco2.github.io/impact#compute) presented in [Lacoste et al. (2019)](https://arxiv.org/abs/1910.09700).
|
| 153 |
+
|
| 154 |
+
- **Hardware Type:** [More Information Needed]
|
| 155 |
+
- **Hours used:** [More Information Needed]
|
| 156 |
+
- **Cloud Provider:** [More Information Needed]
|
| 157 |
+
- **Compute Region:** [More Information Needed]
|
| 158 |
+
- **Carbon Emitted:** [More Information Needed]
|
| 159 |
+
|
| 160 |
+
## Technical Specifications [optional]
|
| 161 |
+
|
| 162 |
+
### Model Architecture and Objective
|
| 163 |
+
|
| 164 |
+
[More Information Needed]
|
| 165 |
+
|
| 166 |
+
### Compute Infrastructure
|
| 167 |
+
|
| 168 |
+
[More Information Needed]
|
| 169 |
+
|
| 170 |
+
#### Hardware
|
| 171 |
+
|
| 172 |
+
[More Information Needed]
|
| 173 |
+
|
| 174 |
+
#### Software
|
| 175 |
+
|
| 176 |
+
[More Information Needed]
|
| 177 |
+
|
| 178 |
+
## Citation [optional]
|
| 179 |
+
|
| 180 |
+
<!-- If there is a paper or blog post introducing the model, the APA and Bibtex information for that should go in this section. -->
|
| 181 |
+
|
| 182 |
+
**BibTeX:**
|
| 183 |
+
|
| 184 |
+
[More Information Needed]
|
| 185 |
+
|
| 186 |
+
**APA:**
|
| 187 |
+
|
| 188 |
+
[More Information Needed]
|
| 189 |
+
|
| 190 |
+
## Glossary [optional]
|
| 191 |
+
|
| 192 |
+
<!-- If relevant, include terms and calculations in this section that can help readers understand the model or model card. -->
|
| 193 |
+
|
| 194 |
+
[More Information Needed]
|
| 195 |
+
|
| 196 |
+
## More Information [optional]
|
| 197 |
+
|
| 198 |
+
[More Information Needed]
|
| 199 |
+
|
| 200 |
+
## Model Card Authors [optional]
|
| 201 |
+
|
| 202 |
+
[More Information Needed]
|
| 203 |
+
|
| 204 |
+
## Model Card Contact
|
| 205 |
+
|
| 206 |
+
[More Information Needed]
|
| 207 |
+
### Framework versions
|
| 208 |
+
|
| 209 |
+
- PEFT 0.18.1
|
UM_Handbook/outputs/baseline2_rag_harness_agent/lora_adapter/adapter_config.json
ADDED
|
@@ -0,0 +1,46 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
{
|
| 2 |
+
"alora_invocation_tokens": null,
|
| 3 |
+
"alpha_pattern": {},
|
| 4 |
+
"arrow_config": null,
|
| 5 |
+
"auto_mapping": null,
|
| 6 |
+
"base_model_name_or_path": "/scr/user/kevin2002/TensorCat/NLP/UM_Handbook/models/Qwen3-8B",
|
| 7 |
+
"bias": "none",
|
| 8 |
+
"corda_config": null,
|
| 9 |
+
"ensure_weight_tying": false,
|
| 10 |
+
"eva_config": null,
|
| 11 |
+
"exclude_modules": null,
|
| 12 |
+
"fan_in_fan_out": false,
|
| 13 |
+
"inference_mode": true,
|
| 14 |
+
"init_lora_weights": true,
|
| 15 |
+
"layer_replication": null,
|
| 16 |
+
"layers_pattern": null,
|
| 17 |
+
"layers_to_transform": null,
|
| 18 |
+
"loftq_config": {},
|
| 19 |
+
"lora_alpha": 16,
|
| 20 |
+
"lora_bias": false,
|
| 21 |
+
"lora_dropout": 0.05,
|
| 22 |
+
"megatron_config": null,
|
| 23 |
+
"megatron_core": "megatron.core",
|
| 24 |
+
"modules_to_save": null,
|
| 25 |
+
"peft_type": "LORA",
|
| 26 |
+
"peft_version": "0.18.1",
|
| 27 |
+
"qalora_group_size": 16,
|
| 28 |
+
"r": 8,
|
| 29 |
+
"rank_pattern": {},
|
| 30 |
+
"revision": null,
|
| 31 |
+
"target_modules": [
|
| 32 |
+
"gate_proj",
|
| 33 |
+
"v_proj",
|
| 34 |
+
"q_proj",
|
| 35 |
+
"k_proj",
|
| 36 |
+
"up_proj",
|
| 37 |
+
"down_proj",
|
| 38 |
+
"o_proj"
|
| 39 |
+
],
|
| 40 |
+
"target_parameters": null,
|
| 41 |
+
"task_type": "CAUSAL_LM",
|
| 42 |
+
"trainable_token_indices": null,
|
| 43 |
+
"use_dora": false,
|
| 44 |
+
"use_qalora": false,
|
| 45 |
+
"use_rslora": false
|
| 46 |
+
}
|
UM_Handbook/outputs/baseline2_rag_harness_agent/lora_adapter/adapter_model.safetensors
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:54282e9eff87e798234f8fa408e6caf2409416eed4516465894c29c08265a18e
|
| 3 |
+
size 87360584
|
UM_Handbook/outputs/baseline2_rag_harness_agent/lora_adapter/chat_template.jinja
ADDED
|
@@ -0,0 +1,89 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
{%- if tools %}
|
| 2 |
+
{{- '<|im_start|>system\n' }}
|
| 3 |
+
{%- if messages[0].role == 'system' %}
|
| 4 |
+
{{- messages[0].content + '\n\n' }}
|
| 5 |
+
{%- endif %}
|
| 6 |
+
{{- "# Tools\n\nYou may call one or more functions to assist with the user query.\n\nYou are provided with function signatures within <tools></tools> XML tags:\n<tools>" }}
|
| 7 |
+
{%- for tool in tools %}
|
| 8 |
+
{{- "\n" }}
|
| 9 |
+
{{- tool | tojson }}
|
| 10 |
+
{%- endfor %}
|
| 11 |
+
{{- "\n</tools>\n\nFor each function call, return a json object with function name and arguments within <tool_call></tool_call> XML tags:\n<tool_call>\n{\"name\": <function-name>, \"arguments\": <args-json-object>}\n</tool_call><|im_end|>\n" }}
|
| 12 |
+
{%- else %}
|
| 13 |
+
{%- if messages[0].role == 'system' %}
|
| 14 |
+
{{- '<|im_start|>system\n' + messages[0].content + '<|im_end|>\n' }}
|
| 15 |
+
{%- endif %}
|
| 16 |
+
{%- endif %}
|
| 17 |
+
{%- set ns = namespace(multi_step_tool=true, last_query_index=messages|length - 1) %}
|
| 18 |
+
{%- for message in messages[::-1] %}
|
| 19 |
+
{%- set index = (messages|length - 1) - loop.index0 %}
|
| 20 |
+
{%- if ns.multi_step_tool and message.role == "user" and message.content is string and not(message.content.startswith('<tool_response>') and message.content.endswith('</tool_response>')) %}
|
| 21 |
+
{%- set ns.multi_step_tool = false %}
|
| 22 |
+
{%- set ns.last_query_index = index %}
|
| 23 |
+
{%- endif %}
|
| 24 |
+
{%- endfor %}
|
| 25 |
+
{%- for message in messages %}
|
| 26 |
+
{%- if message.content is string %}
|
| 27 |
+
{%- set content = message.content %}
|
| 28 |
+
{%- else %}
|
| 29 |
+
{%- set content = '' %}
|
| 30 |
+
{%- endif %}
|
| 31 |
+
{%- if (message.role == "user") or (message.role == "system" and not loop.first) %}
|
| 32 |
+
{{- '<|im_start|>' + message.role + '\n' + content + '<|im_end|>' + '\n' }}
|
| 33 |
+
{%- elif message.role == "assistant" %}
|
| 34 |
+
{%- set reasoning_content = '' %}
|
| 35 |
+
{%- if message.reasoning_content is string %}
|
| 36 |
+
{%- set reasoning_content = message.reasoning_content %}
|
| 37 |
+
{%- else %}
|
| 38 |
+
{%- if '</think>' in content %}
|
| 39 |
+
{%- set reasoning_content = content.split('</think>')[0].rstrip('\n').split('<think>')[-1].lstrip('\n') %}
|
| 40 |
+
{%- set content = content.split('</think>')[-1].lstrip('\n') %}
|
| 41 |
+
{%- endif %}
|
| 42 |
+
{%- endif %}
|
| 43 |
+
{%- if loop.index0 > ns.last_query_index %}
|
| 44 |
+
{%- if loop.last or (not loop.last and reasoning_content) %}
|
| 45 |
+
{{- '<|im_start|>' + message.role + '\n<think>\n' + reasoning_content.strip('\n') + '\n</think>\n\n' + content.lstrip('\n') }}
|
| 46 |
+
{%- else %}
|
| 47 |
+
{{- '<|im_start|>' + message.role + '\n' + content }}
|
| 48 |
+
{%- endif %}
|
| 49 |
+
{%- else %}
|
| 50 |
+
{{- '<|im_start|>' + message.role + '\n' + content }}
|
| 51 |
+
{%- endif %}
|
| 52 |
+
{%- if message.tool_calls %}
|
| 53 |
+
{%- for tool_call in message.tool_calls %}
|
| 54 |
+
{%- if (loop.first and content) or (not loop.first) %}
|
| 55 |
+
{{- '\n' }}
|
| 56 |
+
{%- endif %}
|
| 57 |
+
{%- if tool_call.function %}
|
| 58 |
+
{%- set tool_call = tool_call.function %}
|
| 59 |
+
{%- endif %}
|
| 60 |
+
{{- '<tool_call>\n{"name": "' }}
|
| 61 |
+
{{- tool_call.name }}
|
| 62 |
+
{{- '", "arguments": ' }}
|
| 63 |
+
{%- if tool_call.arguments is string %}
|
| 64 |
+
{{- tool_call.arguments }}
|
| 65 |
+
{%- else %}
|
| 66 |
+
{{- tool_call.arguments | tojson }}
|
| 67 |
+
{%- endif %}
|
| 68 |
+
{{- '}\n</tool_call>' }}
|
| 69 |
+
{%- endfor %}
|
| 70 |
+
{%- endif %}
|
| 71 |
+
{{- '<|im_end|>\n' }}
|
| 72 |
+
{%- elif message.role == "tool" %}
|
| 73 |
+
{%- if loop.first or (messages[loop.index0 - 1].role != "tool") %}
|
| 74 |
+
{{- '<|im_start|>user' }}
|
| 75 |
+
{%- endif %}
|
| 76 |
+
{{- '\n<tool_response>\n' }}
|
| 77 |
+
{{- content }}
|
| 78 |
+
{{- '\n</tool_response>' }}
|
| 79 |
+
{%- if loop.last or (messages[loop.index0 + 1].role != "tool") %}
|
| 80 |
+
{{- '<|im_end|>\n' }}
|
| 81 |
+
{%- endif %}
|
| 82 |
+
{%- endif %}
|
| 83 |
+
{%- endfor %}
|
| 84 |
+
{%- if add_generation_prompt %}
|
| 85 |
+
{{- '<|im_start|>assistant\n' }}
|
| 86 |
+
{%- if enable_thinking is defined and enable_thinking is false %}
|
| 87 |
+
{{- '<think>\n\n</think>\n\n' }}
|
| 88 |
+
{%- endif %}
|
| 89 |
+
{%- endif %}
|
UM_Handbook/outputs/baseline2_rag_harness_agent/lora_adapter/tokenizer.json
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:be75606093db2094d7cd20f3c2f385c212750648bd6ea4fb2bf507a6a4c55506
|
| 3 |
+
size 11422650
|
UM_Handbook/outputs/baseline2_rag_harness_agent/lora_adapter/tokenizer_config.json
ADDED
|
@@ -0,0 +1,29 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
{
|
| 2 |
+
"add_prefix_space": false,
|
| 3 |
+
"backend": "tokenizers",
|
| 4 |
+
"bos_token": null,
|
| 5 |
+
"clean_up_tokenization_spaces": false,
|
| 6 |
+
"eos_token": "<|im_end|>",
|
| 7 |
+
"errors": "replace",
|
| 8 |
+
"extra_special_tokens": [
|
| 9 |
+
"<|im_start|>",
|
| 10 |
+
"<|im_end|>",
|
| 11 |
+
"<|object_ref_start|>",
|
| 12 |
+
"<|object_ref_end|>",
|
| 13 |
+
"<|box_start|>",
|
| 14 |
+
"<|box_end|>",
|
| 15 |
+
"<|quad_start|>",
|
| 16 |
+
"<|quad_end|>",
|
| 17 |
+
"<|vision_start|>",
|
| 18 |
+
"<|vision_end|>",
|
| 19 |
+
"<|vision_pad|>",
|
| 20 |
+
"<|image_pad|>",
|
| 21 |
+
"<|video_pad|>"
|
| 22 |
+
],
|
| 23 |
+
"is_local": true,
|
| 24 |
+
"model_max_length": 131072,
|
| 25 |
+
"pad_token": "<|endoftext|>",
|
| 26 |
+
"split_special_tokens": false,
|
| 27 |
+
"tokenizer_class": "Qwen2Tokenizer",
|
| 28 |
+
"unk_token": null
|
| 29 |
+
}
|
UM_Handbook/outputs/baseline2_rag_harness_agent/lora_adapter/training_args.bin
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:20789f4b467b5ff6e691e6d696bc04daebaaa023439a4918e4927d5d9b849aee
|
| 3 |
+
size 5304
|
UM_Handbook/outputs/baseline2_rag_harness_agent/rag_augmented_dataset/rag_augmented_sft_dataset.jsonl
ADDED
|
The diff for this file is too large to render.
See raw diff
|
|
|
UM_Handbook/outputs/baseline2_rag_harness_agent/rag_augmented_dataset/rag_augmented_sft_dataset_preview.json
ADDED
|
@@ -0,0 +1,212 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
[
|
| 2 |
+
{
|
| 3 |
+
"qa_id": "qa_000001",
|
| 4 |
+
"index_id": "UMI-0001",
|
| 5 |
+
"question": "What are the faculty objectives?",
|
| 6 |
+
"answer": "The faculty objectives are to sustain excellence in undergraduate and postgraduate teaching, learning, and research; contribute to national development through quality research and publications; provide innovative academic programmes that respond to societal needs; and produce quality graduates with advanced knowledge and skills in computer science and information technology.",
|
| 7 |
+
"scope_label": "general",
|
| 8 |
+
"source_doc": "General Handbook",
|
| 9 |
+
"section": "Faculty Objectives",
|
| 10 |
+
"subsection": "Faculty Objectives",
|
| 11 |
+
"retrieved_kb_ids": [
|
| 12 |
+
"RAGKB-0001",
|
| 13 |
+
"RAGKB-0515",
|
| 14 |
+
"RAGKB-0504"
|
| 15 |
+
],
|
| 16 |
+
"retrieved_sections": [
|
| 17 |
+
"Faculty Objectives",
|
| 18 |
+
"Doctor of Philosophy",
|
| 19 |
+
"Master of Computer Science (By Research)"
|
| 20 |
+
],
|
| 21 |
+
"retrieved_scopes": [
|
| 22 |
+
"general",
|
| 23 |
+
"postgraduate",
|
| 24 |
+
"postgraduate"
|
| 25 |
+
],
|
| 26 |
+
"prompt": [
|
| 27 |
+
{
|
| 28 |
+
"role": "system",
|
| 29 |
+
"content": "You are an academic assistant for the Faculty of Computer Science and Information Technology, Universiti Malaya. Answer only with handbook-grounded information. Use the retrieved handbook evidence when it is provided. Keep the answer direct, specific, and faithful to the handbook. If the evidence does not support a claim, say so clearly and briefly."
|
| 30 |
+
},
|
| 31 |
+
{
|
| 32 |
+
"role": "user",
|
| 33 |
+
"content": "Use the retrieved handbook evidence to answer the question. Base the answer on the evidence and keep it faithful to the handbook.\n\n[Retrieved Evidence 1]\nSource document: General Handbook\nScope: general\nSection: Faculty Objectives\nSubsection: Faculty Objectives\nPages: 9\nKnowledge group: KG-0009\nChunk id: SC-00001\nText:\nVISION A global faculty impacting the world MISSION Propelling computing technology and producing world class leaders OBJECTIVES To sustain an outstanding faculty dedicated to excellence in undergraduate and postgraduate teaching, learning and research. To contribute towards the development of the nation through the production of quality research and publications. To provide innovative academic programs that can respond to the changing needs of the society. To produce quality graduates who are equipped with advanced knowledge and skills of computer science and information technology.\n\n[Retrieved Evidence 2]\nSource document: Complete Handbook\nScope: postgraduate\nSection: Doctor of Philosophy\nSubsection: Programme Education Objectives\nPages: 121\nKnowledge group: KG-0021\nChunk id: SC-00515\nText:\nPROGRAMME EDUCATION OBJETIVES oe PEO 1: Foster innovation of new ideas, methods and techniques in relevant research fields PEO 2: Lead research and establish a career as a skilled researcher and/or practitioner PEO 3. Disseminate research output and provide expert advice through various mechanisms in an ethical and professional manner\n\n[Retrieved Evidence 3]\nSource document: Complete Handbook\nScope: postgraduate\nSection: Master of Computer Science (By Research)\nSubsection: Learning Objectives and Outcomes\nPages: 114\nKnowledge group: KG-0039\nChunk id: SC-00504\nText:\nPROGRAMME LEARNING OBJECTIVES & OUTCOMES SS PROGRAMME EDUCATIONAL OBJECTIVES (PEO) 4. To produce researchers who can contribute to the development and knowledge of computer science 2. To produce professionals in computer science research 3. To equip graduates with technical and soft skills PROGRAMME LEARNING OUTCOMES (PLO) Taxonomic No. Programme Learning Outcomes PLOs Gubecte 1. [Apply and integrate knowledge on the latest research issues |p) Gy 3S in computer science and produce state-of-the-art research. 2. [Evaluate and analyse computing solutions in terms of re Pp usability, efficiency and effectiveness 3. [Produce computing solutions and use appropriate tools to lanalyse the performanc ...[truncated]\n\nQuestion: What are the faculty objectives?"
|
| 34 |
+
}
|
| 35 |
+
],
|
| 36 |
+
"completion": [
|
| 37 |
+
{
|
| 38 |
+
"role": "assistant",
|
| 39 |
+
"content": "The faculty objectives are to sustain excellence in undergraduate and postgraduate teaching, learning, and research; contribute to national development through quality research and publications; provide innovative academic programmes that respond to societal needs; and produce quality graduates with advanced knowledge and skills in computer science and information technology."
|
| 40 |
+
}
|
| 41 |
+
],
|
| 42 |
+
"text": "<|im_start|>system\nYou are an academic assistant for the Faculty of Computer Science and Information Technology, Universiti Malaya. Answer only with handbook-grounded information. Use the retrieved handbook evidence when it is provided. Keep the answer direct, specific, and faithful to the handbook. If the evidence does not support a claim, say so clearly and briefly.<|im_end|>\n<|im_start|>user\nUse the retrieved handbook evidence to answer the question. Base the answer on the evidence and keep it faithful to the handbook.\n\n[Retrieved Evidence 1]\nSource document: General Handbook\nScope: general\nSection: Faculty Objectives\nSubsection: Faculty Objectives\nPages: 9\nKnowledge group: KG-0009\nChunk id: SC-00001\nText:\nVISION A global faculty impacting the world MISSION Propelling computing technology and producing world class leaders OBJECTIVES To sustain an outstanding faculty dedicated to excellence in undergraduate and postgraduate teaching, learning and research. To contribute towards the development of the nation through the production of quality research and publications. To provide innovative academic programs that can respond to the changing needs of the society. To produce quality graduates who are equipped with advanced knowledge and skills of computer science and information technology.\n\n[Retrieved Evidence 2]\nSource document: Complete Handbook\nScope: postgraduate\nSection: Doctor of Philosophy\nSubsection: Programme Education Objectives\nPages: 121\nKnowledge group: KG-0021\nChunk id: SC-00515\nText:\nPROGRAMME EDUCATION OBJETIVES oe PEO 1: Foster innovation of new ideas, methods and techniques in relevant research fields PEO 2: Lead research and establish a career as a skilled researcher and/or practitioner PEO 3. Disseminate research output and provide expert advice through various mechanisms in an ethical and professional manner\n\n[Retrieved Evidence 3]\nSource document: Complete Handbook\nScope: postgraduate\nSection: Master of Computer Science (By Research)\nSubsection: Learning Objectives and Outcomes\nPages: 114\nKnowledge group: KG-0039\nChunk id: SC-00504\nText:\nPROGRAMME LEARNING OBJECTIVES & OUTCOMES SS PROGRAMME EDUCATIONAL OBJECTIVES (PEO) 4. To produce researchers who can contribute to the development and knowledge of computer science 2. To produce professionals in computer science research 3. To equip graduates with technical and soft skills PROGRAMME LEARNING OUTCOMES (PLO) Taxonomic No. Programme Learning Outcomes PLOs Gubecte 1. [Apply and integrate knowledge on the latest research issues |p) Gy 3S in computer science and produce state-of-the-art research. 2. [Evaluate and analyse computing solutions in terms of re Pp usability, efficiency and effectiveness 3. [Produce computing solutions and use appropriate tools to lanalyse the performanc ...[truncated]\n\nQuestion: What are the faculty objectives?<|im_end|>\n<|im_start|>assistant\n<think>\n\n</think>\n\nThe faculty objectives are to sustain excellence in undergraduate and postgraduate teaching, learning, and research; contribute to national development through quality research and publications; provide innovative academic programmes that respond to societal needs; and produce quality graduates with advanced knowledge and skills in computer science and information technology.<|im_end|>\n"
|
| 43 |
+
},
|
| 44 |
+
{
|
| 45 |
+
"qa_id": "qa_000002",
|
| 46 |
+
"index_id": "UMI-0002",
|
| 47 |
+
"question": "What is the history overview of the Faculty of Computer Science and Information Technology?",
|
| 48 |
+
"answer": "Universiti Malaya's computing facilities began with the Computer Centre in the 1960s, later expanded into teaching and research, introduced the Diploma in Computer Science in 1974 and research degrees in 1985, then moved toward a full faculty structure in 1993. FCSIT was formally approved in 1994, its main building was opened in 1996, the Bachelor of Information Technology began in 1996/97, and four departments were established in 1997.",
|
| 49 |
+
"scope_label": "general",
|
| 50 |
+
"source_doc": "General Handbook",
|
| 51 |
+
"section": "History of the Faculty",
|
| 52 |
+
"subsection": "History Overview",
|
| 53 |
+
"retrieved_kb_ids": [
|
| 54 |
+
"RAGKB-0004",
|
| 55 |
+
"RAGKB-0003",
|
| 56 |
+
"RAGKB-0002"
|
| 57 |
+
],
|
| 58 |
+
"retrieved_sections": [
|
| 59 |
+
"History of the Faculty",
|
| 60 |
+
"History of the Faculty",
|
| 61 |
+
"History of the Faculty"
|
| 62 |
+
],
|
| 63 |
+
"retrieved_scopes": [
|
| 64 |
+
"general",
|
| 65 |
+
"general",
|
| 66 |
+
"general"
|
| 67 |
+
],
|
| 68 |
+
"prompt": [
|
| 69 |
+
{
|
| 70 |
+
"role": "system",
|
| 71 |
+
"content": "You are an academic assistant for the Faculty of Computer Science and Information Technology, Universiti Malaya. Answer only with handbook-grounded information. Use the retrieved handbook evidence when it is provided. Keep the answer direct, specific, and faithful to the handbook. If the evidence does not support a claim, say so clearly and briefly."
|
| 72 |
+
},
|
| 73 |
+
{
|
| 74 |
+
"role": "user",
|
| 75 |
+
"content": "Use the retrieved handbook evidence to answer the question. Base the answer on the evidence and keep it faithful to the handbook.\n\n[Retrieved Evidence 1]\nSource document: General Handbook\nScope: general\nSection: History of the Faculty\nSubsection: History Overview\nPages: 11\nKnowledge group: KG-0011\nChunk id: SC-00004\nText:\nThe Bachelor of Information Technology programme started in the 1996/1997 Session, with an initial intake of 50 students. To accommodate an increase student population, an additional building was built which was officially opened by Datuk Fong Chan Onn, Deputy Minister of Education on September 21st, 1998. Since its establishment, the Faculty of Computer Science and Information Technology have been led by a number of distinguished persons. The following have served as Directors/Dean: HISTORY OF THE FACULTY 1967 – 1973 1973 – 1975 1975 – 1978 1978 – 1982 1982 – 1990 1990 – 1992 1992 – 2000 2000 – 2002 2002 – 2004 2004 – 2005 2005 – 2006 2006 – 2007 2007 – 2009 2009 – 2010 2010 – 2011 2011 – 2 ...[truncated]\n\n[Retrieved Evidence 2]\nSource document: General Handbook\nScope: general\nSection: History of the Faculty\nSubsection: History Overview\nPages: 10\nKnowledge group: KG-0011\nChunk id: SC-00003\nText:\nprogramme. The first undergraduate enrolment for the 1990/1991 Session was 50 students. In April 1st, 1993, the University Senate agreed to the formation of the Computer Centre Study Board. The Board proposed the establishment of a faculty to be called the Faculty of Computer Science and Information Technology (FCSIT). The existing Computer Centre was to be annulled and replaced by a Computer Services Division which was placed under the Chancellery. On September 22nd, 1994, the University of Malaya Council agreed to the formation of the Faculty of Computer Science and Information Technology (FCSIT), and the Computer Services Division. A sum of 4.2 million was obtained from the Ministry of Ed ...[truncated]\n\n[Retrieved Evidence 3]\nSource document: General Handbook\nScope: general\nSection: History of the Faculty\nSubsection: History Overview\nPages: 10\nKnowledge group: KG-0011\nChunk id: SC-00002\nText:\nThe provision of computer facilities and services at the Universiti Malaya (UM) began soon after the Computer Centre was officially formed in 1965. This made the university one of the pioneers in computer usage in Malaysia. In December 1969, the Computer Centre took on an additional role of teaching and research of computer science and information technology. The Computer Centre Board was formed, comprising the Vice-Chancellor (as Chairman), the Director of Computer Centre (as Secretary), and a representative from each Faculty, Institute, Centre and the University Senate. In 1974, the Diploma in Computer Science programme was introduced. From its inception in the 1974/1975 Session to the 199 ...[truncated]\n\nQuestion: What is the history overview of the Faculty of Computer Science and Information Technology?"
|
| 76 |
+
}
|
| 77 |
+
],
|
| 78 |
+
"completion": [
|
| 79 |
+
{
|
| 80 |
+
"role": "assistant",
|
| 81 |
+
"content": "Universiti Malaya's computing facilities began with the Computer Centre in the 1960s, later expanded into teaching and research, introduced the Diploma in Computer Science in 1974 and research degrees in 1985, then moved toward a full faculty structure in 1993. FCSIT was formally approved in 1994, its main building was opened in 1996, the Bachelor of Information Technology began in 1996/97, and four departments were established in 1997."
|
| 82 |
+
}
|
| 83 |
+
],
|
| 84 |
+
"text": "<|im_start|>system\nYou are an academic assistant for the Faculty of Computer Science and Information Technology, Universiti Malaya. Answer only with handbook-grounded information. Use the retrieved handbook evidence when it is provided. Keep the answer direct, specific, and faithful to the handbook. If the evidence does not support a claim, say so clearly and briefly.<|im_end|>\n<|im_start|>user\nUse the retrieved handbook evidence to answer the question. Base the answer on the evidence and keep it faithful to the handbook.\n\n[Retrieved Evidence 1]\nSource document: General Handbook\nScope: general\nSection: History of the Faculty\nSubsection: History Overview\nPages: 11\nKnowledge group: KG-0011\nChunk id: SC-00004\nText:\nThe Bachelor of Information Technology programme started in the 1996/1997 Session, with an initial intake of 50 students. To accommodate an increase student population, an additional building was built which was officially opened by Datuk Fong Chan Onn, Deputy Minister of Education on September 21st, 1998. Since its establishment, the Faculty of Computer Science and Information Technology have been led by a number of distinguished persons. The following have served as Directors/Dean: HISTORY OF THE FACULTY 1967 – 1973 1973 – 1975 1975 – 1978 1978 – 1982 1982 – 1990 1990 – 1992 1992 – 2000 2000 – 2002 2002 – 2004 2004 – 2005 2005 – 2006 2006 – 2007 2007 – 2009 2009 – 2010 2010 – 2011 2011 – 2 ...[truncated]\n\n[Retrieved Evidence 2]\nSource document: General Handbook\nScope: general\nSection: History of the Faculty\nSubsection: History Overview\nPages: 10\nKnowledge group: KG-0011\nChunk id: SC-00003\nText:\nprogramme. The first undergraduate enrolment for the 1990/1991 Session was 50 students. In April 1st, 1993, the University Senate agreed to the formation of the Computer Centre Study Board. The Board proposed the establishment of a faculty to be called the Faculty of Computer Science and Information Technology (FCSIT). The existing Computer Centre was to be annulled and replaced by a Computer Services Division which was placed under the Chancellery. On September 22nd, 1994, the University of Malaya Council agreed to the formation of the Faculty of Computer Science and Information Technology (FCSIT), and the Computer Services Division. A sum of 4.2 million was obtained from the Ministry of Ed ...[truncated]\n\n[Retrieved Evidence 3]\nSource document: General Handbook\nScope: general\nSection: History of the Faculty\nSubsection: History Overview\nPages: 10\nKnowledge group: KG-0011\nChunk id: SC-00002\nText:\nThe provision of computer facilities and services at the Universiti Malaya (UM) began soon after the Computer Centre was officially formed in 1965. This made the university one of the pioneers in computer usage in Malaysia. In December 1969, the Computer Centre took on an additional role of teaching and research of computer science and information technology. The Computer Centre Board was formed, comprising the Vice-Chancellor (as Chairman), the Director of Computer Centre (as Secretary), and a representative from each Faculty, Institute, Centre and the University Senate. In 1974, the Diploma in Computer Science programme was introduced. From its inception in the 1974/1975 Session to the 199 ...[truncated]\n\nQuestion: What is the history overview of the Faculty of Computer Science and Information Technology?<|im_end|>\n<|im_start|>assistant\n<think>\n\n</think>\n\nUniversiti Malaya's computing facilities began with the Computer Centre in the 1960s, later expanded into teaching and research, introduced the Diploma in Computer Science in 1974 and research degrees in 1985, then moved toward a full faculty structure in 1993. FCSIT was formally approved in 1994, its main building was opened in 1996, the Bachelor of Information Technology began in 1996/97, and four departments were established in 1997.<|im_end|>\n"
|
| 85 |
+
},
|
| 86 |
+
{
|
| 87 |
+
"qa_id": "qa_000003",
|
| 88 |
+
"index_id": "UMI-0003",
|
| 89 |
+
"question": "What key milestones are mentioned in the faculty history?",
|
| 90 |
+
"answer": "Key milestones include the formation of the Computer Centre, the start of teaching and research in 1969, the introduction of the Diploma in Computer Science in 1974, approval and administration of the Master of Computer Science and PhD research programmes in 1985, the 1993 proposal to establish FCSIT, formal approval in 1994, the opening of the faculty building in 1996, the start of the Bachelor of Information Technology in 1996/97, and the establishment of four departments in 1997.",
|
| 91 |
+
"scope_label": "general",
|
| 92 |
+
"source_doc": "General Handbook",
|
| 93 |
+
"section": "History of the Faculty",
|
| 94 |
+
"subsection": "History Overview",
|
| 95 |
+
"retrieved_kb_ids": [
|
| 96 |
+
"RAGKB-0002",
|
| 97 |
+
"RAGKB-0003",
|
| 98 |
+
"RAGKB-0005"
|
| 99 |
+
],
|
| 100 |
+
"retrieved_sections": [
|
| 101 |
+
"History of the Faculty",
|
| 102 |
+
"History of the Faculty",
|
| 103 |
+
"History of the Faculty"
|
| 104 |
+
],
|
| 105 |
+
"retrieved_scopes": [
|
| 106 |
+
"general",
|
| 107 |
+
"general",
|
| 108 |
+
"general"
|
| 109 |
+
],
|
| 110 |
+
"prompt": [
|
| 111 |
+
{
|
| 112 |
+
"role": "system",
|
| 113 |
+
"content": "You are an academic assistant for the Faculty of Computer Science and Information Technology, Universiti Malaya. Answer only with handbook-grounded information. Use the retrieved handbook evidence when it is provided. Keep the answer direct, specific, and faithful to the handbook. If the evidence does not support a claim, say so clearly and briefly."
|
| 114 |
+
},
|
| 115 |
+
{
|
| 116 |
+
"role": "user",
|
| 117 |
+
"content": "Use the retrieved handbook evidence to answer the question. Base the answer on the evidence and keep it faithful to the handbook.\n\n[Retrieved Evidence 1]\nSource document: General Handbook\nScope: general\nSection: History of the Faculty\nSubsection: History Overview\nPages: 10\nKnowledge group: KG-0011\nChunk id: SC-00002\nText:\nThe provision of computer facilities and services at the Universiti Malaya (UM) began soon after the Computer Centre was officially formed in 1965. This made the university one of the pioneers in computer usage in Malaysia. In December 1969, the Computer Centre took on an additional role of teaching and research of computer science and information technology. The Computer Centre Board was formed, comprising the Vice-Chancellor (as Chairman), the Director of Computer Centre (as Secretary), and a representative from each Faculty, Institute, Centre and the University Senate. In 1974, the Diploma in Computer Science programme was introduced. From its inception in the 1974/1975 Session to the 199 ...[truncated]\n\n[Retrieved Evidence 2]\nSource document: General Handbook\nScope: general\nSection: History of the Faculty\nSubsection: History Overview\nPages: 10\nKnowledge group: KG-0011\nChunk id: SC-00003\nText:\nprogramme. The first undergraduate enrolment for the 1990/1991 Session was 50 students. In April 1st, 1993, the University Senate agreed to the formation of the Computer Centre Study Board. The Board proposed the establishment of a faculty to be called the Faculty of Computer Science and Information Technology (FCSIT). The existing Computer Centre was to be annulled and replaced by a Computer Services Division which was placed under the Chancellery. On September 22nd, 1994, the University of Malaya Council agreed to the formation of the Faculty of Computer Science and Information Technology (FCSIT), and the Computer Services Division. A sum of 4.2 million was obtained from the Ministry of Ed ...[truncated]\n\n[Retrieved Evidence 3]\nSource document: General Handbook\nScope: general\nSection: History of the Faculty\nSubsection: History Overview\nPages: 11\nKnowledge group: KG-0011\nChunk id: SC-00005\nText:\nProfessor Dato' Dr. Ir. Mashkuri Hj. Yaacob Professor Dr. Mohd Sapiyan Baba Professor Dr. David Ngo Chek Ling Professor Dr. Wan Ahmad Tajuddin Wan Abdullah Professor Dr. Siti Salwah Salim Professor Dr. Abdullah Gani Professor Dr. Abrizah Abdullah Professor Datin Dr. Sameem Abdul Kareem Professor Dr. Loo Chu Kiong Professor Ir. Dr. Chan Chee Seng Associate Professor Dr. Norisma Idris\n\nQuestion: What key milestones are mentioned in the faculty history?"
|
| 118 |
+
}
|
| 119 |
+
],
|
| 120 |
+
"completion": [
|
| 121 |
+
{
|
| 122 |
+
"role": "assistant",
|
| 123 |
+
"content": "Key milestones include the formation of the Computer Centre, the start of teaching and research in 1969, the introduction of the Diploma in Computer Science in 1974, approval and administration of the Master of Computer Science and PhD research programmes in 1985, the 1993 proposal to establish FCSIT, formal approval in 1994, the opening of the faculty building in 1996, the start of the Bachelor of Information Technology in 1996/97, and the establishment of four departments in 1997."
|
| 124 |
+
}
|
| 125 |
+
],
|
| 126 |
+
"text": "<|im_start|>system\nYou are an academic assistant for the Faculty of Computer Science and Information Technology, Universiti Malaya. Answer only with handbook-grounded information. Use the retrieved handbook evidence when it is provided. Keep the answer direct, specific, and faithful to the handbook. If the evidence does not support a claim, say so clearly and briefly.<|im_end|>\n<|im_start|>user\nUse the retrieved handbook evidence to answer the question. Base the answer on the evidence and keep it faithful to the handbook.\n\n[Retrieved Evidence 1]\nSource document: General Handbook\nScope: general\nSection: History of the Faculty\nSubsection: History Overview\nPages: 10\nKnowledge group: KG-0011\nChunk id: SC-00002\nText:\nThe provision of computer facilities and services at the Universiti Malaya (UM) began soon after the Computer Centre was officially formed in 1965. This made the university one of the pioneers in computer usage in Malaysia. In December 1969, the Computer Centre took on an additional role of teaching and research of computer science and information technology. The Computer Centre Board was formed, comprising the Vice-Chancellor (as Chairman), the Director of Computer Centre (as Secretary), and a representative from each Faculty, Institute, Centre and the University Senate. In 1974, the Diploma in Computer Science programme was introduced. From its inception in the 1974/1975 Session to the 199 ...[truncated]\n\n[Retrieved Evidence 2]\nSource document: General Handbook\nScope: general\nSection: History of the Faculty\nSubsection: History Overview\nPages: 10\nKnowledge group: KG-0011\nChunk id: SC-00003\nText:\nprogramme. The first undergraduate enrolment for the 1990/1991 Session was 50 students. In April 1st, 1993, the University Senate agreed to the formation of the Computer Centre Study Board. The Board proposed the establishment of a faculty to be called the Faculty of Computer Science and Information Technology (FCSIT). The existing Computer Centre was to be annulled and replaced by a Computer Services Division which was placed under the Chancellery. On September 22nd, 1994, the University of Malaya Council agreed to the formation of the Faculty of Computer Science and Information Technology (FCSIT), and the Computer Services Division. A sum of 4.2 million was obtained from the Ministry of Ed ...[truncated]\n\n[Retrieved Evidence 3]\nSource document: General Handbook\nScope: general\nSection: History of the Faculty\nSubsection: History Overview\nPages: 11\nKnowledge group: KG-0011\nChunk id: SC-00005\nText:\nProfessor Dato' Dr. Ir. Mashkuri Hj. Yaacob Professor Dr. Mohd Sapiyan Baba Professor Dr. David Ngo Chek Ling Professor Dr. Wan Ahmad Tajuddin Wan Abdullah Professor Dr. Siti Salwah Salim Professor Dr. Abdullah Gani Professor Dr. Abrizah Abdullah Professor Datin Dr. Sameem Abdul Kareem Professor Dr. Loo Chu Kiong Professor Ir. Dr. Chan Chee Seng Associate Professor Dr. Norisma Idris\n\nQuestion: What key milestones are mentioned in the faculty history?<|im_end|>\n<|im_start|>assistant\n<think>\n\n</think>\n\nKey milestones include the formation of the Computer Centre, the start of teaching and research in 1969, the introduction of the Diploma in Computer Science in 1974, approval and administration of the Master of Computer Science and PhD research programmes in 1985, the 1993 proposal to establish FCSIT, formal approval in 1994, the opening of the faculty building in 1996, the start of the Bachelor of Information Technology in 1996/97, and the establishment of four departments in 1997.<|im_end|>\n"
|
| 127 |
+
},
|
| 128 |
+
{
|
| 129 |
+
"qa_id": "qa_000004",
|
| 130 |
+
"index_id": "UMI-0004",
|
| 131 |
+
"question": "What is the academic calendar for master and doctorate level in session 2025/2026?",
|
| 132 |
+
"answer": "The 2025/2026 master and doctorate academic calendar with Semester I, Semester II, and a special semester, covering orientation, lecture periods, mid-semester breaks, revision weeks, final examinations, and semester breaks. Semester I: Orientation (WOW) 05.10.2025-12.10.2025; lectures 13.10.2025-23.11.2025 and 01.12.2025-25.01.2026; mid-semester break 24.11.2025-30.11.2025; revision week 26.01.2026-01.02.2026; final examination 02.02.2026-22.02.2026; semester break 23.02.2026-08.03.2026. Semester II: lectures 09.03.2026-26.04.2026 and 04.05.2026-21.06.2026; break 27.04.2026-03.05.2026; revision week 22.06.2026-28.06.2026; final examination 29.06.2026-19.07.2026; semester break 20.07.2026-16.08.2026. Special semester: lectures 27.07.2026-13.09.2026; final examination 14.09.2026-20.09.2026; break 21.09.2026-28.09.2026. Public holidays listed include Deepavali, Christmas, New Year, Thaipusam, Federal Territory Day, Chinese New Year, Nuzul Al-Quran, Eidul Fitri, Labour Day, Eidul Adha, Wesak Day, His Majesty the King's Birthday, Awal Muharram, Prophet Muhammad's Birthday, National Day, and Malaysia Day. The notes state that the module registration and examination schedule can be referred to in the UM system guide, the calendar is subject to change, and UM PG iFest 2026 is scheduled for 09-11 June 2026.",
|
| 133 |
+
"scope_label": "postgraduate",
|
| 134 |
+
"source_doc": "General Handbook",
|
| 135 |
+
"section": "Academic Calendar 2025/2026",
|
| 136 |
+
"subsection": "Master and Doctorate Level Academic Calendar",
|
| 137 |
+
"retrieved_kb_ids": [
|
| 138 |
+
"RAGKB-0006",
|
| 139 |
+
"RAGKB-0007",
|
| 140 |
+
"RAGKB-0009"
|
| 141 |
+
],
|
| 142 |
+
"retrieved_sections": [
|
| 143 |
+
"Academic Calendar 2025/2026",
|
| 144 |
+
"Academic Calendar 2025/2026",
|
| 145 |
+
"Academic Calendar 2025/2026"
|
| 146 |
+
],
|
| 147 |
+
"retrieved_scopes": [
|
| 148 |
+
"postgraduate",
|
| 149 |
+
"postgraduate",
|
| 150 |
+
"undergraduate"
|
| 151 |
+
],
|
| 152 |
+
"prompt": [
|
| 153 |
+
{
|
| 154 |
+
"role": "system",
|
| 155 |
+
"content": "You are an academic assistant for the Faculty of Computer Science and Information Technology, Universiti Malaya. Answer only with handbook-grounded information. Use the retrieved handbook evidence when it is provided. Keep the answer direct, specific, and faithful to the handbook. If the evidence does not support a claim, say so clearly and briefly."
|
| 156 |
+
},
|
| 157 |
+
{
|
| 158 |
+
"role": "user",
|
| 159 |
+
"content": "Use the retrieved handbook evidence to answer the question. Base the answer on the evidence and keep it faithful to the handbook.\n\n[Retrieved Evidence 1]\nSource document: General Handbook\nScope: postgraduate\nSection: Academic Calendar 2025/2026\nSubsection: Master and Doctorate Level Academic Calendar\nPages: 4\nKnowledge group: KG-0064\nChunk id: SC-00006\nText:\nLampiran B2 ACADEMIC CALENDAR 2025/2026 ACADEMIC SESSION (MASTER AND DOCTORATE LEVEL) AMENDMENT ‘SEMESTER | ‘Orientation (Week of Welcome)-WOW 1 week 05.10.2025 = «= 12.10.2025 Lectures, 6 weeks\" 19.10.2028 - 23.11.2025 Mid Semester | Break 1 week 24.11.2025 = © 30.11.2025 Lectures. 8 weeks\" 01.12.2025 = © 25.01.2026 Revision Week 1 weeks\" 26.01.2028 © - 01.02.2026 ‘Semester | Final Examination 3 weeks\" 02.02.2026 - © 2202.2026 ‘Semester | Break 2 week 23.02.2028 - 08.03.2026 22 weeks ‘SEMESTER I Lectures: 7 weeks\" 09.03.2026 - 26.04.2026 ‘Mid Semester II Break 1 week 27.04.2028 © - 03.05.2026 Lectures, 7 weeks* 04052026 - 21.06.2026 Revision Week 1 week* 22.06.2026 - 28.06.2026 ‘Semester II ...[truncated]\n\n[Retrieved Evidence 2]\nSource document: General Handbook\nScope: postgraduate\nSection: Academic Calendar 2025/2026\nSubsection: Master and Doctorate Level Academic Calendar\nPages: 4\nKnowledge group: KG-0064\nChunk id: SC-00007\nText:\n(1) The Module Registration and Examination Schedule can be referred to at tips /umsitsquide um edu my. All, information is subject to change. (0) The Academic Calendar has taken into account public and festive holidays and is subject to change Deepavali 20 October 2025 (Monday) CChvistnas Day 2 December 2025 (Thursday) New Year 04 Janvary 2026 (Thursday) ‘Thaipusam 01 February 2026 (Sunday) Federal Tertitory Day 01 February 2026 (Sunday) ‘Chinese New Year 17 & 18 Fetruary 2026 (Tuesday & Wednesday) Nuzul Al-Quran, 07 March 2026 (Saturday) Edu Fn 20.21 March 2026 (Friday & Saturday) Labour Day 1 May 2026 (Friday) idl Adhal 27 May 2026 (Wednesday) Wesak Day 34 May 2026 (Sunday) His Majesty th ...[truncated]\n\n[Retrieved Evidence 3]\nSource document: General Handbook\nScope: undergraduate\nSection: Academic Calendar 2025/2026\nSubsection: Bachelor Degree Level Academic Calendar\nPages: 12\nKnowledge group: KG-0087\nChunk id: SC-00009\nText:\n(1) The Module Registration and Examination Schedule can be referred to at https://umsitsquide um edumy, All information is subject to change. (() The Academic Calendar has taken into account public and festive holidays and is subject to change’ Deepavali 20 October 2028 (Monday) ‘Christmas Day 25 December 2025 (Thursday) New Year 01 January 2026 (Thursday) ‘Thaipusam 01 February 2026 (Sunday) Federal Tertory Day (01 February 2026 (Sunday) Chinese New Year 17 & 18 February 2026 (Tuesday & Wednesday) ‘Nuzul Al-Quran 07 March 2026 (Saturday) Eidul Ft 20 & 21 March 2026 (Friday & Saturday) Labour Day, 01 May 2026 (Friday) Eidul Adna (27 May 2026 (Wednesday) Wesak Day 31 May 2026 (Sunday) His Ma ...[truncated]\n\nQuestion: What is the academic calendar for master and doctorate level in session 2025/2026?"
|
| 160 |
+
}
|
| 161 |
+
],
|
| 162 |
+
"completion": [
|
| 163 |
+
{
|
| 164 |
+
"role": "assistant",
|
| 165 |
+
"content": "The 2025/2026 master and doctorate academic calendar with Semester I, Semester II, and a special semester, covering orientation, lecture periods, mid-semester breaks, revision weeks, final examinations, and semester breaks. Semester I: Orientation (WOW) 05.10.2025-12.10.2025; lectures 13.10.2025-23.11.2025 and 01.12.2025-25.01.2026; mid-semester break 24.11.2025-30.11.2025; revision week 26.01.2026-01.02.2026; final examination 02.02.2026-22.02.2026; semester break 23.02.2026-08.03.2026. Semester II: lectures 09.03.2026-26.04.2026 and 04.05.2026-21.06.2026; break 27.04.2026-03.05.2026; revision week 22.06.2026-28.06.2026; final examination 29.06.2026-19.07.2026; semester break 20.07.2026-16.08.2026. Special semester: lectures 27.07.2026-13.09.2026; final examination 14.09.2026-20.09.2026; break 21.09.2026-28.09.2026. Public holidays listed include Deepavali, Christmas, New Year, Thaipusam, Federal Territory Day, Chinese New Year, Nuzul Al-Quran, Eidul Fitri, Labour Day, Eidul Adha, Wesak Day, His Majesty the King's Birthday, Awal Muharram, Prophet Muhammad's Birthday, National Day, and Malaysia Day. The notes state that the module registration and examination schedule can be referred to in the UM system guide, the calendar is subject to change, and UM PG iFest 2026 is scheduled for 09-11 June 2026."
|
| 166 |
+
}
|
| 167 |
+
],
|
| 168 |
+
"text": "<|im_start|>system\nYou are an academic assistant for the Faculty of Computer Science and Information Technology, Universiti Malaya. Answer only with handbook-grounded information. Use the retrieved handbook evidence when it is provided. Keep the answer direct, specific, and faithful to the handbook. If the evidence does not support a claim, say so clearly and briefly.<|im_end|>\n<|im_start|>user\nUse the retrieved handbook evidence to answer the question. Base the answer on the evidence and keep it faithful to the handbook.\n\n[Retrieved Evidence 1]\nSource document: General Handbook\nScope: postgraduate\nSection: Academic Calendar 2025/2026\nSubsection: Master and Doctorate Level Academic Calendar\nPages: 4\nKnowledge group: KG-0064\nChunk id: SC-00006\nText:\nLampiran B2 ACADEMIC CALENDAR 2025/2026 ACADEMIC SESSION (MASTER AND DOCTORATE LEVEL) AMENDMENT ‘SEMESTER | ‘Orientation (Week of Welcome)-WOW 1 week 05.10.2025 = «= 12.10.2025 Lectures, 6 weeks\" 19.10.2028 - 23.11.2025 Mid Semester | Break 1 week 24.11.2025 = © 30.11.2025 Lectures. 8 weeks\" 01.12.2025 = © 25.01.2026 Revision Week 1 weeks\" 26.01.2028 © - 01.02.2026 ‘Semester | Final Examination 3 weeks\" 02.02.2026 - © 2202.2026 ‘Semester | Break 2 week 23.02.2028 - 08.03.2026 22 weeks ‘SEMESTER I Lectures: 7 weeks\" 09.03.2026 - 26.04.2026 ‘Mid Semester II Break 1 week 27.04.2028 © - 03.05.2026 Lectures, 7 weeks* 04052026 - 21.06.2026 Revision Week 1 week* 22.06.2026 - 28.06.2026 ‘Semester II ...[truncated]\n\n[Retrieved Evidence 2]\nSource document: General Handbook\nScope: postgraduate\nSection: Academic Calendar 2025/2026\nSubsection: Master and Doctorate Level Academic Calendar\nPages: 4\nKnowledge group: KG-0064\nChunk id: SC-00007\nText:\n(1) The Module Registration and Examination Schedule can be referred to at tips /umsitsquide um edu my. All, information is subject to change. (0) The Academic Calendar has taken into account public and festive holidays and is subject to change Deepavali 20 October 2025 (Monday) CChvistnas Day 2 December 2025 (Thursday) New Year 04 Janvary 2026 (Thursday) ‘Thaipusam 01 February 2026 (Sunday) Federal Tertitory Day 01 February 2026 (Sunday) ‘Chinese New Year 17 & 18 Fetruary 2026 (Tuesday & Wednesday) Nuzul Al-Quran, 07 March 2026 (Saturday) Edu Fn 20.21 March 2026 (Friday & Saturday) Labour Day 1 May 2026 (Friday) idl Adhal 27 May 2026 (Wednesday) Wesak Day 34 May 2026 (Sunday) His Majesty th ...[truncated]\n\n[Retrieved Evidence 3]\nSource document: General Handbook\nScope: undergraduate\nSection: Academic Calendar 2025/2026\nSubsection: Bachelor Degree Level Academic Calendar\nPages: 12\nKnowledge group: KG-0087\nChunk id: SC-00009\nText:\n(1) The Module Registration and Examination Schedule can be referred to at https://umsitsquide um edumy, All information is subject to change. (() The Academic Calendar has taken into account public and festive holidays and is subject to change’ Deepavali 20 October 2028 (Monday) ‘Christmas Day 25 December 2025 (Thursday) New Year 01 January 2026 (Thursday) ‘Thaipusam 01 February 2026 (Sunday) Federal Tertory Day (01 February 2026 (Sunday) Chinese New Year 17 & 18 February 2026 (Tuesday & Wednesday) ‘Nuzul Al-Quran 07 March 2026 (Saturday) Eidul Ft 20 & 21 March 2026 (Friday & Saturday) Labour Day, 01 May 2026 (Friday) Eidul Adna (27 May 2026 (Wednesday) Wesak Day 31 May 2026 (Sunday) His Ma ...[truncated]\n\nQuestion: What is the academic calendar for master and doctorate level in session 2025/2026?<|im_end|>\n<|im_start|>assistant\n<think>\n\n</think>\n\nThe 2025/2026 master and doctorate academic calendar with Semester I, Semester II, and a special semester, covering orientation, lecture periods, mid-semester breaks, revision weeks, final examinations, and semester breaks. Semester I: Orientation (WOW) 05.10.2025-12.10.2025; lectures 13.10.2025-23.11.2025 and 01.12.2025-25.01.2026; mid-semester break 24.11.2025-30.11.2025; revision week 26.01.2026-01.02.2026; final examination 02.02.2026-22.02.2026; semester break 23.02.2026-08.03.2026. Semester II: lectures 09.03.2026-26.04.2026 and 04.05.2026-21.06.2026; break 27.04.2026-03.05.2026; revision week 22.06.2026-28.06.2026; final examination 29.06.2026-19.07.2026; semester break 20.07.2026-16.08.2026. Special semester: lectures 27.07.2026-13.09.2026; final examination 14.09.2026-20.09.2026; break 21.09.2026-28.09.2026. Public holidays listed include Deepavali, Christmas, New Year, Thaipusam, Federal Territory Day, Chinese New Year, Nuzul Al-Quran, Eidul Fitri, Labour Day, Eidul Adha, Wesak Day, His Majesty the King's Birthday, Awal Muharram, Prophet Muhammad's Birthday, National Day, and Malaysia Day. The notes state that the module registration and examination schedule can be referred to in the UM system guide, the calendar is subject to change, and UM PG iFest 2026 is scheduled for 09-11 June 2026.<|im_end|>\n"
|
| 169 |
+
},
|
| 170 |
+
{
|
| 171 |
+
"qa_id": "qa_000005",
|
| 172 |
+
"index_id": "UMI-0005",
|
| 173 |
+
"question": "What is the academic calendar for bachelor degree level in session 2025/2026?",
|
| 174 |
+
"answer": "The 2025/2026 bachelor degree academic calendar with Semester I, Semester II, and a special semester, covering orientation, lecture periods, mid-semester breaks, revision weeks, final examinations, and semester breaks. Semester I: Orientation (WOW) 05.10.2025-12.10.2025; lectures 13.10.2025-23.11.2025 and 01.12.2025-25.01.2026; mid-semester break 24.11.2025-30.11.2025; revision week 26.01.2026-01.02.2026; final examination 02.02.2026-22.02.2026; semester break 23.02.2026-08.03.2026. Semester II: lectures 09.03.2026-26.04.2026 and 04.05.2026-21.06.2026; break 27.04.2026-03.05.2026; revision week 22.06.2026-28.06.2026; final examination 29.06.2026-19.07.2026; semester break 20.07.2026-16.08.2026. Special semester: lectures 27.07.2026-13.09.2026; final examination 14.09.2026-20.09.2026; break 21.09.2026-28.09.2026. Public holidays listed include Deepavali, Christmas, New Year, Thaipusam, Federal Territory Day, Chinese New Year, Nuzul Al-Quran, Eidul Fitri, Labour Day, Eidul Adha, Wesak Day, His Majesty the King's Birthday, Awal Muharram, Prophet Muhammad's Birthday, National Day, and Malaysia Day. The notes state that the module registration and examination schedule can be referred to in the UM system guide, the calendar is subject to change, and UM UG iFest 2026 is scheduled for 09-11 June 2026.",
|
| 175 |
+
"scope_label": "undergraduate",
|
| 176 |
+
"source_doc": "General Handbook",
|
| 177 |
+
"section": "Academic Calendar 2025/2026",
|
| 178 |
+
"subsection": "Bachelor Degree Level Academic Calendar",
|
| 179 |
+
"retrieved_kb_ids": [
|
| 180 |
+
"RAGKB-0009",
|
| 181 |
+
"RAGKB-0008",
|
| 182 |
+
"RAGKB-0007"
|
| 183 |
+
],
|
| 184 |
+
"retrieved_sections": [
|
| 185 |
+
"Academic Calendar 2025/2026",
|
| 186 |
+
"Academic Calendar 2025/2026",
|
| 187 |
+
"Academic Calendar 2025/2026"
|
| 188 |
+
],
|
| 189 |
+
"retrieved_scopes": [
|
| 190 |
+
"undergraduate",
|
| 191 |
+
"undergraduate",
|
| 192 |
+
"postgraduate"
|
| 193 |
+
],
|
| 194 |
+
"prompt": [
|
| 195 |
+
{
|
| 196 |
+
"role": "system",
|
| 197 |
+
"content": "You are an academic assistant for the Faculty of Computer Science and Information Technology, Universiti Malaya. Answer only with handbook-grounded information. Use the retrieved handbook evidence when it is provided. Keep the answer direct, specific, and faithful to the handbook. If the evidence does not support a claim, say so clearly and briefly."
|
| 198 |
+
},
|
| 199 |
+
{
|
| 200 |
+
"role": "user",
|
| 201 |
+
"content": "Use the retrieved handbook evidence to answer the question. Base the answer on the evidence and keep it faithful to the handbook.\n\n[Retrieved Evidence 1]\nSource document: General Handbook\nScope: undergraduate\nSection: Academic Calendar 2025/2026\nSubsection: Bachelor Degree Level Academic Calendar\nPages: 12\nKnowledge group: KG-0087\nChunk id: SC-00009\nText:\n(1) The Module Registration and Examination Schedule can be referred to at https://umsitsquide um edumy, All information is subject to change. (() The Academic Calendar has taken into account public and festive holidays and is subject to change’ Deepavali 20 October 2028 (Monday) ‘Christmas Day 25 December 2025 (Thursday) New Year 01 January 2026 (Thursday) ‘Thaipusam 01 February 2026 (Sunday) Federal Tertory Day (01 February 2026 (Sunday) Chinese New Year 17 & 18 February 2026 (Tuesday & Wednesday) ‘Nuzul Al-Quran 07 March 2026 (Saturday) Eidul Ft 20 & 21 March 2026 (Friday & Saturday) Labour Day, 01 May 2026 (Friday) Eidul Adna (27 May 2026 (Wednesday) Wesak Day 31 May 2026 (Sunday) His Ma ...[truncated]\n\n[Retrieved Evidence 2]\nSource document: General Handbook\nScope: undergraduate\nSection: Academic Calendar 2025/2026\nSubsection: Bachelor Degree Level Academic Calendar\nPages: 12\nKnowledge group: KG-0087\nChunk id: SC-00008\nText:\nEy ACADEMIC CALENDAR et SESSION 2025/2026 A eee ACADEMIC CALENDAR 2025/2026 ACADEMIC SESSION (BACHELOR DEGREE LEVEL) AMENDMENT SEMESTER | Orientation (Week of Welcome)-WOW 1 week 05.10.2025 - 12.10.2025 Lectures 6 weeks* 13.10.2025 - = 23.11.2025 Mid Semester | Break 1 week 24.11.2025 - 30.11.2025 Lectures, 8 weeks* 01.12.2025 - — 26.01.2026 Revision Week 1 weeks* 26.01.2026 - 01.02.2026 ‘Semester | Final Examination 3 weeks* 02.02.2026 - 22.02.2026 ‘Semester | Break 2 week 23.02.2026 - 08.03.2026 22 weeks ‘SEMESTER I Lectures 7 weeks* 09.03.2026 - 26.04.2026 Mid Semester II Break 1 week 27.04.2026 - 03.05.2026 Lectures. 7 weeks* 04.05.2026 - 21.06.2026 Revision Week 1 week\" 22.06.2026 - 28. ...[truncated]\n\n[Retrieved Evidence 3]\nSource document: General Handbook\nScope: postgraduate\nSection: Academic Calendar 2025/2026\nSubsection: Master and Doctorate Level Academic Calendar\nPages: 4\nKnowledge group: KG-0064\nChunk id: SC-00007\nText:\n(1) The Module Registration and Examination Schedule can be referred to at tips /umsitsquide um edu my. All, information is subject to change. (0) The Academic Calendar has taken into account public and festive holidays and is subject to change Deepavali 20 October 2025 (Monday) CChvistnas Day 2 December 2025 (Thursday) New Year 04 Janvary 2026 (Thursday) ‘Thaipusam 01 February 2026 (Sunday) Federal Tertitory Day 01 February 2026 (Sunday) ‘Chinese New Year 17 & 18 Fetruary 2026 (Tuesday & Wednesday) Nuzul Al-Quran, 07 March 2026 (Saturday) Edu Fn 20.21 March 2026 (Friday & Saturday) Labour Day 1 May 2026 (Friday) idl Adhal 27 May 2026 (Wednesday) Wesak Day 34 May 2026 (Sunday) His Majesty th ...[truncated]\n\nQuestion: What is the academic calendar for bachelor degree level in session 2025/2026?"
|
| 202 |
+
}
|
| 203 |
+
],
|
| 204 |
+
"completion": [
|
| 205 |
+
{
|
| 206 |
+
"role": "assistant",
|
| 207 |
+
"content": "The 2025/2026 bachelor degree academic calendar with Semester I, Semester II, and a special semester, covering orientation, lecture periods, mid-semester breaks, revision weeks, final examinations, and semester breaks. Semester I: Orientation (WOW) 05.10.2025-12.10.2025; lectures 13.10.2025-23.11.2025 and 01.12.2025-25.01.2026; mid-semester break 24.11.2025-30.11.2025; revision week 26.01.2026-01.02.2026; final examination 02.02.2026-22.02.2026; semester break 23.02.2026-08.03.2026. Semester II: lectures 09.03.2026-26.04.2026 and 04.05.2026-21.06.2026; break 27.04.2026-03.05.2026; revision week 22.06.2026-28.06.2026; final examination 29.06.2026-19.07.2026; semester break 20.07.2026-16.08.2026. Special semester: lectures 27.07.2026-13.09.2026; final examination 14.09.2026-20.09.2026; break 21.09.2026-28.09.2026. Public holidays listed include Deepavali, Christmas, New Year, Thaipusam, Federal Territory Day, Chinese New Year, Nuzul Al-Quran, Eidul Fitri, Labour Day, Eidul Adha, Wesak Day, His Majesty the King's Birthday, Awal Muharram, Prophet Muhammad's Birthday, National Day, and Malaysia Day. The notes state that the module registration and examination schedule can be referred to in the UM system guide, the calendar is subject to change, and UM UG iFest 2026 is scheduled for 09-11 June 2026."
|
| 208 |
+
}
|
| 209 |
+
],
|
| 210 |
+
"text": "<|im_start|>system\nYou are an academic assistant for the Faculty of Computer Science and Information Technology, Universiti Malaya. Answer only with handbook-grounded information. Use the retrieved handbook evidence when it is provided. Keep the answer direct, specific, and faithful to the handbook. If the evidence does not support a claim, say so clearly and briefly.<|im_end|>\n<|im_start|>user\nUse the retrieved handbook evidence to answer the question. Base the answer on the evidence and keep it faithful to the handbook.\n\n[Retrieved Evidence 1]\nSource document: General Handbook\nScope: undergraduate\nSection: Academic Calendar 2025/2026\nSubsection: Bachelor Degree Level Academic Calendar\nPages: 12\nKnowledge group: KG-0087\nChunk id: SC-00009\nText:\n(1) The Module Registration and Examination Schedule can be referred to at https://umsitsquide um edumy, All information is subject to change. (() The Academic Calendar has taken into account public and festive holidays and is subject to change’ Deepavali 20 October 2028 (Monday) ‘Christmas Day 25 December 2025 (Thursday) New Year 01 January 2026 (Thursday) ‘Thaipusam 01 February 2026 (Sunday) Federal Tertory Day (01 February 2026 (Sunday) Chinese New Year 17 & 18 February 2026 (Tuesday & Wednesday) ‘Nuzul Al-Quran 07 March 2026 (Saturday) Eidul Ft 20 & 21 March 2026 (Friday & Saturday) Labour Day, 01 May 2026 (Friday) Eidul Adna (27 May 2026 (Wednesday) Wesak Day 31 May 2026 (Sunday) His Ma ...[truncated]\n\n[Retrieved Evidence 2]\nSource document: General Handbook\nScope: undergraduate\nSection: Academic Calendar 2025/2026\nSubsection: Bachelor Degree Level Academic Calendar\nPages: 12\nKnowledge group: KG-0087\nChunk id: SC-00008\nText:\nEy ACADEMIC CALENDAR et SESSION 2025/2026 A eee ACADEMIC CALENDAR 2025/2026 ACADEMIC SESSION (BACHELOR DEGREE LEVEL) AMENDMENT SEMESTER | Orientation (Week of Welcome)-WOW 1 week 05.10.2025 - 12.10.2025 Lectures 6 weeks* 13.10.2025 - = 23.11.2025 Mid Semester | Break 1 week 24.11.2025 - 30.11.2025 Lectures, 8 weeks* 01.12.2025 - — 26.01.2026 Revision Week 1 weeks* 26.01.2026 - 01.02.2026 ‘Semester | Final Examination 3 weeks* 02.02.2026 - 22.02.2026 ‘Semester | Break 2 week 23.02.2026 - 08.03.2026 22 weeks ‘SEMESTER I Lectures 7 weeks* 09.03.2026 - 26.04.2026 Mid Semester II Break 1 week 27.04.2026 - 03.05.2026 Lectures. 7 weeks* 04.05.2026 - 21.06.2026 Revision Week 1 week\" 22.06.2026 - 28. ...[truncated]\n\n[Retrieved Evidence 3]\nSource document: General Handbook\nScope: postgraduate\nSection: Academic Calendar 2025/2026\nSubsection: Master and Doctorate Level Academic Calendar\nPages: 4\nKnowledge group: KG-0064\nChunk id: SC-00007\nText:\n(1) The Module Registration and Examination Schedule can be referred to at tips /umsitsquide um edu my. All, information is subject to change. (0) The Academic Calendar has taken into account public and festive holidays and is subject to change Deepavali 20 October 2025 (Monday) CChvistnas Day 2 December 2025 (Thursday) New Year 04 Janvary 2026 (Thursday) ‘Thaipusam 01 February 2026 (Sunday) Federal Tertitory Day 01 February 2026 (Sunday) ‘Chinese New Year 17 & 18 Fetruary 2026 (Tuesday & Wednesday) Nuzul Al-Quran, 07 March 2026 (Saturday) Edu Fn 20.21 March 2026 (Friday & Saturday) Labour Day 1 May 2026 (Friday) idl Adhal 27 May 2026 (Wednesday) Wesak Day 34 May 2026 (Sunday) His Majesty th ...[truncated]\n\nQuestion: What is the academic calendar for bachelor degree level in session 2025/2026?<|im_end|>\n<|im_start|>assistant\n<think>\n\n</think>\n\nThe 2025/2026 bachelor degree academic calendar with Semester I, Semester II, and a special semester, covering orientation, lecture periods, mid-semester breaks, revision weeks, final examinations, and semester breaks. Semester I: Orientation (WOW) 05.10.2025-12.10.2025; lectures 13.10.2025-23.11.2025 and 01.12.2025-25.01.2026; mid-semester break 24.11.2025-30.11.2025; revision week 26.01.2026-01.02.2026; final examination 02.02.2026-22.02.2026; semester break 23.02.2026-08.03.2026. Semester II: lectures 09.03.2026-26.04.2026 and 04.05.2026-21.06.2026; break 27.04.2026-03.05.2026; revision week 22.06.2026-28.06.2026; final examination 29.06.2026-19.07.2026; semester break 20.07.2026-16.08.2026. Special semester: lectures 27.07.2026-13.09.2026; final examination 14.09.2026-20.09.2026; break 21.09.2026-28.09.2026. Public holidays listed include Deepavali, Christmas, New Year, Thaipusam, Federal Territory Day, Chinese New Year, Nuzul Al-Quran, Eidul Fitri, Labour Day, Eidul Adha, Wesak Day, His Majesty the King's Birthday, Awal Muharram, Prophet Muhammad's Birthday, National Day, and Malaysia Day. The notes state that the module registration and examination schedule can be referred to in the UM system guide, the calendar is subject to change, and UM UG iFest 2026 is scheduled for 09-11 June 2026.<|im_end|>\n"
|
| 211 |
+
}
|
| 212 |
+
]
|
UM_Handbook/outputs/baseline2_rag_harness_agent/rag_augmented_dataset/rag_text_length_summary.json
ADDED
|
@@ -0,0 +1,11 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
{
|
| 2 |
+
"count": 1000,
|
| 3 |
+
"min_tokens": 446,
|
| 4 |
+
"p50_tokens": 800,
|
| 5 |
+
"p90_tokens": 1036,
|
| 6 |
+
"p95_tokens": 1234,
|
| 7 |
+
"p99_tokens": 1488,
|
| 8 |
+
"max_tokens": 1987,
|
| 9 |
+
"configured_max_seq_length": 1024,
|
| 10 |
+
"examples_over_max_seq_length": 114
|
| 11 |
+
}
|
UM_Handbook/outputs/baseline2_rag_harness_agent/retrieval_eval/retrieval_details.jsonl
ADDED
|
The diff for this file is too large to render.
See raw diff
|
|
|
UM_Handbook/outputs/baseline2_rag_harness_agent/retrieval_eval/retrieval_metrics.json
ADDED
|
@@ -0,0 +1,10 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
{
|
| 2 |
+
"retrieval_eval_size": 1000,
|
| 3 |
+
"top_k": 3,
|
| 4 |
+
"hit_at_1_primary": 0.821,
|
| 5 |
+
"hit_at_k_primary": 0.954,
|
| 6 |
+
"hit_at_k_same_group": 0.991,
|
| 7 |
+
"scope_match_at_1": 0.996,
|
| 8 |
+
"retriever_type": "dense_embedding + faiss + metadata_rerank",
|
| 9 |
+
"embedding_model_name": "BAAI/bge-base-en-v1.5"
|
| 10 |
+
}
|
UM_Handbook/outputs/baseline2_rag_harness_agent/trainer_runs/README.md
ADDED
|
@@ -0,0 +1,57 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
---
|
| 2 |
+
library_name: transformers
|
| 3 |
+
model_name: trainer_runs
|
| 4 |
+
tags:
|
| 5 |
+
- generated_from_trainer
|
| 6 |
+
- sft
|
| 7 |
+
- trl
|
| 8 |
+
licence: license
|
| 9 |
+
---
|
| 10 |
+
|
| 11 |
+
# Model Card for trainer_runs
|
| 12 |
+
|
| 13 |
+
This model is a fine-tuned version of [None](https://huggingface.co/None).
|
| 14 |
+
It has been trained using [TRL](https://github.com/huggingface/trl).
|
| 15 |
+
|
| 16 |
+
## Quick start
|
| 17 |
+
|
| 18 |
+
```python
|
| 19 |
+
from transformers import pipeline
|
| 20 |
+
|
| 21 |
+
question = "If you had a time machine, but could only go to the past or the future once and never return, which would you choose and why?"
|
| 22 |
+
generator = pipeline("text-generation", model="None", device="cuda")
|
| 23 |
+
output = generator([{"role": "user", "content": question}], max_new_tokens=128, return_full_text=False)[0]
|
| 24 |
+
print(output["generated_text"])
|
| 25 |
+
```
|
| 26 |
+
|
| 27 |
+
## Training procedure
|
| 28 |
+
|
| 29 |
+
|
| 30 |
+
|
| 31 |
+
|
| 32 |
+
|
| 33 |
+
This model was trained with SFT.
|
| 34 |
+
|
| 35 |
+
### Framework versions
|
| 36 |
+
|
| 37 |
+
- TRL: 1.1.0
|
| 38 |
+
- Transformers: 5.3.0
|
| 39 |
+
- Pytorch: 2.5.1+cu121
|
| 40 |
+
- Datasets: 2.19.2
|
| 41 |
+
- Tokenizers: 0.22.2
|
| 42 |
+
|
| 43 |
+
## Citations
|
| 44 |
+
|
| 45 |
+
|
| 46 |
+
|
| 47 |
+
Cite TRL as:
|
| 48 |
+
|
| 49 |
+
```bibtex
|
| 50 |
+
@software{vonwerra2020trl,
|
| 51 |
+
title = {{TRL: Transformers Reinforcement Learning}},
|
| 52 |
+
author = {von Werra, Leandro and Belkada, Younes and Tunstall, Lewis and Beeching, Edward and Thrush, Tristan and Lambert, Nathan and Huang, Shengyi and Rasul, Kashif and Gallouédec, Quentin},
|
| 53 |
+
license = {Apache-2.0},
|
| 54 |
+
url = {https://github.com/huggingface/trl},
|
| 55 |
+
year = {2020}
|
| 56 |
+
}
|
| 57 |
+
```
|
UM_Handbook/outputs/baseline2_rag_harness_agent/trainer_runs/checkpoint-1100/README.md
ADDED
|
@@ -0,0 +1,209 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
---
|
| 2 |
+
base_model: /scr/user/kevin2002/TensorCat/NLP/UM_Handbook/models/Qwen3-8B
|
| 3 |
+
library_name: peft
|
| 4 |
+
pipeline_tag: text-generation
|
| 5 |
+
tags:
|
| 6 |
+
- base_model:adapter:/scr/user/kevin2002/TensorCat/NLP/UM_Handbook/models/Qwen3-8B
|
| 7 |
+
- lora
|
| 8 |
+
- sft
|
| 9 |
+
- transformers
|
| 10 |
+
- trl
|
| 11 |
+
---
|
| 12 |
+
|
| 13 |
+
# Model Card for Model ID
|
| 14 |
+
|
| 15 |
+
<!-- Provide a quick summary of what the model is/does. -->
|
| 16 |
+
|
| 17 |
+
|
| 18 |
+
|
| 19 |
+
## Model Details
|
| 20 |
+
|
| 21 |
+
### Model Description
|
| 22 |
+
|
| 23 |
+
<!-- Provide a longer summary of what this model is. -->
|
| 24 |
+
|
| 25 |
+
|
| 26 |
+
|
| 27 |
+
- **Developed by:** [More Information Needed]
|
| 28 |
+
- **Funded by [optional]:** [More Information Needed]
|
| 29 |
+
- **Shared by [optional]:** [More Information Needed]
|
| 30 |
+
- **Model type:** [More Information Needed]
|
| 31 |
+
- **Language(s) (NLP):** [More Information Needed]
|
| 32 |
+
- **License:** [More Information Needed]
|
| 33 |
+
- **Finetuned from model [optional]:** [More Information Needed]
|
| 34 |
+
|
| 35 |
+
### Model Sources [optional]
|
| 36 |
+
|
| 37 |
+
<!-- Provide the basic links for the model. -->
|
| 38 |
+
|
| 39 |
+
- **Repository:** [More Information Needed]
|
| 40 |
+
- **Paper [optional]:** [More Information Needed]
|
| 41 |
+
- **Demo [optional]:** [More Information Needed]
|
| 42 |
+
|
| 43 |
+
## Uses
|
| 44 |
+
|
| 45 |
+
<!-- Address questions around how the model is intended to be used, including the foreseeable users of the model and those affected by the model. -->
|
| 46 |
+
|
| 47 |
+
### Direct Use
|
| 48 |
+
|
| 49 |
+
<!-- This section is for the model use without fine-tuning or plugging into a larger ecosystem/app. -->
|
| 50 |
+
|
| 51 |
+
[More Information Needed]
|
| 52 |
+
|
| 53 |
+
### Downstream Use [optional]
|
| 54 |
+
|
| 55 |
+
<!-- This section is for the model use when fine-tuned for a task, or when plugged into a larger ecosystem/app -->
|
| 56 |
+
|
| 57 |
+
[More Information Needed]
|
| 58 |
+
|
| 59 |
+
### Out-of-Scope Use
|
| 60 |
+
|
| 61 |
+
<!-- This section addresses misuse, malicious use, and uses that the model will not work well for. -->
|
| 62 |
+
|
| 63 |
+
[More Information Needed]
|
| 64 |
+
|
| 65 |
+
## Bias, Risks, and Limitations
|
| 66 |
+
|
| 67 |
+
<!-- This section is meant to convey both technical and sociotechnical limitations. -->
|
| 68 |
+
|
| 69 |
+
[More Information Needed]
|
| 70 |
+
|
| 71 |
+
### Recommendations
|
| 72 |
+
|
| 73 |
+
<!-- This section is meant to convey recommendations with respect to the bias, risk, and technical limitations. -->
|
| 74 |
+
|
| 75 |
+
Users (both direct and downstream) should be made aware of the risks, biases and limitations of the model. More information needed for further recommendations.
|
| 76 |
+
|
| 77 |
+
## How to Get Started with the Model
|
| 78 |
+
|
| 79 |
+
Use the code below to get started with the model.
|
| 80 |
+
|
| 81 |
+
[More Information Needed]
|
| 82 |
+
|
| 83 |
+
## Training Details
|
| 84 |
+
|
| 85 |
+
### Training Data
|
| 86 |
+
|
| 87 |
+
<!-- This should link to a Dataset Card, perhaps with a short stub of information on what the training data is all about as well as documentation related to data pre-processing or additional filtering. -->
|
| 88 |
+
|
| 89 |
+
[More Information Needed]
|
| 90 |
+
|
| 91 |
+
### Training Procedure
|
| 92 |
+
|
| 93 |
+
<!-- This relates heavily to the Technical Specifications. Content here should link to that section when it is relevant to the training procedure. -->
|
| 94 |
+
|
| 95 |
+
#### Preprocessing [optional]
|
| 96 |
+
|
| 97 |
+
[More Information Needed]
|
| 98 |
+
|
| 99 |
+
|
| 100 |
+
#### Training Hyperparameters
|
| 101 |
+
|
| 102 |
+
- **Training regime:** [More Information Needed] <!--fp32, fp16 mixed precision, bf16 mixed precision, bf16 non-mixed precision, fp16 non-mixed precision, fp8 mixed precision -->
|
| 103 |
+
|
| 104 |
+
#### Speeds, Sizes, Times [optional]
|
| 105 |
+
|
| 106 |
+
<!-- This section provides information about throughput, start/end time, checkpoint size if relevant, etc. -->
|
| 107 |
+
|
| 108 |
+
[More Information Needed]
|
| 109 |
+
|
| 110 |
+
## Evaluation
|
| 111 |
+
|
| 112 |
+
<!-- This section describes the evaluation protocols and provides the results. -->
|
| 113 |
+
|
| 114 |
+
### Testing Data, Factors & Metrics
|
| 115 |
+
|
| 116 |
+
#### Testing Data
|
| 117 |
+
|
| 118 |
+
<!-- This should link to a Dataset Card if possible. -->
|
| 119 |
+
|
| 120 |
+
[More Information Needed]
|
| 121 |
+
|
| 122 |
+
#### Factors
|
| 123 |
+
|
| 124 |
+
<!-- These are the things the evaluation is disaggregating by, e.g., subpopulations or domains. -->
|
| 125 |
+
|
| 126 |
+
[More Information Needed]
|
| 127 |
+
|
| 128 |
+
#### Metrics
|
| 129 |
+
|
| 130 |
+
<!-- These are the evaluation metrics being used, ideally with a description of why. -->
|
| 131 |
+
|
| 132 |
+
[More Information Needed]
|
| 133 |
+
|
| 134 |
+
### Results
|
| 135 |
+
|
| 136 |
+
[More Information Needed]
|
| 137 |
+
|
| 138 |
+
#### Summary
|
| 139 |
+
|
| 140 |
+
|
| 141 |
+
|
| 142 |
+
## Model Examination [optional]
|
| 143 |
+
|
| 144 |
+
<!-- Relevant interpretability work for the model goes here -->
|
| 145 |
+
|
| 146 |
+
[More Information Needed]
|
| 147 |
+
|
| 148 |
+
## Environmental Impact
|
| 149 |
+
|
| 150 |
+
<!-- Total emissions (in grams of CO2eq) and additional considerations, such as electricity usage, go here. Edit the suggested text below accordingly -->
|
| 151 |
+
|
| 152 |
+
Carbon emissions can be estimated using the [Machine Learning Impact calculator](https://mlco2.github.io/impact#compute) presented in [Lacoste et al. (2019)](https://arxiv.org/abs/1910.09700).
|
| 153 |
+
|
| 154 |
+
- **Hardware Type:** [More Information Needed]
|
| 155 |
+
- **Hours used:** [More Information Needed]
|
| 156 |
+
- **Cloud Provider:** [More Information Needed]
|
| 157 |
+
- **Compute Region:** [More Information Needed]
|
| 158 |
+
- **Carbon Emitted:** [More Information Needed]
|
| 159 |
+
|
| 160 |
+
## Technical Specifications [optional]
|
| 161 |
+
|
| 162 |
+
### Model Architecture and Objective
|
| 163 |
+
|
| 164 |
+
[More Information Needed]
|
| 165 |
+
|
| 166 |
+
### Compute Infrastructure
|
| 167 |
+
|
| 168 |
+
[More Information Needed]
|
| 169 |
+
|
| 170 |
+
#### Hardware
|
| 171 |
+
|
| 172 |
+
[More Information Needed]
|
| 173 |
+
|
| 174 |
+
#### Software
|
| 175 |
+
|
| 176 |
+
[More Information Needed]
|
| 177 |
+
|
| 178 |
+
## Citation [optional]
|
| 179 |
+
|
| 180 |
+
<!-- If there is a paper or blog post introducing the model, the APA and Bibtex information for that should go in this section. -->
|
| 181 |
+
|
| 182 |
+
**BibTeX:**
|
| 183 |
+
|
| 184 |
+
[More Information Needed]
|
| 185 |
+
|
| 186 |
+
**APA:**
|
| 187 |
+
|
| 188 |
+
[More Information Needed]
|
| 189 |
+
|
| 190 |
+
## Glossary [optional]
|
| 191 |
+
|
| 192 |
+
<!-- If relevant, include terms and calculations in this section that can help readers understand the model or model card. -->
|
| 193 |
+
|
| 194 |
+
[More Information Needed]
|
| 195 |
+
|
| 196 |
+
## More Information [optional]
|
| 197 |
+
|
| 198 |
+
[More Information Needed]
|
| 199 |
+
|
| 200 |
+
## Model Card Authors [optional]
|
| 201 |
+
|
| 202 |
+
[More Information Needed]
|
| 203 |
+
|
| 204 |
+
## Model Card Contact
|
| 205 |
+
|
| 206 |
+
[More Information Needed]
|
| 207 |
+
### Framework versions
|
| 208 |
+
|
| 209 |
+
- PEFT 0.18.1
|
UM_Handbook/outputs/baseline2_rag_harness_agent/trainer_runs/checkpoint-1100/adapter_config.json
ADDED
|
@@ -0,0 +1,46 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
{
|
| 2 |
+
"alora_invocation_tokens": null,
|
| 3 |
+
"alpha_pattern": {},
|
| 4 |
+
"arrow_config": null,
|
| 5 |
+
"auto_mapping": null,
|
| 6 |
+
"base_model_name_or_path": "/scr/user/kevin2002/TensorCat/NLP/UM_Handbook/models/Qwen3-8B",
|
| 7 |
+
"bias": "none",
|
| 8 |
+
"corda_config": null,
|
| 9 |
+
"ensure_weight_tying": false,
|
| 10 |
+
"eva_config": null,
|
| 11 |
+
"exclude_modules": null,
|
| 12 |
+
"fan_in_fan_out": false,
|
| 13 |
+
"inference_mode": true,
|
| 14 |
+
"init_lora_weights": true,
|
| 15 |
+
"layer_replication": null,
|
| 16 |
+
"layers_pattern": null,
|
| 17 |
+
"layers_to_transform": null,
|
| 18 |
+
"loftq_config": {},
|
| 19 |
+
"lora_alpha": 16,
|
| 20 |
+
"lora_bias": false,
|
| 21 |
+
"lora_dropout": 0.05,
|
| 22 |
+
"megatron_config": null,
|
| 23 |
+
"megatron_core": "megatron.core",
|
| 24 |
+
"modules_to_save": null,
|
| 25 |
+
"peft_type": "LORA",
|
| 26 |
+
"peft_version": "0.18.1",
|
| 27 |
+
"qalora_group_size": 16,
|
| 28 |
+
"r": 8,
|
| 29 |
+
"rank_pattern": {},
|
| 30 |
+
"revision": null,
|
| 31 |
+
"target_modules": [
|
| 32 |
+
"gate_proj",
|
| 33 |
+
"v_proj",
|
| 34 |
+
"q_proj",
|
| 35 |
+
"k_proj",
|
| 36 |
+
"up_proj",
|
| 37 |
+
"down_proj",
|
| 38 |
+
"o_proj"
|
| 39 |
+
],
|
| 40 |
+
"target_parameters": null,
|
| 41 |
+
"task_type": "CAUSAL_LM",
|
| 42 |
+
"trainable_token_indices": null,
|
| 43 |
+
"use_dora": false,
|
| 44 |
+
"use_qalora": false,
|
| 45 |
+
"use_rslora": false
|
| 46 |
+
}
|
UM_Handbook/outputs/baseline2_rag_harness_agent/trainer_runs/checkpoint-1100/adapter_model.safetensors
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:54282e9eff87e798234f8fa408e6caf2409416eed4516465894c29c08265a18e
|
| 3 |
+
size 87360584
|
UM_Handbook/outputs/baseline2_rag_harness_agent/trainer_runs/checkpoint-1100/chat_template.jinja
ADDED
|
@@ -0,0 +1,89 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
{%- if tools %}
|
| 2 |
+
{{- '<|im_start|>system\n' }}
|
| 3 |
+
{%- if messages[0].role == 'system' %}
|
| 4 |
+
{{- messages[0].content + '\n\n' }}
|
| 5 |
+
{%- endif %}
|
| 6 |
+
{{- "# Tools\n\nYou may call one or more functions to assist with the user query.\n\nYou are provided with function signatures within <tools></tools> XML tags:\n<tools>" }}
|
| 7 |
+
{%- for tool in tools %}
|
| 8 |
+
{{- "\n" }}
|
| 9 |
+
{{- tool | tojson }}
|
| 10 |
+
{%- endfor %}
|
| 11 |
+
{{- "\n</tools>\n\nFor each function call, return a json object with function name and arguments within <tool_call></tool_call> XML tags:\n<tool_call>\n{\"name\": <function-name>, \"arguments\": <args-json-object>}\n</tool_call><|im_end|>\n" }}
|
| 12 |
+
{%- else %}
|
| 13 |
+
{%- if messages[0].role == 'system' %}
|
| 14 |
+
{{- '<|im_start|>system\n' + messages[0].content + '<|im_end|>\n' }}
|
| 15 |
+
{%- endif %}
|
| 16 |
+
{%- endif %}
|
| 17 |
+
{%- set ns = namespace(multi_step_tool=true, last_query_index=messages|length - 1) %}
|
| 18 |
+
{%- for message in messages[::-1] %}
|
| 19 |
+
{%- set index = (messages|length - 1) - loop.index0 %}
|
| 20 |
+
{%- if ns.multi_step_tool and message.role == "user" and message.content is string and not(message.content.startswith('<tool_response>') and message.content.endswith('</tool_response>')) %}
|
| 21 |
+
{%- set ns.multi_step_tool = false %}
|
| 22 |
+
{%- set ns.last_query_index = index %}
|
| 23 |
+
{%- endif %}
|
| 24 |
+
{%- endfor %}
|
| 25 |
+
{%- for message in messages %}
|
| 26 |
+
{%- if message.content is string %}
|
| 27 |
+
{%- set content = message.content %}
|
| 28 |
+
{%- else %}
|
| 29 |
+
{%- set content = '' %}
|
| 30 |
+
{%- endif %}
|
| 31 |
+
{%- if (message.role == "user") or (message.role == "system" and not loop.first) %}
|
| 32 |
+
{{- '<|im_start|>' + message.role + '\n' + content + '<|im_end|>' + '\n' }}
|
| 33 |
+
{%- elif message.role == "assistant" %}
|
| 34 |
+
{%- set reasoning_content = '' %}
|
| 35 |
+
{%- if message.reasoning_content is string %}
|
| 36 |
+
{%- set reasoning_content = message.reasoning_content %}
|
| 37 |
+
{%- else %}
|
| 38 |
+
{%- if '</think>' in content %}
|
| 39 |
+
{%- set reasoning_content = content.split('</think>')[0].rstrip('\n').split('<think>')[-1].lstrip('\n') %}
|
| 40 |
+
{%- set content = content.split('</think>')[-1].lstrip('\n') %}
|
| 41 |
+
{%- endif %}
|
| 42 |
+
{%- endif %}
|
| 43 |
+
{%- if loop.index0 > ns.last_query_index %}
|
| 44 |
+
{%- if loop.last or (not loop.last and reasoning_content) %}
|
| 45 |
+
{{- '<|im_start|>' + message.role + '\n<think>\n' + reasoning_content.strip('\n') + '\n</think>\n\n' + content.lstrip('\n') }}
|
| 46 |
+
{%- else %}
|
| 47 |
+
{{- '<|im_start|>' + message.role + '\n' + content }}
|
| 48 |
+
{%- endif %}
|
| 49 |
+
{%- else %}
|
| 50 |
+
{{- '<|im_start|>' + message.role + '\n' + content }}
|
| 51 |
+
{%- endif %}
|
| 52 |
+
{%- if message.tool_calls %}
|
| 53 |
+
{%- for tool_call in message.tool_calls %}
|
| 54 |
+
{%- if (loop.first and content) or (not loop.first) %}
|
| 55 |
+
{{- '\n' }}
|
| 56 |
+
{%- endif %}
|
| 57 |
+
{%- if tool_call.function %}
|
| 58 |
+
{%- set tool_call = tool_call.function %}
|
| 59 |
+
{%- endif %}
|
| 60 |
+
{{- '<tool_call>\n{"name": "' }}
|
| 61 |
+
{{- tool_call.name }}
|
| 62 |
+
{{- '", "arguments": ' }}
|
| 63 |
+
{%- if tool_call.arguments is string %}
|
| 64 |
+
{{- tool_call.arguments }}
|
| 65 |
+
{%- else %}
|
| 66 |
+
{{- tool_call.arguments | tojson }}
|
| 67 |
+
{%- endif %}
|
| 68 |
+
{{- '}\n</tool_call>' }}
|
| 69 |
+
{%- endfor %}
|
| 70 |
+
{%- endif %}
|
| 71 |
+
{{- '<|im_end|>\n' }}
|
| 72 |
+
{%- elif message.role == "tool" %}
|
| 73 |
+
{%- if loop.first or (messages[loop.index0 - 1].role != "tool") %}
|
| 74 |
+
{{- '<|im_start|>user' }}
|
| 75 |
+
{%- endif %}
|
| 76 |
+
{{- '\n<tool_response>\n' }}
|
| 77 |
+
{{- content }}
|
| 78 |
+
{{- '\n</tool_response>' }}
|
| 79 |
+
{%- if loop.last or (messages[loop.index0 + 1].role != "tool") %}
|
| 80 |
+
{{- '<|im_end|>\n' }}
|
| 81 |
+
{%- endif %}
|
| 82 |
+
{%- endif %}
|
| 83 |
+
{%- endfor %}
|
| 84 |
+
{%- if add_generation_prompt %}
|
| 85 |
+
{{- '<|im_start|>assistant\n' }}
|
| 86 |
+
{%- if enable_thinking is defined and enable_thinking is false %}
|
| 87 |
+
{{- '<think>\n\n</think>\n\n' }}
|
| 88 |
+
{%- endif %}
|
| 89 |
+
{%- endif %}
|
UM_Handbook/outputs/baseline2_rag_harness_agent/trainer_runs/checkpoint-1100/optimizer.pt
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:a1aa3f44290e2213c380548285e633d4ec72f400599c12bf70522b5b505b4627
|
| 3 |
+
size 175019258
|
UM_Handbook/outputs/baseline2_rag_harness_agent/trainer_runs/checkpoint-1100/rng_state.pth
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:ddf117f629d64665e096b2be07b01ac7ba6e77931c35740af816bf218c7b9127
|
| 3 |
+
size 14244
|
UM_Handbook/outputs/baseline2_rag_harness_agent/trainer_runs/checkpoint-1100/scheduler.pt
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:85590a667b733e66a7f08005b8e865d1bbbeba867a23d9a471017de48d623149
|
| 3 |
+
size 1064
|
UM_Handbook/outputs/baseline2_rag_harness_agent/trainer_runs/checkpoint-1100/tokenizer.json
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:be75606093db2094d7cd20f3c2f385c212750648bd6ea4fb2bf507a6a4c55506
|
| 3 |
+
size 11422650
|
UM_Handbook/outputs/baseline2_rag_harness_agent/trainer_runs/checkpoint-1100/tokenizer_config.json
ADDED
|
@@ -0,0 +1,29 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
{
|
| 2 |
+
"add_prefix_space": false,
|
| 3 |
+
"backend": "tokenizers",
|
| 4 |
+
"bos_token": null,
|
| 5 |
+
"clean_up_tokenization_spaces": false,
|
| 6 |
+
"eos_token": "<|im_end|>",
|
| 7 |
+
"errors": "replace",
|
| 8 |
+
"extra_special_tokens": [
|
| 9 |
+
"<|im_start|>",
|
| 10 |
+
"<|im_end|>",
|
| 11 |
+
"<|object_ref_start|>",
|
| 12 |
+
"<|object_ref_end|>",
|
| 13 |
+
"<|box_start|>",
|
| 14 |
+
"<|box_end|>",
|
| 15 |
+
"<|quad_start|>",
|
| 16 |
+
"<|quad_end|>",
|
| 17 |
+
"<|vision_start|>",
|
| 18 |
+
"<|vision_end|>",
|
| 19 |
+
"<|vision_pad|>",
|
| 20 |
+
"<|image_pad|>",
|
| 21 |
+
"<|video_pad|>"
|
| 22 |
+
],
|
| 23 |
+
"is_local": true,
|
| 24 |
+
"model_max_length": 131072,
|
| 25 |
+
"pad_token": "<|endoftext|>",
|
| 26 |
+
"split_special_tokens": false,
|
| 27 |
+
"tokenizer_class": "Qwen2Tokenizer",
|
| 28 |
+
"unk_token": null
|
| 29 |
+
}
|
UM_Handbook/outputs/baseline2_rag_harness_agent/trainer_runs/checkpoint-1100/trainer_state.json
ADDED
|
@@ -0,0 +1,604 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
{
|
| 2 |
+
"best_global_step": 1100,
|
| 3 |
+
"best_metric": 0.11906072497367859,
|
| 4 |
+
"best_model_checkpoint": "/scr/user/kevin2002/TensorCat/NLP/UM_Handbook/outputs/baseline2_rag_harness_agent/trainer_runs/checkpoint-1100",
|
| 5 |
+
"epoch": 11.0,
|
| 6 |
+
"eval_steps": 500,
|
| 7 |
+
"global_step": 1100,
|
| 8 |
+
"is_hyper_param_search": false,
|
| 9 |
+
"is_local_process_zero": true,
|
| 10 |
+
"is_world_process_zero": true,
|
| 11 |
+
"log_history": [
|
| 12 |
+
{
|
| 13 |
+
"entropy": 1.4356346559524535,
|
| 14 |
+
"epoch": 0.25,
|
| 15 |
+
"grad_norm": 0.24204915761947632,
|
| 16 |
+
"learning_rate": 7.999919440291627e-05,
|
| 17 |
+
"loss": 2.3678880310058594,
|
| 18 |
+
"mean_token_accuracy": 0.5813769394159317,
|
| 19 |
+
"num_tokens": 160725.0,
|
| 20 |
+
"step": 25
|
| 21 |
+
},
|
| 22 |
+
{
|
| 23 |
+
"entropy": 1.8811942958831787,
|
| 24 |
+
"epoch": 0.5,
|
| 25 |
+
"grad_norm": 0.17289401590824127,
|
| 26 |
+
"learning_rate": 7.995766313163572e-05,
|
| 27 |
+
"loss": 1.7852125549316407,
|
| 28 |
+
"mean_token_accuracy": 0.6483715295791626,
|
| 29 |
+
"num_tokens": 327066.0,
|
| 30 |
+
"step": 50
|
| 31 |
+
},
|
| 32 |
+
{
|
| 33 |
+
"entropy": 1.4657313966751098,
|
| 34 |
+
"epoch": 0.75,
|
| 35 |
+
"grad_norm": 0.16354545950889587,
|
| 36 |
+
"learning_rate": 7.985326923450766e-05,
|
| 37 |
+
"loss": 1.4699134826660156,
|
| 38 |
+
"mean_token_accuracy": 0.7016089594364167,
|
| 39 |
+
"num_tokens": 490539.0,
|
| 40 |
+
"step": 75
|
| 41 |
+
},
|
| 42 |
+
{
|
| 43 |
+
"entropy": 1.3245354783535004,
|
| 44 |
+
"epoch": 1.0,
|
| 45 |
+
"grad_norm": 0.279362291097641,
|
| 46 |
+
"learning_rate": 7.968617694722514e-05,
|
| 47 |
+
"loss": 1.3126080322265625,
|
| 48 |
+
"mean_token_accuracy": 0.7263329613208771,
|
| 49 |
+
"num_tokens": 656891.0,
|
| 50 |
+
"step": 100
|
| 51 |
+
},
|
| 52 |
+
{
|
| 53 |
+
"epoch": 1.0,
|
| 54 |
+
"eval_entropy": 1.1590555493648236,
|
| 55 |
+
"eval_loss": 1.1591017246246338,
|
| 56 |
+
"eval_mean_token_accuracy": 0.7455048836194552,
|
| 57 |
+
"eval_num_tokens": 656891.0,
|
| 58 |
+
"eval_runtime": 14.0274,
|
| 59 |
+
"eval_samples_per_second": 7.129,
|
| 60 |
+
"eval_steps_per_second": 0.927,
|
| 61 |
+
"step": 100
|
| 62 |
+
},
|
| 63 |
+
{
|
| 64 |
+
"entropy": 0.9851375865936279,
|
| 65 |
+
"epoch": 1.25,
|
| 66 |
+
"grad_norm": 0.405873566865921,
|
| 67 |
+
"learning_rate": 7.945664914451888e-05,
|
| 68 |
+
"loss": 0.9941243743896484,
|
| 69 |
+
"mean_token_accuracy": 0.7774216139316559,
|
| 70 |
+
"num_tokens": 820900.0,
|
| 71 |
+
"step": 125
|
| 72 |
+
},
|
| 73 |
+
{
|
| 74 |
+
"entropy": 0.9000456941127777,
|
| 75 |
+
"epoch": 1.5,
|
| 76 |
+
"grad_norm": 0.4880390465259552,
|
| 77 |
+
"learning_rate": 7.916504692659478e-05,
|
| 78 |
+
"loss": 0.8994422149658203,
|
| 79 |
+
"mean_token_accuracy": 0.8044012486934662,
|
| 80 |
+
"num_tokens": 983751.0,
|
| 81 |
+
"step": 150
|
| 82 |
+
},
|
| 83 |
+
{
|
| 84 |
+
"entropy": 0.7057543742656708,
|
| 85 |
+
"epoch": 1.75,
|
| 86 |
+
"grad_norm": 0.6458895206451416,
|
| 87 |
+
"learning_rate": 7.881182905103986e-05,
|
| 88 |
+
"loss": 0.69064697265625,
|
| 89 |
+
"mean_token_accuracy": 0.8439933550357819,
|
| 90 |
+
"num_tokens": 1148624.0,
|
| 91 |
+
"step": 175
|
| 92 |
+
},
|
| 93 |
+
{
|
| 94 |
+
"entropy": 0.5627843302488327,
|
| 95 |
+
"epoch": 2.0,
|
| 96 |
+
"grad_norm": 0.5327032804489136,
|
| 97 |
+
"learning_rate": 7.839755121109077e-05,
|
| 98 |
+
"loss": 0.5385809707641601,
|
| 99 |
+
"mean_token_accuracy": 0.8772480082511902,
|
| 100 |
+
"num_tokens": 1313782.0,
|
| 101 |
+
"step": 200
|
| 102 |
+
},
|
| 103 |
+
{
|
| 104 |
+
"epoch": 2.0,
|
| 105 |
+
"eval_entropy": 0.6312807431587806,
|
| 106 |
+
"eval_loss": 0.5925119519233704,
|
| 107 |
+
"eval_mean_token_accuracy": 0.861605126124162,
|
| 108 |
+
"eval_num_tokens": 1313782.0,
|
| 109 |
+
"eval_runtime": 14.0529,
|
| 110 |
+
"eval_samples_per_second": 7.116,
|
| 111 |
+
"eval_steps_per_second": 0.925,
|
| 112 |
+
"step": 200
|
| 113 |
+
},
|
| 114 |
+
{
|
| 115 |
+
"entropy": 0.44288761138916016,
|
| 116 |
+
"epoch": 2.25,
|
| 117 |
+
"grad_norm": 0.5565536022186279,
|
| 118 |
+
"learning_rate": 7.792286516139999e-05,
|
| 119 |
+
"loss": 0.4078195571899414,
|
| 120 |
+
"mean_token_accuracy": 0.9039785468578339,
|
| 121 |
+
"num_tokens": 1480714.0,
|
| 122 |
+
"step": 225
|
| 123 |
+
},
|
| 124 |
+
{
|
| 125 |
+
"entropy": 0.44057594060897826,
|
| 126 |
+
"epoch": 2.5,
|
| 127 |
+
"grad_norm": 1.3234641551971436,
|
| 128 |
+
"learning_rate": 7.738851769267544e-05,
|
| 129 |
+
"loss": 0.4113604736328125,
|
| 130 |
+
"mean_token_accuracy": 0.9079041624069214,
|
| 131 |
+
"num_tokens": 1645148.0,
|
| 132 |
+
"step": 250
|
| 133 |
+
},
|
| 134 |
+
{
|
| 135 |
+
"entropy": 0.34622341573238374,
|
| 136 |
+
"epoch": 2.75,
|
| 137 |
+
"grad_norm": 0.4904310405254364,
|
| 138 |
+
"learning_rate": 7.679534945680635e-05,
|
| 139 |
+
"loss": 0.316553897857666,
|
| 140 |
+
"mean_token_accuracy": 0.9257376372814179,
|
| 141 |
+
"num_tokens": 1805051.0,
|
| 142 |
+
"step": 275
|
| 143 |
+
},
|
| 144 |
+
{
|
| 145 |
+
"entropy": 0.37137761026620864,
|
| 146 |
+
"epoch": 3.0,
|
| 147 |
+
"grad_norm": 0.8455039858818054,
|
| 148 |
+
"learning_rate": 7.614429364432367e-05,
|
| 149 |
+
"loss": 0.3286180114746094,
|
| 150 |
+
"mean_token_accuracy": 0.9244595277309418,
|
| 151 |
+
"num_tokens": 1970673.0,
|
| 152 |
+
"step": 300
|
| 153 |
+
},
|
| 154 |
+
{
|
| 155 |
+
"epoch": 3.0,
|
| 156 |
+
"eval_entropy": 0.42906437584987056,
|
| 157 |
+
"eval_loss": 0.37689468264579773,
|
| 158 |
+
"eval_mean_token_accuracy": 0.9097504982581506,
|
| 159 |
+
"eval_num_tokens": 1970673.0,
|
| 160 |
+
"eval_runtime": 14.1516,
|
| 161 |
+
"eval_samples_per_second": 7.066,
|
| 162 |
+
"eval_steps_per_second": 0.919,
|
| 163 |
+
"step": 300
|
| 164 |
+
},
|
| 165 |
+
{
|
| 166 |
+
"entropy": 0.25241469651460646,
|
| 167 |
+
"epoch": 3.25,
|
| 168 |
+
"grad_norm": 0.4860665500164032,
|
| 169 |
+
"learning_rate": 7.543637451627623e-05,
|
| 170 |
+
"loss": 0.20183673858642578,
|
| 171 |
+
"mean_token_accuracy": 0.9500378572940826,
|
| 172 |
+
"num_tokens": 2133206.0,
|
| 173 |
+
"step": 325
|
| 174 |
+
},
|
| 175 |
+
{
|
| 176 |
+
"entropy": 0.2856204105913639,
|
| 177 |
+
"epoch": 3.5,
|
| 178 |
+
"grad_norm": 0.7240539193153381,
|
| 179 |
+
"learning_rate": 7.467270579283153e-05,
|
| 180 |
+
"loss": 0.2465129852294922,
|
| 181 |
+
"mean_token_accuracy": 0.944399631023407,
|
| 182 |
+
"num_tokens": 2297449.0,
|
| 183 |
+
"step": 350
|
| 184 |
+
},
|
| 185 |
+
{
|
| 186 |
+
"entropy": 0.23731404632329942,
|
| 187 |
+
"epoch": 3.75,
|
| 188 |
+
"grad_norm": 0.9069561958312988,
|
| 189 |
+
"learning_rate": 7.385448890113697e-05,
|
| 190 |
+
"loss": 0.19913478851318359,
|
| 191 |
+
"mean_token_accuracy": 0.9529761910438538,
|
| 192 |
+
"num_tokens": 2462390.0,
|
| 193 |
+
"step": 375
|
| 194 |
+
},
|
| 195 |
+
{
|
| 196 |
+
"entropy": 0.23737725749611854,
|
| 197 |
+
"epoch": 4.0,
|
| 198 |
+
"grad_norm": 0.8365960717201233,
|
| 199 |
+
"learning_rate": 7.29830110851977e-05,
|
| 200 |
+
"loss": 0.20468717575073242,
|
| 201 |
+
"mean_token_accuracy": 0.9518069052696227,
|
| 202 |
+
"num_tokens": 2627564.0,
|
| 203 |
+
"step": 400
|
| 204 |
+
},
|
| 205 |
+
{
|
| 206 |
+
"epoch": 4.0,
|
| 207 |
+
"eval_entropy": 0.3099417881323741,
|
| 208 |
+
"eval_loss": 0.28181254863739014,
|
| 209 |
+
"eval_mean_token_accuracy": 0.9319974917631882,
|
| 210 |
+
"eval_num_tokens": 2627564.0,
|
| 211 |
+
"eval_runtime": 14.0762,
|
| 212 |
+
"eval_samples_per_second": 7.104,
|
| 213 |
+
"eval_steps_per_second": 0.924,
|
| 214 |
+
"step": 400
|
| 215 |
+
},
|
| 216 |
+
{
|
| 217 |
+
"entropy": 0.17392504781484605,
|
| 218 |
+
"epoch": 4.25,
|
| 219 |
+
"grad_norm": 0.573874831199646,
|
| 220 |
+
"learning_rate": 7.205964338074462e-05,
|
| 221 |
+
"loss": 0.13600427627563477,
|
| 222 |
+
"mean_token_accuracy": 0.966830027103424,
|
| 223 |
+
"num_tokens": 2789214.0,
|
| 224 |
+
"step": 425
|
| 225 |
+
},
|
| 226 |
+
{
|
| 227 |
+
"entropy": 0.1851605027168989,
|
| 228 |
+
"epoch": 4.5,
|
| 229 |
+
"grad_norm": 0.3293238878250122,
|
| 230 |
+
"learning_rate": 7.108583845827884e-05,
|
| 231 |
+
"loss": 0.14525031089782714,
|
| 232 |
+
"mean_token_accuracy": 0.9660202085971832,
|
| 233 |
+
"num_tokens": 2955470.0,
|
| 234 |
+
"step": 450
|
| 235 |
+
},
|
| 236 |
+
{
|
| 237 |
+
"entropy": 0.18264326125383376,
|
| 238 |
+
"epoch": 4.75,
|
| 239 |
+
"grad_norm": 0.60421222448349,
|
| 240 |
+
"learning_rate": 7.00631283376857e-05,
|
| 241 |
+
"loss": 0.15052377700805664,
|
| 242 |
+
"mean_token_accuracy": 0.9645662713050842,
|
| 243 |
+
"num_tokens": 3119188.0,
|
| 244 |
+
"step": 475
|
| 245 |
+
},
|
| 246 |
+
{
|
| 247 |
+
"entropy": 0.16633620224893092,
|
| 248 |
+
"epoch": 5.0,
|
| 249 |
+
"grad_norm": 0.6254115700721741,
|
| 250 |
+
"learning_rate": 6.899312197801396e-05,
|
| 251 |
+
"loss": 0.1348426628112793,
|
| 252 |
+
"mean_token_accuracy": 0.9678875958919525,
|
| 253 |
+
"num_tokens": 3284455.0,
|
| 254 |
+
"step": 500
|
| 255 |
+
},
|
| 256 |
+
{
|
| 257 |
+
"epoch": 5.0,
|
| 258 |
+
"eval_entropy": 0.24236516998364374,
|
| 259 |
+
"eval_loss": 0.22977329790592194,
|
| 260 |
+
"eval_mean_token_accuracy": 0.9444432900502131,
|
| 261 |
+
"eval_num_tokens": 3284455.0,
|
| 262 |
+
"eval_runtime": 13.8939,
|
| 263 |
+
"eval_samples_per_second": 7.197,
|
| 264 |
+
"eval_steps_per_second": 0.936,
|
| 265 |
+
"step": 500
|
| 266 |
+
},
|
| 267 |
+
{
|
| 268 |
+
"entropy": 0.13423476360738276,
|
| 269 |
+
"epoch": 5.25,
|
| 270 |
+
"grad_norm": 0.36637410521507263,
|
| 271 |
+
"learning_rate": 6.787750274621175e-05,
|
| 272 |
+
"loss": 0.10897525787353515,
|
| 273 |
+
"mean_token_accuracy": 0.9732790982723236,
|
| 274 |
+
"num_tokens": 3449400.0,
|
| 275 |
+
"step": 525
|
| 276 |
+
},
|
| 277 |
+
{
|
| 278 |
+
"entropy": 0.1397440279275179,
|
| 279 |
+
"epoch": 5.5,
|
| 280 |
+
"grad_norm": 0.5530615448951721,
|
| 281 |
+
"learning_rate": 6.671802576880203e-05,
|
| 282 |
+
"loss": 0.0994186782836914,
|
| 283 |
+
"mean_token_accuracy": 0.9755746126174927,
|
| 284 |
+
"num_tokens": 3613152.0,
|
| 285 |
+
"step": 550
|
| 286 |
+
},
|
| 287 |
+
{
|
| 288 |
+
"entropy": 0.12368398927152156,
|
| 289 |
+
"epoch": 5.75,
|
| 290 |
+
"grad_norm": 0.9692996144294739,
|
| 291 |
+
"learning_rate": 6.551651517066341e-05,
|
| 292 |
+
"loss": 0.09958896636962891,
|
| 293 |
+
"mean_token_accuracy": 0.9760513007640839,
|
| 294 |
+
"num_tokens": 3776158.0,
|
| 295 |
+
"step": 575
|
| 296 |
+
},
|
| 297 |
+
{
|
| 298 |
+
"entropy": 0.12079268284142017,
|
| 299 |
+
"epoch": 6.0,
|
| 300 |
+
"grad_norm": 0.4396827220916748,
|
| 301 |
+
"learning_rate": 6.427486120526078e-05,
|
| 302 |
+
"loss": 0.09088186264038085,
|
| 303 |
+
"mean_token_accuracy": 0.9771714675426483,
|
| 304 |
+
"num_tokens": 3941346.0,
|
| 305 |
+
"step": 600
|
| 306 |
+
},
|
| 307 |
+
{
|
| 308 |
+
"epoch": 6.0,
|
| 309 |
+
"eval_entropy": 0.1859686219921479,
|
| 310 |
+
"eval_loss": 0.19470009207725525,
|
| 311 |
+
"eval_mean_token_accuracy": 0.9534304233697745,
|
| 312 |
+
"eval_num_tokens": 3941346.0,
|
| 313 |
+
"eval_runtime": 13.9237,
|
| 314 |
+
"eval_samples_per_second": 7.182,
|
| 315 |
+
"eval_steps_per_second": 0.934,
|
| 316 |
+
"step": 600
|
| 317 |
+
},
|
| 318 |
+
{
|
| 319 |
+
"entropy": 0.12473276376724243,
|
| 320 |
+
"epoch": 6.25,
|
| 321 |
+
"grad_norm": 0.34290623664855957,
|
| 322 |
+
"learning_rate": 6.29950172808403e-05,
|
| 323 |
+
"loss": 0.09884612083435058,
|
| 324 |
+
"mean_token_accuracy": 0.9764992237091065,
|
| 325 |
+
"num_tokens": 4106933.0,
|
| 326 |
+
"step": 625
|
| 327 |
+
},
|
| 328 |
+
{
|
| 329 |
+
"entropy": 0.085831069201231,
|
| 330 |
+
"epoch": 6.5,
|
| 331 |
+
"grad_norm": 0.48308831453323364,
|
| 332 |
+
"learning_rate": 6.167899688726744e-05,
|
| 333 |
+
"loss": 0.056854352951049805,
|
| 334 |
+
"mean_token_accuracy": 0.9848374032974243,
|
| 335 |
+
"num_tokens": 4268402.0,
|
| 336 |
+
"step": 650
|
| 337 |
+
},
|
| 338 |
+
{
|
| 339 |
+
"entropy": 0.10950819738209247,
|
| 340 |
+
"epoch": 6.75,
|
| 341 |
+
"grad_norm": 0.856299102306366,
|
| 342 |
+
"learning_rate": 6.032887042834277e-05,
|
| 343 |
+
"loss": 0.08215932846069336,
|
| 344 |
+
"mean_token_accuracy": 0.9799254047870636,
|
| 345 |
+
"num_tokens": 4434736.0,
|
| 346 |
+
"step": 675
|
| 347 |
+
},
|
| 348 |
+
{
|
| 349 |
+
"entropy": 0.07904369287192821,
|
| 350 |
+
"epoch": 7.0,
|
| 351 |
+
"grad_norm": 0.36722248792648315,
|
| 352 |
+
"learning_rate": 5.89467619645789e-05,
|
| 353 |
+
"loss": 0.05222135543823242,
|
| 354 |
+
"mean_token_accuracy": 0.9856019783020019,
|
| 355 |
+
"num_tokens": 4598237.0,
|
| 356 |
+
"step": 700
|
| 357 |
+
},
|
| 358 |
+
{
|
| 359 |
+
"epoch": 7.0,
|
| 360 |
+
"eval_entropy": 0.1536997648385855,
|
| 361 |
+
"eval_loss": 0.17453822493553162,
|
| 362 |
+
"eval_mean_token_accuracy": 0.958331937973316,
|
| 363 |
+
"eval_num_tokens": 4598237.0,
|
| 364 |
+
"eval_runtime": 14.081,
|
| 365 |
+
"eval_samples_per_second": 7.102,
|
| 366 |
+
"eval_steps_per_second": 0.923,
|
| 367 |
+
"step": 700
|
| 368 |
+
},
|
| 369 |
+
{
|
| 370 |
+
"entropy": 0.09128913629800081,
|
| 371 |
+
"epoch": 7.25,
|
| 372 |
+
"grad_norm": 0.3062293827533722,
|
| 373 |
+
"learning_rate": 5.75348458715631e-05,
|
| 374 |
+
"loss": 0.0616093111038208,
|
| 375 |
+
"mean_token_accuracy": 0.9844523584842682,
|
| 376 |
+
"num_tokens": 4762560.0,
|
| 377 |
+
"step": 725
|
| 378 |
+
},
|
| 379 |
+
{
|
| 380 |
+
"entropy": 0.08702207624912262,
|
| 381 |
+
"epoch": 7.5,
|
| 382 |
+
"grad_norm": 0.4253726601600647,
|
| 383 |
+
"learning_rate": 5.609534341916273e-05,
|
| 384 |
+
"loss": 0.05795002460479736,
|
| 385 |
+
"mean_token_accuracy": 0.9847881937026978,
|
| 386 |
+
"num_tokens": 4927776.0,
|
| 387 |
+
"step": 750
|
| 388 |
+
},
|
| 389 |
+
{
|
| 390 |
+
"entropy": 0.05697074756026268,
|
| 391 |
+
"epoch": 7.75,
|
| 392 |
+
"grad_norm": 0.2904660403728485,
|
| 393 |
+
"learning_rate": 5.463051927695499e-05,
|
| 394 |
+
"loss": 0.03703362941741943,
|
| 395 |
+
"mean_token_accuracy": 0.9894271802902221,
|
| 396 |
+
"num_tokens": 5093008.0,
|
| 397 |
+
"step": 775
|
| 398 |
+
},
|
| 399 |
+
{
|
| 400 |
+
"entropy": 0.07158231139183044,
|
| 401 |
+
"epoch": 8.0,
|
| 402 |
+
"grad_norm": 0.18282686173915863,
|
| 403 |
+
"learning_rate": 5.3142677951379145e-05,
|
| 404 |
+
"loss": 0.044614124298095706,
|
| 405 |
+
"mean_token_accuracy": 0.9877620100975036,
|
| 406 |
+
"num_tokens": 5255128.0,
|
| 407 |
+
"step": 800
|
| 408 |
+
},
|
| 409 |
+
{
|
| 410 |
+
"epoch": 8.0,
|
| 411 |
+
"eval_entropy": 0.12773674267988938,
|
| 412 |
+
"eval_loss": 0.15855245292186737,
|
| 413 |
+
"eval_mean_token_accuracy": 0.9648683758882376,
|
| 414 |
+
"eval_num_tokens": 5255128.0,
|
| 415 |
+
"eval_runtime": 14.1317,
|
| 416 |
+
"eval_samples_per_second": 7.076,
|
| 417 |
+
"eval_steps_per_second": 0.92,
|
| 418 |
+
"step": 800
|
| 419 |
+
},
|
| 420 |
+
{
|
| 421 |
+
"entropy": 0.05208727709949017,
|
| 422 |
+
"epoch": 8.25,
|
| 423 |
+
"grad_norm": 0.341634601354599,
|
| 424 |
+
"learning_rate": 5.163416016021597e-05,
|
| 425 |
+
"loss": 0.029668822288513183,
|
| 426 |
+
"mean_token_accuracy": 0.9911812472343445,
|
| 427 |
+
"num_tokens": 5418711.0,
|
| 428 |
+
"step": 825
|
| 429 |
+
},
|
| 430 |
+
{
|
| 431 |
+
"entropy": 0.06870386745780706,
|
| 432 |
+
"epoch": 8.5,
|
| 433 |
+
"grad_norm": 0.4494340121746063,
|
| 434 |
+
"learning_rate": 5.010733915009846e-05,
|
| 435 |
+
"loss": 0.041636109352111816,
|
| 436 |
+
"mean_token_accuracy": 0.9882563936710358,
|
| 437 |
+
"num_tokens": 5585074.0,
|
| 438 |
+
"step": 850
|
| 439 |
+
},
|
| 440 |
+
{
|
| 441 |
+
"entropy": 0.061128609701991084,
|
| 442 |
+
"epoch": 8.75,
|
| 443 |
+
"grad_norm": 0.27014654874801636,
|
| 444 |
+
"learning_rate": 4.856461696284722e-05,
|
| 445 |
+
"loss": 0.0354688835144043,
|
| 446 |
+
"mean_token_accuracy": 0.9902391600608825,
|
| 447 |
+
"num_tokens": 5748942.0,
|
| 448 |
+
"step": 875
|
| 449 |
+
},
|
| 450 |
+
{
|
| 451 |
+
"entropy": 0.06322231356054545,
|
| 452 |
+
"epoch": 9.0,
|
| 453 |
+
"grad_norm": 0.44081079959869385,
|
| 454 |
+
"learning_rate": 4.7008420656504246e-05,
|
| 455 |
+
"loss": 0.04019084930419922,
|
| 456 |
+
"mean_token_accuracy": 0.9888646650314331,
|
| 457 |
+
"num_tokens": 5912019.0,
|
| 458 |
+
"step": 900
|
| 459 |
+
},
|
| 460 |
+
{
|
| 461 |
+
"epoch": 9.0,
|
| 462 |
+
"eval_entropy": 0.10374558430451614,
|
| 463 |
+
"eval_loss": 0.13253994286060333,
|
| 464 |
+
"eval_mean_token_accuracy": 0.9727041262846726,
|
| 465 |
+
"eval_num_tokens": 5912019.0,
|
| 466 |
+
"eval_runtime": 13.9522,
|
| 467 |
+
"eval_samples_per_second": 7.167,
|
| 468 |
+
"eval_steps_per_second": 0.932,
|
| 469 |
+
"step": 900
|
| 470 |
+
},
|
| 471 |
+
{
|
| 472 |
+
"entropy": 0.05023659970611334,
|
| 473 |
+
"epoch": 9.25,
|
| 474 |
+
"grad_norm": 0.17937935888767242,
|
| 475 |
+
"learning_rate": 4.5441198487010574e-05,
|
| 476 |
+
"loss": 0.027511301040649413,
|
| 477 |
+
"mean_token_accuracy": 0.9921806335449219,
|
| 478 |
+
"num_tokens": 6076215.0,
|
| 479 |
+
"step": 925
|
| 480 |
+
},
|
| 481 |
+
{
|
| 482 |
+
"entropy": 0.04388071570545435,
|
| 483 |
+
"epoch": 9.5,
|
| 484 |
+
"grad_norm": 0.19000087678432465,
|
| 485 |
+
"learning_rate": 4.386541605653456e-05,
|
| 486 |
+
"loss": 0.023981449604034425,
|
| 487 |
+
"mean_token_accuracy": 0.9924513137340546,
|
| 488 |
+
"num_tokens": 6239231.0,
|
| 489 |
+
"step": 950
|
| 490 |
+
},
|
| 491 |
+
{
|
| 492 |
+
"entropy": 0.047076338082551954,
|
| 493 |
+
"epoch": 9.75,
|
| 494 |
+
"grad_norm": 0.32662832736968994,
|
| 495 |
+
"learning_rate": 4.228355243451073e-05,
|
| 496 |
+
"loss": 0.029095897674560545,
|
| 497 |
+
"mean_token_accuracy": 0.9913454830646515,
|
| 498 |
+
"num_tokens": 6404599.0,
|
| 499 |
+
"step": 975
|
| 500 |
+
},
|
| 501 |
+
{
|
| 502 |
+
"entropy": 0.043895740248262885,
|
| 503 |
+
"epoch": 10.0,
|
| 504 |
+
"grad_norm": 0.2123415768146515,
|
| 505 |
+
"learning_rate": 4.0698096257491346e-05,
|
| 506 |
+
"loss": 0.025430002212524415,
|
| 507 |
+
"mean_token_accuracy": 0.9922421896457672,
|
| 508 |
+
"num_tokens": 6568910.0,
|
| 509 |
+
"step": 1000
|
| 510 |
+
},
|
| 511 |
+
{
|
| 512 |
+
"epoch": 10.0,
|
| 513 |
+
"eval_entropy": 0.09539230053241436,
|
| 514 |
+
"eval_loss": 0.13063254952430725,
|
| 515 |
+
"eval_mean_token_accuracy": 0.972310735629155,
|
| 516 |
+
"eval_num_tokens": 6568910.0,
|
| 517 |
+
"eval_runtime": 13.8054,
|
| 518 |
+
"eval_samples_per_second": 7.244,
|
| 519 |
+
"eval_steps_per_second": 0.942,
|
| 520 |
+
"step": 1000
|
| 521 |
+
},
|
| 522 |
+
{
|
| 523 |
+
"entropy": 0.04173858344554901,
|
| 524 |
+
"epoch": 10.25,
|
| 525 |
+
"grad_norm": 0.2956293225288391,
|
| 526 |
+
"learning_rate": 3.911154181394682e-05,
|
| 527 |
+
"loss": 0.024647462368011474,
|
| 528 |
+
"mean_token_accuracy": 0.9924235260486602,
|
| 529 |
+
"num_tokens": 6732529.0,
|
| 530 |
+
"step": 1025
|
| 531 |
+
},
|
| 532 |
+
{
|
| 533 |
+
"entropy": 0.035978049002587796,
|
| 534 |
+
"epoch": 10.5,
|
| 535 |
+
"grad_norm": 0.19044138491153717,
|
| 536 |
+
"learning_rate": 3.7526385120174296e-05,
|
| 537 |
+
"loss": 0.019353010654449464,
|
| 538 |
+
"mean_token_accuracy": 0.9938314747810364,
|
| 539 |
+
"num_tokens": 6897778.0,
|
| 540 |
+
"step": 1050
|
| 541 |
+
},
|
| 542 |
+
{
|
| 543 |
+
"entropy": 0.03361038889735937,
|
| 544 |
+
"epoch": 10.75,
|
| 545 |
+
"grad_norm": 0.1833367496728897,
|
| 546 |
+
"learning_rate": 3.594511999348787e-05,
|
| 547 |
+
"loss": 0.018770104646682738,
|
| 548 |
+
"mean_token_accuracy": 0.9937199604511261,
|
| 549 |
+
"num_tokens": 7061058.0,
|
| 550 |
+
"step": 1075
|
| 551 |
+
},
|
| 552 |
+
{
|
| 553 |
+
"entropy": 0.0329610108025372,
|
| 554 |
+
"epoch": 11.0,
|
| 555 |
+
"grad_norm": 0.1862572729587555,
|
| 556 |
+
"learning_rate": 3.437023412886855e-05,
|
| 557 |
+
"loss": 0.01859575867652893,
|
| 558 |
+
"mean_token_accuracy": 0.9939316511154175,
|
| 559 |
+
"num_tokens": 7225801.0,
|
| 560 |
+
"step": 1100
|
| 561 |
+
},
|
| 562 |
+
{
|
| 563 |
+
"epoch": 11.0,
|
| 564 |
+
"eval_entropy": 0.07002563731601605,
|
| 565 |
+
"eval_loss": 0.11906072497367859,
|
| 566 |
+
"eval_mean_token_accuracy": 0.9767012825379005,
|
| 567 |
+
"eval_num_tokens": 7225801.0,
|
| 568 |
+
"eval_runtime": 13.7737,
|
| 569 |
+
"eval_samples_per_second": 7.26,
|
| 570 |
+
"eval_steps_per_second": 0.944,
|
| 571 |
+
"step": 1100
|
| 572 |
+
}
|
| 573 |
+
],
|
| 574 |
+
"logging_steps": 25,
|
| 575 |
+
"max_steps": 2000,
|
| 576 |
+
"num_input_tokens_seen": 0,
|
| 577 |
+
"num_train_epochs": 20,
|
| 578 |
+
"save_steps": 500,
|
| 579 |
+
"stateful_callbacks": {
|
| 580 |
+
"EarlyStoppingCallback": {
|
| 581 |
+
"args": {
|
| 582 |
+
"early_stopping_patience": 2,
|
| 583 |
+
"early_stopping_threshold": 0.0
|
| 584 |
+
},
|
| 585 |
+
"attributes": {
|
| 586 |
+
"early_stopping_patience_counter": 0
|
| 587 |
+
}
|
| 588 |
+
},
|
| 589 |
+
"TrainerControl": {
|
| 590 |
+
"args": {
|
| 591 |
+
"should_epoch_stop": false,
|
| 592 |
+
"should_evaluate": false,
|
| 593 |
+
"should_log": false,
|
| 594 |
+
"should_save": true,
|
| 595 |
+
"should_training_stop": false
|
| 596 |
+
},
|
| 597 |
+
"attributes": {}
|
| 598 |
+
}
|
| 599 |
+
},
|
| 600 |
+
"total_flos": 3.795035557618483e+17,
|
| 601 |
+
"train_batch_size": 4,
|
| 602 |
+
"trial_name": null,
|
| 603 |
+
"trial_params": null
|
| 604 |
+
}
|
UM_Handbook/outputs/baseline2_rag_harness_agent/trainer_runs/checkpoint-1100/training_args.bin
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:20789f4b467b5ff6e691e6d696bc04daebaaa023439a4918e4927d5d9b849aee
|
| 3 |
+
size 5304
|
UM_Handbook/outputs/baseline2_rag_harness_agent/trainer_runs/checkpoint-1300/README.md
ADDED
|
@@ -0,0 +1,209 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
---
|
| 2 |
+
base_model: /scr/user/kevin2002/TensorCat/NLP/UM_Handbook/models/Qwen3-8B
|
| 3 |
+
library_name: peft
|
| 4 |
+
pipeline_tag: text-generation
|
| 5 |
+
tags:
|
| 6 |
+
- base_model:adapter:/scr/user/kevin2002/TensorCat/NLP/UM_Handbook/models/Qwen3-8B
|
| 7 |
+
- lora
|
| 8 |
+
- sft
|
| 9 |
+
- transformers
|
| 10 |
+
- trl
|
| 11 |
+
---
|
| 12 |
+
|
| 13 |
+
# Model Card for Model ID
|
| 14 |
+
|
| 15 |
+
<!-- Provide a quick summary of what the model is/does. -->
|
| 16 |
+
|
| 17 |
+
|
| 18 |
+
|
| 19 |
+
## Model Details
|
| 20 |
+
|
| 21 |
+
### Model Description
|
| 22 |
+
|
| 23 |
+
<!-- Provide a longer summary of what this model is. -->
|
| 24 |
+
|
| 25 |
+
|
| 26 |
+
|
| 27 |
+
- **Developed by:** [More Information Needed]
|
| 28 |
+
- **Funded by [optional]:** [More Information Needed]
|
| 29 |
+
- **Shared by [optional]:** [More Information Needed]
|
| 30 |
+
- **Model type:** [More Information Needed]
|
| 31 |
+
- **Language(s) (NLP):** [More Information Needed]
|
| 32 |
+
- **License:** [More Information Needed]
|
| 33 |
+
- **Finetuned from model [optional]:** [More Information Needed]
|
| 34 |
+
|
| 35 |
+
### Model Sources [optional]
|
| 36 |
+
|
| 37 |
+
<!-- Provide the basic links for the model. -->
|
| 38 |
+
|
| 39 |
+
- **Repository:** [More Information Needed]
|
| 40 |
+
- **Paper [optional]:** [More Information Needed]
|
| 41 |
+
- **Demo [optional]:** [More Information Needed]
|
| 42 |
+
|
| 43 |
+
## Uses
|
| 44 |
+
|
| 45 |
+
<!-- Address questions around how the model is intended to be used, including the foreseeable users of the model and those affected by the model. -->
|
| 46 |
+
|
| 47 |
+
### Direct Use
|
| 48 |
+
|
| 49 |
+
<!-- This section is for the model use without fine-tuning or plugging into a larger ecosystem/app. -->
|
| 50 |
+
|
| 51 |
+
[More Information Needed]
|
| 52 |
+
|
| 53 |
+
### Downstream Use [optional]
|
| 54 |
+
|
| 55 |
+
<!-- This section is for the model use when fine-tuned for a task, or when plugged into a larger ecosystem/app -->
|
| 56 |
+
|
| 57 |
+
[More Information Needed]
|
| 58 |
+
|
| 59 |
+
### Out-of-Scope Use
|
| 60 |
+
|
| 61 |
+
<!-- This section addresses misuse, malicious use, and uses that the model will not work well for. -->
|
| 62 |
+
|
| 63 |
+
[More Information Needed]
|
| 64 |
+
|
| 65 |
+
## Bias, Risks, and Limitations
|
| 66 |
+
|
| 67 |
+
<!-- This section is meant to convey both technical and sociotechnical limitations. -->
|
| 68 |
+
|
| 69 |
+
[More Information Needed]
|
| 70 |
+
|
| 71 |
+
### Recommendations
|
| 72 |
+
|
| 73 |
+
<!-- This section is meant to convey recommendations with respect to the bias, risk, and technical limitations. -->
|
| 74 |
+
|
| 75 |
+
Users (both direct and downstream) should be made aware of the risks, biases and limitations of the model. More information needed for further recommendations.
|
| 76 |
+
|
| 77 |
+
## How to Get Started with the Model
|
| 78 |
+
|
| 79 |
+
Use the code below to get started with the model.
|
| 80 |
+
|
| 81 |
+
[More Information Needed]
|
| 82 |
+
|
| 83 |
+
## Training Details
|
| 84 |
+
|
| 85 |
+
### Training Data
|
| 86 |
+
|
| 87 |
+
<!-- This should link to a Dataset Card, perhaps with a short stub of information on what the training data is all about as well as documentation related to data pre-processing or additional filtering. -->
|
| 88 |
+
|
| 89 |
+
[More Information Needed]
|
| 90 |
+
|
| 91 |
+
### Training Procedure
|
| 92 |
+
|
| 93 |
+
<!-- This relates heavily to the Technical Specifications. Content here should link to that section when it is relevant to the training procedure. -->
|
| 94 |
+
|
| 95 |
+
#### Preprocessing [optional]
|
| 96 |
+
|
| 97 |
+
[More Information Needed]
|
| 98 |
+
|
| 99 |
+
|
| 100 |
+
#### Training Hyperparameters
|
| 101 |
+
|
| 102 |
+
- **Training regime:** [More Information Needed] <!--fp32, fp16 mixed precision, bf16 mixed precision, bf16 non-mixed precision, fp16 non-mixed precision, fp8 mixed precision -->
|
| 103 |
+
|
| 104 |
+
#### Speeds, Sizes, Times [optional]
|
| 105 |
+
|
| 106 |
+
<!-- This section provides information about throughput, start/end time, checkpoint size if relevant, etc. -->
|
| 107 |
+
|
| 108 |
+
[More Information Needed]
|
| 109 |
+
|
| 110 |
+
## Evaluation
|
| 111 |
+
|
| 112 |
+
<!-- This section describes the evaluation protocols and provides the results. -->
|
| 113 |
+
|
| 114 |
+
### Testing Data, Factors & Metrics
|
| 115 |
+
|
| 116 |
+
#### Testing Data
|
| 117 |
+
|
| 118 |
+
<!-- This should link to a Dataset Card if possible. -->
|
| 119 |
+
|
| 120 |
+
[More Information Needed]
|
| 121 |
+
|
| 122 |
+
#### Factors
|
| 123 |
+
|
| 124 |
+
<!-- These are the things the evaluation is disaggregating by, e.g., subpopulations or domains. -->
|
| 125 |
+
|
| 126 |
+
[More Information Needed]
|
| 127 |
+
|
| 128 |
+
#### Metrics
|
| 129 |
+
|
| 130 |
+
<!-- These are the evaluation metrics being used, ideally with a description of why. -->
|
| 131 |
+
|
| 132 |
+
[More Information Needed]
|
| 133 |
+
|
| 134 |
+
### Results
|
| 135 |
+
|
| 136 |
+
[More Information Needed]
|
| 137 |
+
|
| 138 |
+
#### Summary
|
| 139 |
+
|
| 140 |
+
|
| 141 |
+
|
| 142 |
+
## Model Examination [optional]
|
| 143 |
+
|
| 144 |
+
<!-- Relevant interpretability work for the model goes here -->
|
| 145 |
+
|
| 146 |
+
[More Information Needed]
|
| 147 |
+
|
| 148 |
+
## Environmental Impact
|
| 149 |
+
|
| 150 |
+
<!-- Total emissions (in grams of CO2eq) and additional considerations, such as electricity usage, go here. Edit the suggested text below accordingly -->
|
| 151 |
+
|
| 152 |
+
Carbon emissions can be estimated using the [Machine Learning Impact calculator](https://mlco2.github.io/impact#compute) presented in [Lacoste et al. (2019)](https://arxiv.org/abs/1910.09700).
|
| 153 |
+
|
| 154 |
+
- **Hardware Type:** [More Information Needed]
|
| 155 |
+
- **Hours used:** [More Information Needed]
|
| 156 |
+
- **Cloud Provider:** [More Information Needed]
|
| 157 |
+
- **Compute Region:** [More Information Needed]
|
| 158 |
+
- **Carbon Emitted:** [More Information Needed]
|
| 159 |
+
|
| 160 |
+
## Technical Specifications [optional]
|
| 161 |
+
|
| 162 |
+
### Model Architecture and Objective
|
| 163 |
+
|
| 164 |
+
[More Information Needed]
|
| 165 |
+
|
| 166 |
+
### Compute Infrastructure
|
| 167 |
+
|
| 168 |
+
[More Information Needed]
|
| 169 |
+
|
| 170 |
+
#### Hardware
|
| 171 |
+
|
| 172 |
+
[More Information Needed]
|
| 173 |
+
|
| 174 |
+
#### Software
|
| 175 |
+
|
| 176 |
+
[More Information Needed]
|
| 177 |
+
|
| 178 |
+
## Citation [optional]
|
| 179 |
+
|
| 180 |
+
<!-- If there is a paper or blog post introducing the model, the APA and Bibtex information for that should go in this section. -->
|
| 181 |
+
|
| 182 |
+
**BibTeX:**
|
| 183 |
+
|
| 184 |
+
[More Information Needed]
|
| 185 |
+
|
| 186 |
+
**APA:**
|
| 187 |
+
|
| 188 |
+
[More Information Needed]
|
| 189 |
+
|
| 190 |
+
## Glossary [optional]
|
| 191 |
+
|
| 192 |
+
<!-- If relevant, include terms and calculations in this section that can help readers understand the model or model card. -->
|
| 193 |
+
|
| 194 |
+
[More Information Needed]
|
| 195 |
+
|
| 196 |
+
## More Information [optional]
|
| 197 |
+
|
| 198 |
+
[More Information Needed]
|
| 199 |
+
|
| 200 |
+
## Model Card Authors [optional]
|
| 201 |
+
|
| 202 |
+
[More Information Needed]
|
| 203 |
+
|
| 204 |
+
## Model Card Contact
|
| 205 |
+
|
| 206 |
+
[More Information Needed]
|
| 207 |
+
### Framework versions
|
| 208 |
+
|
| 209 |
+
- PEFT 0.18.1
|
UM_Handbook/outputs/baseline2_rag_harness_agent/trainer_runs/checkpoint-1300/adapter_config.json
ADDED
|
@@ -0,0 +1,46 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
{
|
| 2 |
+
"alora_invocation_tokens": null,
|
| 3 |
+
"alpha_pattern": {},
|
| 4 |
+
"arrow_config": null,
|
| 5 |
+
"auto_mapping": null,
|
| 6 |
+
"base_model_name_or_path": "/scr/user/kevin2002/TensorCat/NLP/UM_Handbook/models/Qwen3-8B",
|
| 7 |
+
"bias": "none",
|
| 8 |
+
"corda_config": null,
|
| 9 |
+
"ensure_weight_tying": false,
|
| 10 |
+
"eva_config": null,
|
| 11 |
+
"exclude_modules": null,
|
| 12 |
+
"fan_in_fan_out": false,
|
| 13 |
+
"inference_mode": true,
|
| 14 |
+
"init_lora_weights": true,
|
| 15 |
+
"layer_replication": null,
|
| 16 |
+
"layers_pattern": null,
|
| 17 |
+
"layers_to_transform": null,
|
| 18 |
+
"loftq_config": {},
|
| 19 |
+
"lora_alpha": 16,
|
| 20 |
+
"lora_bias": false,
|
| 21 |
+
"lora_dropout": 0.05,
|
| 22 |
+
"megatron_config": null,
|
| 23 |
+
"megatron_core": "megatron.core",
|
| 24 |
+
"modules_to_save": null,
|
| 25 |
+
"peft_type": "LORA",
|
| 26 |
+
"peft_version": "0.18.1",
|
| 27 |
+
"qalora_group_size": 16,
|
| 28 |
+
"r": 8,
|
| 29 |
+
"rank_pattern": {},
|
| 30 |
+
"revision": null,
|
| 31 |
+
"target_modules": [
|
| 32 |
+
"gate_proj",
|
| 33 |
+
"v_proj",
|
| 34 |
+
"q_proj",
|
| 35 |
+
"k_proj",
|
| 36 |
+
"up_proj",
|
| 37 |
+
"down_proj",
|
| 38 |
+
"o_proj"
|
| 39 |
+
],
|
| 40 |
+
"target_parameters": null,
|
| 41 |
+
"task_type": "CAUSAL_LM",
|
| 42 |
+
"trainable_token_indices": null,
|
| 43 |
+
"use_dora": false,
|
| 44 |
+
"use_qalora": false,
|
| 45 |
+
"use_rslora": false
|
| 46 |
+
}
|
UM_Handbook/outputs/baseline2_rag_harness_agent/trainer_runs/checkpoint-1300/adapter_model.safetensors
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:d68d0d9a654d1eb05183823ecd5454f8992f9e496fa6e5ecde9ffb8804eb2036
|
| 3 |
+
size 87360584
|
UM_Handbook/outputs/baseline2_rag_harness_agent/trainer_runs/checkpoint-1300/chat_template.jinja
ADDED
|
@@ -0,0 +1,89 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
{%- if tools %}
|
| 2 |
+
{{- '<|im_start|>system\n' }}
|
| 3 |
+
{%- if messages[0].role == 'system' %}
|
| 4 |
+
{{- messages[0].content + '\n\n' }}
|
| 5 |
+
{%- endif %}
|
| 6 |
+
{{- "# Tools\n\nYou may call one or more functions to assist with the user query.\n\nYou are provided with function signatures within <tools></tools> XML tags:\n<tools>" }}
|
| 7 |
+
{%- for tool in tools %}
|
| 8 |
+
{{- "\n" }}
|
| 9 |
+
{{- tool | tojson }}
|
| 10 |
+
{%- endfor %}
|
| 11 |
+
{{- "\n</tools>\n\nFor each function call, return a json object with function name and arguments within <tool_call></tool_call> XML tags:\n<tool_call>\n{\"name\": <function-name>, \"arguments\": <args-json-object>}\n</tool_call><|im_end|>\n" }}
|
| 12 |
+
{%- else %}
|
| 13 |
+
{%- if messages[0].role == 'system' %}
|
| 14 |
+
{{- '<|im_start|>system\n' + messages[0].content + '<|im_end|>\n' }}
|
| 15 |
+
{%- endif %}
|
| 16 |
+
{%- endif %}
|
| 17 |
+
{%- set ns = namespace(multi_step_tool=true, last_query_index=messages|length - 1) %}
|
| 18 |
+
{%- for message in messages[::-1] %}
|
| 19 |
+
{%- set index = (messages|length - 1) - loop.index0 %}
|
| 20 |
+
{%- if ns.multi_step_tool and message.role == "user" and message.content is string and not(message.content.startswith('<tool_response>') and message.content.endswith('</tool_response>')) %}
|
| 21 |
+
{%- set ns.multi_step_tool = false %}
|
| 22 |
+
{%- set ns.last_query_index = index %}
|
| 23 |
+
{%- endif %}
|
| 24 |
+
{%- endfor %}
|
| 25 |
+
{%- for message in messages %}
|
| 26 |
+
{%- if message.content is string %}
|
| 27 |
+
{%- set content = message.content %}
|
| 28 |
+
{%- else %}
|
| 29 |
+
{%- set content = '' %}
|
| 30 |
+
{%- endif %}
|
| 31 |
+
{%- if (message.role == "user") or (message.role == "system" and not loop.first) %}
|
| 32 |
+
{{- '<|im_start|>' + message.role + '\n' + content + '<|im_end|>' + '\n' }}
|
| 33 |
+
{%- elif message.role == "assistant" %}
|
| 34 |
+
{%- set reasoning_content = '' %}
|
| 35 |
+
{%- if message.reasoning_content is string %}
|
| 36 |
+
{%- set reasoning_content = message.reasoning_content %}
|
| 37 |
+
{%- else %}
|
| 38 |
+
{%- if '</think>' in content %}
|
| 39 |
+
{%- set reasoning_content = content.split('</think>')[0].rstrip('\n').split('<think>')[-1].lstrip('\n') %}
|
| 40 |
+
{%- set content = content.split('</think>')[-1].lstrip('\n') %}
|
| 41 |
+
{%- endif %}
|
| 42 |
+
{%- endif %}
|
| 43 |
+
{%- if loop.index0 > ns.last_query_index %}
|
| 44 |
+
{%- if loop.last or (not loop.last and reasoning_content) %}
|
| 45 |
+
{{- '<|im_start|>' + message.role + '\n<think>\n' + reasoning_content.strip('\n') + '\n</think>\n\n' + content.lstrip('\n') }}
|
| 46 |
+
{%- else %}
|
| 47 |
+
{{- '<|im_start|>' + message.role + '\n' + content }}
|
| 48 |
+
{%- endif %}
|
| 49 |
+
{%- else %}
|
| 50 |
+
{{- '<|im_start|>' + message.role + '\n' + content }}
|
| 51 |
+
{%- endif %}
|
| 52 |
+
{%- if message.tool_calls %}
|
| 53 |
+
{%- for tool_call in message.tool_calls %}
|
| 54 |
+
{%- if (loop.first and content) or (not loop.first) %}
|
| 55 |
+
{{- '\n' }}
|
| 56 |
+
{%- endif %}
|
| 57 |
+
{%- if tool_call.function %}
|
| 58 |
+
{%- set tool_call = tool_call.function %}
|
| 59 |
+
{%- endif %}
|
| 60 |
+
{{- '<tool_call>\n{"name": "' }}
|
| 61 |
+
{{- tool_call.name }}
|
| 62 |
+
{{- '", "arguments": ' }}
|
| 63 |
+
{%- if tool_call.arguments is string %}
|
| 64 |
+
{{- tool_call.arguments }}
|
| 65 |
+
{%- else %}
|
| 66 |
+
{{- tool_call.arguments | tojson }}
|
| 67 |
+
{%- endif %}
|
| 68 |
+
{{- '}\n</tool_call>' }}
|
| 69 |
+
{%- endfor %}
|
| 70 |
+
{%- endif %}
|
| 71 |
+
{{- '<|im_end|>\n' }}
|
| 72 |
+
{%- elif message.role == "tool" %}
|
| 73 |
+
{%- if loop.first or (messages[loop.index0 - 1].role != "tool") %}
|
| 74 |
+
{{- '<|im_start|>user' }}
|
| 75 |
+
{%- endif %}
|
| 76 |
+
{{- '\n<tool_response>\n' }}
|
| 77 |
+
{{- content }}
|
| 78 |
+
{{- '\n</tool_response>' }}
|
| 79 |
+
{%- if loop.last or (messages[loop.index0 + 1].role != "tool") %}
|
| 80 |
+
{{- '<|im_end|>\n' }}
|
| 81 |
+
{%- endif %}
|
| 82 |
+
{%- endif %}
|
| 83 |
+
{%- endfor %}
|
| 84 |
+
{%- if add_generation_prompt %}
|
| 85 |
+
{{- '<|im_start|>assistant\n' }}
|
| 86 |
+
{%- if enable_thinking is defined and enable_thinking is false %}
|
| 87 |
+
{{- '<think>\n\n</think>\n\n' }}
|
| 88 |
+
{%- endif %}
|
| 89 |
+
{%- endif %}
|
UM_Handbook/outputs/baseline2_rag_harness_agent/trainer_runs/checkpoint-1300/optimizer.pt
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:a2f20969db1d5d958e322a2117c0eb3586841fe48295a3ae9117feada1454001
|
| 3 |
+
size 175019258
|
UM_Handbook/outputs/baseline2_rag_harness_agent/trainer_runs/checkpoint-1300/rng_state.pth
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:77b91d1b8dc7aac8a06ffacb1744b237d3ef30130e28567e0f0f6be4f5ee1e4d
|
| 3 |
+
size 14244
|
UM_Handbook/outputs/baseline2_rag_harness_agent/trainer_runs/checkpoint-1300/scheduler.pt
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:ee1e0f2b5d2ce8cedcbbda81df0279f5aa839f1a9da3b1113c83ef7fa3c6d03f
|
| 3 |
+
size 1064
|
UM_Handbook/outputs/baseline2_rag_harness_agent/trainer_runs/checkpoint-1300/tokenizer.json
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:be75606093db2094d7cd20f3c2f385c212750648bd6ea4fb2bf507a6a4c55506
|
| 3 |
+
size 11422650
|
UM_Handbook/outputs/baseline2_rag_harness_agent/trainer_runs/checkpoint-1300/tokenizer_config.json
ADDED
|
@@ -0,0 +1,29 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
{
|
| 2 |
+
"add_prefix_space": false,
|
| 3 |
+
"backend": "tokenizers",
|
| 4 |
+
"bos_token": null,
|
| 5 |
+
"clean_up_tokenization_spaces": false,
|
| 6 |
+
"eos_token": "<|im_end|>",
|
| 7 |
+
"errors": "replace",
|
| 8 |
+
"extra_special_tokens": [
|
| 9 |
+
"<|im_start|>",
|
| 10 |
+
"<|im_end|>",
|
| 11 |
+
"<|object_ref_start|>",
|
| 12 |
+
"<|object_ref_end|>",
|
| 13 |
+
"<|box_start|>",
|
| 14 |
+
"<|box_end|>",
|
| 15 |
+
"<|quad_start|>",
|
| 16 |
+
"<|quad_end|>",
|
| 17 |
+
"<|vision_start|>",
|
| 18 |
+
"<|vision_end|>",
|
| 19 |
+
"<|vision_pad|>",
|
| 20 |
+
"<|image_pad|>",
|
| 21 |
+
"<|video_pad|>"
|
| 22 |
+
],
|
| 23 |
+
"is_local": true,
|
| 24 |
+
"model_max_length": 131072,
|
| 25 |
+
"pad_token": "<|endoftext|>",
|
| 26 |
+
"split_special_tokens": false,
|
| 27 |
+
"tokenizer_class": "Qwen2Tokenizer",
|
| 28 |
+
"unk_token": null
|
| 29 |
+
}
|
UM_Handbook/outputs/baseline2_rag_harness_agent/trainer_runs/checkpoint-1300/trainer_state.json
ADDED
|
@@ -0,0 +1,706 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
{
|
| 2 |
+
"best_global_step": 1100,
|
| 3 |
+
"best_metric": 0.11906072497367859,
|
| 4 |
+
"best_model_checkpoint": "/scr/user/kevin2002/TensorCat/NLP/UM_Handbook/outputs/baseline2_rag_harness_agent/trainer_runs/checkpoint-1100",
|
| 5 |
+
"epoch": 13.0,
|
| 6 |
+
"eval_steps": 500,
|
| 7 |
+
"global_step": 1300,
|
| 8 |
+
"is_hyper_param_search": false,
|
| 9 |
+
"is_local_process_zero": true,
|
| 10 |
+
"is_world_process_zero": true,
|
| 11 |
+
"log_history": [
|
| 12 |
+
{
|
| 13 |
+
"entropy": 1.4356346559524535,
|
| 14 |
+
"epoch": 0.25,
|
| 15 |
+
"grad_norm": 0.24204915761947632,
|
| 16 |
+
"learning_rate": 7.999919440291627e-05,
|
| 17 |
+
"loss": 2.3678880310058594,
|
| 18 |
+
"mean_token_accuracy": 0.5813769394159317,
|
| 19 |
+
"num_tokens": 160725.0,
|
| 20 |
+
"step": 25
|
| 21 |
+
},
|
| 22 |
+
{
|
| 23 |
+
"entropy": 1.8811942958831787,
|
| 24 |
+
"epoch": 0.5,
|
| 25 |
+
"grad_norm": 0.17289401590824127,
|
| 26 |
+
"learning_rate": 7.995766313163572e-05,
|
| 27 |
+
"loss": 1.7852125549316407,
|
| 28 |
+
"mean_token_accuracy": 0.6483715295791626,
|
| 29 |
+
"num_tokens": 327066.0,
|
| 30 |
+
"step": 50
|
| 31 |
+
},
|
| 32 |
+
{
|
| 33 |
+
"entropy": 1.4657313966751098,
|
| 34 |
+
"epoch": 0.75,
|
| 35 |
+
"grad_norm": 0.16354545950889587,
|
| 36 |
+
"learning_rate": 7.985326923450766e-05,
|
| 37 |
+
"loss": 1.4699134826660156,
|
| 38 |
+
"mean_token_accuracy": 0.7016089594364167,
|
| 39 |
+
"num_tokens": 490539.0,
|
| 40 |
+
"step": 75
|
| 41 |
+
},
|
| 42 |
+
{
|
| 43 |
+
"entropy": 1.3245354783535004,
|
| 44 |
+
"epoch": 1.0,
|
| 45 |
+
"grad_norm": 0.279362291097641,
|
| 46 |
+
"learning_rate": 7.968617694722514e-05,
|
| 47 |
+
"loss": 1.3126080322265625,
|
| 48 |
+
"mean_token_accuracy": 0.7263329613208771,
|
| 49 |
+
"num_tokens": 656891.0,
|
| 50 |
+
"step": 100
|
| 51 |
+
},
|
| 52 |
+
{
|
| 53 |
+
"epoch": 1.0,
|
| 54 |
+
"eval_entropy": 1.1590555493648236,
|
| 55 |
+
"eval_loss": 1.1591017246246338,
|
| 56 |
+
"eval_mean_token_accuracy": 0.7455048836194552,
|
| 57 |
+
"eval_num_tokens": 656891.0,
|
| 58 |
+
"eval_runtime": 14.0274,
|
| 59 |
+
"eval_samples_per_second": 7.129,
|
| 60 |
+
"eval_steps_per_second": 0.927,
|
| 61 |
+
"step": 100
|
| 62 |
+
},
|
| 63 |
+
{
|
| 64 |
+
"entropy": 0.9851375865936279,
|
| 65 |
+
"epoch": 1.25,
|
| 66 |
+
"grad_norm": 0.405873566865921,
|
| 67 |
+
"learning_rate": 7.945664914451888e-05,
|
| 68 |
+
"loss": 0.9941243743896484,
|
| 69 |
+
"mean_token_accuracy": 0.7774216139316559,
|
| 70 |
+
"num_tokens": 820900.0,
|
| 71 |
+
"step": 125
|
| 72 |
+
},
|
| 73 |
+
{
|
| 74 |
+
"entropy": 0.9000456941127777,
|
| 75 |
+
"epoch": 1.5,
|
| 76 |
+
"grad_norm": 0.4880390465259552,
|
| 77 |
+
"learning_rate": 7.916504692659478e-05,
|
| 78 |
+
"loss": 0.8994422149658203,
|
| 79 |
+
"mean_token_accuracy": 0.8044012486934662,
|
| 80 |
+
"num_tokens": 983751.0,
|
| 81 |
+
"step": 150
|
| 82 |
+
},
|
| 83 |
+
{
|
| 84 |
+
"entropy": 0.7057543742656708,
|
| 85 |
+
"epoch": 1.75,
|
| 86 |
+
"grad_norm": 0.6458895206451416,
|
| 87 |
+
"learning_rate": 7.881182905103986e-05,
|
| 88 |
+
"loss": 0.69064697265625,
|
| 89 |
+
"mean_token_accuracy": 0.8439933550357819,
|
| 90 |
+
"num_tokens": 1148624.0,
|
| 91 |
+
"step": 175
|
| 92 |
+
},
|
| 93 |
+
{
|
| 94 |
+
"entropy": 0.5627843302488327,
|
| 95 |
+
"epoch": 2.0,
|
| 96 |
+
"grad_norm": 0.5327032804489136,
|
| 97 |
+
"learning_rate": 7.839755121109077e-05,
|
| 98 |
+
"loss": 0.5385809707641601,
|
| 99 |
+
"mean_token_accuracy": 0.8772480082511902,
|
| 100 |
+
"num_tokens": 1313782.0,
|
| 101 |
+
"step": 200
|
| 102 |
+
},
|
| 103 |
+
{
|
| 104 |
+
"epoch": 2.0,
|
| 105 |
+
"eval_entropy": 0.6312807431587806,
|
| 106 |
+
"eval_loss": 0.5925119519233704,
|
| 107 |
+
"eval_mean_token_accuracy": 0.861605126124162,
|
| 108 |
+
"eval_num_tokens": 1313782.0,
|
| 109 |
+
"eval_runtime": 14.0529,
|
| 110 |
+
"eval_samples_per_second": 7.116,
|
| 111 |
+
"eval_steps_per_second": 0.925,
|
| 112 |
+
"step": 200
|
| 113 |
+
},
|
| 114 |
+
{
|
| 115 |
+
"entropy": 0.44288761138916016,
|
| 116 |
+
"epoch": 2.25,
|
| 117 |
+
"grad_norm": 0.5565536022186279,
|
| 118 |
+
"learning_rate": 7.792286516139999e-05,
|
| 119 |
+
"loss": 0.4078195571899414,
|
| 120 |
+
"mean_token_accuracy": 0.9039785468578339,
|
| 121 |
+
"num_tokens": 1480714.0,
|
| 122 |
+
"step": 225
|
| 123 |
+
},
|
| 124 |
+
{
|
| 125 |
+
"entropy": 0.44057594060897826,
|
| 126 |
+
"epoch": 2.5,
|
| 127 |
+
"grad_norm": 1.3234641551971436,
|
| 128 |
+
"learning_rate": 7.738851769267544e-05,
|
| 129 |
+
"loss": 0.4113604736328125,
|
| 130 |
+
"mean_token_accuracy": 0.9079041624069214,
|
| 131 |
+
"num_tokens": 1645148.0,
|
| 132 |
+
"step": 250
|
| 133 |
+
},
|
| 134 |
+
{
|
| 135 |
+
"entropy": 0.34622341573238374,
|
| 136 |
+
"epoch": 2.75,
|
| 137 |
+
"grad_norm": 0.4904310405254364,
|
| 138 |
+
"learning_rate": 7.679534945680635e-05,
|
| 139 |
+
"loss": 0.316553897857666,
|
| 140 |
+
"mean_token_accuracy": 0.9257376372814179,
|
| 141 |
+
"num_tokens": 1805051.0,
|
| 142 |
+
"step": 275
|
| 143 |
+
},
|
| 144 |
+
{
|
| 145 |
+
"entropy": 0.37137761026620864,
|
| 146 |
+
"epoch": 3.0,
|
| 147 |
+
"grad_norm": 0.8455039858818054,
|
| 148 |
+
"learning_rate": 7.614429364432367e-05,
|
| 149 |
+
"loss": 0.3286180114746094,
|
| 150 |
+
"mean_token_accuracy": 0.9244595277309418,
|
| 151 |
+
"num_tokens": 1970673.0,
|
| 152 |
+
"step": 300
|
| 153 |
+
},
|
| 154 |
+
{
|
| 155 |
+
"epoch": 3.0,
|
| 156 |
+
"eval_entropy": 0.42906437584987056,
|
| 157 |
+
"eval_loss": 0.37689468264579773,
|
| 158 |
+
"eval_mean_token_accuracy": 0.9097504982581506,
|
| 159 |
+
"eval_num_tokens": 1970673.0,
|
| 160 |
+
"eval_runtime": 14.1516,
|
| 161 |
+
"eval_samples_per_second": 7.066,
|
| 162 |
+
"eval_steps_per_second": 0.919,
|
| 163 |
+
"step": 300
|
| 164 |
+
},
|
| 165 |
+
{
|
| 166 |
+
"entropy": 0.25241469651460646,
|
| 167 |
+
"epoch": 3.25,
|
| 168 |
+
"grad_norm": 0.4860665500164032,
|
| 169 |
+
"learning_rate": 7.543637451627623e-05,
|
| 170 |
+
"loss": 0.20183673858642578,
|
| 171 |
+
"mean_token_accuracy": 0.9500378572940826,
|
| 172 |
+
"num_tokens": 2133206.0,
|
| 173 |
+
"step": 325
|
| 174 |
+
},
|
| 175 |
+
{
|
| 176 |
+
"entropy": 0.2856204105913639,
|
| 177 |
+
"epoch": 3.5,
|
| 178 |
+
"grad_norm": 0.7240539193153381,
|
| 179 |
+
"learning_rate": 7.467270579283153e-05,
|
| 180 |
+
"loss": 0.2465129852294922,
|
| 181 |
+
"mean_token_accuracy": 0.944399631023407,
|
| 182 |
+
"num_tokens": 2297449.0,
|
| 183 |
+
"step": 350
|
| 184 |
+
},
|
| 185 |
+
{
|
| 186 |
+
"entropy": 0.23731404632329942,
|
| 187 |
+
"epoch": 3.75,
|
| 188 |
+
"grad_norm": 0.9069561958312988,
|
| 189 |
+
"learning_rate": 7.385448890113697e-05,
|
| 190 |
+
"loss": 0.19913478851318359,
|
| 191 |
+
"mean_token_accuracy": 0.9529761910438538,
|
| 192 |
+
"num_tokens": 2462390.0,
|
| 193 |
+
"step": 375
|
| 194 |
+
},
|
| 195 |
+
{
|
| 196 |
+
"entropy": 0.23737725749611854,
|
| 197 |
+
"epoch": 4.0,
|
| 198 |
+
"grad_norm": 0.8365960717201233,
|
| 199 |
+
"learning_rate": 7.29830110851977e-05,
|
| 200 |
+
"loss": 0.20468717575073242,
|
| 201 |
+
"mean_token_accuracy": 0.9518069052696227,
|
| 202 |
+
"num_tokens": 2627564.0,
|
| 203 |
+
"step": 400
|
| 204 |
+
},
|
| 205 |
+
{
|
| 206 |
+
"epoch": 4.0,
|
| 207 |
+
"eval_entropy": 0.3099417881323741,
|
| 208 |
+
"eval_loss": 0.28181254863739014,
|
| 209 |
+
"eval_mean_token_accuracy": 0.9319974917631882,
|
| 210 |
+
"eval_num_tokens": 2627564.0,
|
| 211 |
+
"eval_runtime": 14.0762,
|
| 212 |
+
"eval_samples_per_second": 7.104,
|
| 213 |
+
"eval_steps_per_second": 0.924,
|
| 214 |
+
"step": 400
|
| 215 |
+
},
|
| 216 |
+
{
|
| 217 |
+
"entropy": 0.17392504781484605,
|
| 218 |
+
"epoch": 4.25,
|
| 219 |
+
"grad_norm": 0.573874831199646,
|
| 220 |
+
"learning_rate": 7.205964338074462e-05,
|
| 221 |
+
"loss": 0.13600427627563477,
|
| 222 |
+
"mean_token_accuracy": 0.966830027103424,
|
| 223 |
+
"num_tokens": 2789214.0,
|
| 224 |
+
"step": 425
|
| 225 |
+
},
|
| 226 |
+
{
|
| 227 |
+
"entropy": 0.1851605027168989,
|
| 228 |
+
"epoch": 4.5,
|
| 229 |
+
"grad_norm": 0.3293238878250122,
|
| 230 |
+
"learning_rate": 7.108583845827884e-05,
|
| 231 |
+
"loss": 0.14525031089782714,
|
| 232 |
+
"mean_token_accuracy": 0.9660202085971832,
|
| 233 |
+
"num_tokens": 2955470.0,
|
| 234 |
+
"step": 450
|
| 235 |
+
},
|
| 236 |
+
{
|
| 237 |
+
"entropy": 0.18264326125383376,
|
| 238 |
+
"epoch": 4.75,
|
| 239 |
+
"grad_norm": 0.60421222448349,
|
| 240 |
+
"learning_rate": 7.00631283376857e-05,
|
| 241 |
+
"loss": 0.15052377700805664,
|
| 242 |
+
"mean_token_accuracy": 0.9645662713050842,
|
| 243 |
+
"num_tokens": 3119188.0,
|
| 244 |
+
"step": 475
|
| 245 |
+
},
|
| 246 |
+
{
|
| 247 |
+
"entropy": 0.16633620224893092,
|
| 248 |
+
"epoch": 5.0,
|
| 249 |
+
"grad_norm": 0.6254115700721741,
|
| 250 |
+
"learning_rate": 6.899312197801396e-05,
|
| 251 |
+
"loss": 0.1348426628112793,
|
| 252 |
+
"mean_token_accuracy": 0.9678875958919525,
|
| 253 |
+
"num_tokens": 3284455.0,
|
| 254 |
+
"step": 500
|
| 255 |
+
},
|
| 256 |
+
{
|
| 257 |
+
"epoch": 5.0,
|
| 258 |
+
"eval_entropy": 0.24236516998364374,
|
| 259 |
+
"eval_loss": 0.22977329790592194,
|
| 260 |
+
"eval_mean_token_accuracy": 0.9444432900502131,
|
| 261 |
+
"eval_num_tokens": 3284455.0,
|
| 262 |
+
"eval_runtime": 13.8939,
|
| 263 |
+
"eval_samples_per_second": 7.197,
|
| 264 |
+
"eval_steps_per_second": 0.936,
|
| 265 |
+
"step": 500
|
| 266 |
+
},
|
| 267 |
+
{
|
| 268 |
+
"entropy": 0.13423476360738276,
|
| 269 |
+
"epoch": 5.25,
|
| 270 |
+
"grad_norm": 0.36637410521507263,
|
| 271 |
+
"learning_rate": 6.787750274621175e-05,
|
| 272 |
+
"loss": 0.10897525787353515,
|
| 273 |
+
"mean_token_accuracy": 0.9732790982723236,
|
| 274 |
+
"num_tokens": 3449400.0,
|
| 275 |
+
"step": 525
|
| 276 |
+
},
|
| 277 |
+
{
|
| 278 |
+
"entropy": 0.1397440279275179,
|
| 279 |
+
"epoch": 5.5,
|
| 280 |
+
"grad_norm": 0.5530615448951721,
|
| 281 |
+
"learning_rate": 6.671802576880203e-05,
|
| 282 |
+
"loss": 0.0994186782836914,
|
| 283 |
+
"mean_token_accuracy": 0.9755746126174927,
|
| 284 |
+
"num_tokens": 3613152.0,
|
| 285 |
+
"step": 550
|
| 286 |
+
},
|
| 287 |
+
{
|
| 288 |
+
"entropy": 0.12368398927152156,
|
| 289 |
+
"epoch": 5.75,
|
| 290 |
+
"grad_norm": 0.9692996144294739,
|
| 291 |
+
"learning_rate": 6.551651517066341e-05,
|
| 292 |
+
"loss": 0.09958896636962891,
|
| 293 |
+
"mean_token_accuracy": 0.9760513007640839,
|
| 294 |
+
"num_tokens": 3776158.0,
|
| 295 |
+
"step": 575
|
| 296 |
+
},
|
| 297 |
+
{
|
| 298 |
+
"entropy": 0.12079268284142017,
|
| 299 |
+
"epoch": 6.0,
|
| 300 |
+
"grad_norm": 0.4396827220916748,
|
| 301 |
+
"learning_rate": 6.427486120526078e-05,
|
| 302 |
+
"loss": 0.09088186264038085,
|
| 303 |
+
"mean_token_accuracy": 0.9771714675426483,
|
| 304 |
+
"num_tokens": 3941346.0,
|
| 305 |
+
"step": 600
|
| 306 |
+
},
|
| 307 |
+
{
|
| 308 |
+
"epoch": 6.0,
|
| 309 |
+
"eval_entropy": 0.1859686219921479,
|
| 310 |
+
"eval_loss": 0.19470009207725525,
|
| 311 |
+
"eval_mean_token_accuracy": 0.9534304233697745,
|
| 312 |
+
"eval_num_tokens": 3941346.0,
|
| 313 |
+
"eval_runtime": 13.9237,
|
| 314 |
+
"eval_samples_per_second": 7.182,
|
| 315 |
+
"eval_steps_per_second": 0.934,
|
| 316 |
+
"step": 600
|
| 317 |
+
},
|
| 318 |
+
{
|
| 319 |
+
"entropy": 0.12473276376724243,
|
| 320 |
+
"epoch": 6.25,
|
| 321 |
+
"grad_norm": 0.34290623664855957,
|
| 322 |
+
"learning_rate": 6.29950172808403e-05,
|
| 323 |
+
"loss": 0.09884612083435058,
|
| 324 |
+
"mean_token_accuracy": 0.9764992237091065,
|
| 325 |
+
"num_tokens": 4106933.0,
|
| 326 |
+
"step": 625
|
| 327 |
+
},
|
| 328 |
+
{
|
| 329 |
+
"entropy": 0.085831069201231,
|
| 330 |
+
"epoch": 6.5,
|
| 331 |
+
"grad_norm": 0.48308831453323364,
|
| 332 |
+
"learning_rate": 6.167899688726744e-05,
|
| 333 |
+
"loss": 0.056854352951049805,
|
| 334 |
+
"mean_token_accuracy": 0.9848374032974243,
|
| 335 |
+
"num_tokens": 4268402.0,
|
| 336 |
+
"step": 650
|
| 337 |
+
},
|
| 338 |
+
{
|
| 339 |
+
"entropy": 0.10950819738209247,
|
| 340 |
+
"epoch": 6.75,
|
| 341 |
+
"grad_norm": 0.856299102306366,
|
| 342 |
+
"learning_rate": 6.032887042834277e-05,
|
| 343 |
+
"loss": 0.08215932846069336,
|
| 344 |
+
"mean_token_accuracy": 0.9799254047870636,
|
| 345 |
+
"num_tokens": 4434736.0,
|
| 346 |
+
"step": 675
|
| 347 |
+
},
|
| 348 |
+
{
|
| 349 |
+
"entropy": 0.07904369287192821,
|
| 350 |
+
"epoch": 7.0,
|
| 351 |
+
"grad_norm": 0.36722248792648315,
|
| 352 |
+
"learning_rate": 5.89467619645789e-05,
|
| 353 |
+
"loss": 0.05222135543823242,
|
| 354 |
+
"mean_token_accuracy": 0.9856019783020019,
|
| 355 |
+
"num_tokens": 4598237.0,
|
| 356 |
+
"step": 700
|
| 357 |
+
},
|
| 358 |
+
{
|
| 359 |
+
"epoch": 7.0,
|
| 360 |
+
"eval_entropy": 0.1536997648385855,
|
| 361 |
+
"eval_loss": 0.17453822493553162,
|
| 362 |
+
"eval_mean_token_accuracy": 0.958331937973316,
|
| 363 |
+
"eval_num_tokens": 4598237.0,
|
| 364 |
+
"eval_runtime": 14.081,
|
| 365 |
+
"eval_samples_per_second": 7.102,
|
| 366 |
+
"eval_steps_per_second": 0.923,
|
| 367 |
+
"step": 700
|
| 368 |
+
},
|
| 369 |
+
{
|
| 370 |
+
"entropy": 0.09128913629800081,
|
| 371 |
+
"epoch": 7.25,
|
| 372 |
+
"grad_norm": 0.3062293827533722,
|
| 373 |
+
"learning_rate": 5.75348458715631e-05,
|
| 374 |
+
"loss": 0.0616093111038208,
|
| 375 |
+
"mean_token_accuracy": 0.9844523584842682,
|
| 376 |
+
"num_tokens": 4762560.0,
|
| 377 |
+
"step": 725
|
| 378 |
+
},
|
| 379 |
+
{
|
| 380 |
+
"entropy": 0.08702207624912262,
|
| 381 |
+
"epoch": 7.5,
|
| 382 |
+
"grad_norm": 0.4253726601600647,
|
| 383 |
+
"learning_rate": 5.609534341916273e-05,
|
| 384 |
+
"loss": 0.05795002460479736,
|
| 385 |
+
"mean_token_accuracy": 0.9847881937026978,
|
| 386 |
+
"num_tokens": 4927776.0,
|
| 387 |
+
"step": 750
|
| 388 |
+
},
|
| 389 |
+
{
|
| 390 |
+
"entropy": 0.05697074756026268,
|
| 391 |
+
"epoch": 7.75,
|
| 392 |
+
"grad_norm": 0.2904660403728485,
|
| 393 |
+
"learning_rate": 5.463051927695499e-05,
|
| 394 |
+
"loss": 0.03703362941741943,
|
| 395 |
+
"mean_token_accuracy": 0.9894271802902221,
|
| 396 |
+
"num_tokens": 5093008.0,
|
| 397 |
+
"step": 775
|
| 398 |
+
},
|
| 399 |
+
{
|
| 400 |
+
"entropy": 0.07158231139183044,
|
| 401 |
+
"epoch": 8.0,
|
| 402 |
+
"grad_norm": 0.18282686173915863,
|
| 403 |
+
"learning_rate": 5.3142677951379145e-05,
|
| 404 |
+
"loss": 0.044614124298095706,
|
| 405 |
+
"mean_token_accuracy": 0.9877620100975036,
|
| 406 |
+
"num_tokens": 5255128.0,
|
| 407 |
+
"step": 800
|
| 408 |
+
},
|
| 409 |
+
{
|
| 410 |
+
"epoch": 8.0,
|
| 411 |
+
"eval_entropy": 0.12773674267988938,
|
| 412 |
+
"eval_loss": 0.15855245292186737,
|
| 413 |
+
"eval_mean_token_accuracy": 0.9648683758882376,
|
| 414 |
+
"eval_num_tokens": 5255128.0,
|
| 415 |
+
"eval_runtime": 14.1317,
|
| 416 |
+
"eval_samples_per_second": 7.076,
|
| 417 |
+
"eval_steps_per_second": 0.92,
|
| 418 |
+
"step": 800
|
| 419 |
+
},
|
| 420 |
+
{
|
| 421 |
+
"entropy": 0.05208727709949017,
|
| 422 |
+
"epoch": 8.25,
|
| 423 |
+
"grad_norm": 0.341634601354599,
|
| 424 |
+
"learning_rate": 5.163416016021597e-05,
|
| 425 |
+
"loss": 0.029668822288513183,
|
| 426 |
+
"mean_token_accuracy": 0.9911812472343445,
|
| 427 |
+
"num_tokens": 5418711.0,
|
| 428 |
+
"step": 825
|
| 429 |
+
},
|
| 430 |
+
{
|
| 431 |
+
"entropy": 0.06870386745780706,
|
| 432 |
+
"epoch": 8.5,
|
| 433 |
+
"grad_norm": 0.4494340121746063,
|
| 434 |
+
"learning_rate": 5.010733915009846e-05,
|
| 435 |
+
"loss": 0.041636109352111816,
|
| 436 |
+
"mean_token_accuracy": 0.9882563936710358,
|
| 437 |
+
"num_tokens": 5585074.0,
|
| 438 |
+
"step": 850
|
| 439 |
+
},
|
| 440 |
+
{
|
| 441 |
+
"entropy": 0.061128609701991084,
|
| 442 |
+
"epoch": 8.75,
|
| 443 |
+
"grad_norm": 0.27014654874801636,
|
| 444 |
+
"learning_rate": 4.856461696284722e-05,
|
| 445 |
+
"loss": 0.0354688835144043,
|
| 446 |
+
"mean_token_accuracy": 0.9902391600608825,
|
| 447 |
+
"num_tokens": 5748942.0,
|
| 448 |
+
"step": 875
|
| 449 |
+
},
|
| 450 |
+
{
|
| 451 |
+
"entropy": 0.06322231356054545,
|
| 452 |
+
"epoch": 9.0,
|
| 453 |
+
"grad_norm": 0.44081079959869385,
|
| 454 |
+
"learning_rate": 4.7008420656504246e-05,
|
| 455 |
+
"loss": 0.04019084930419922,
|
| 456 |
+
"mean_token_accuracy": 0.9888646650314331,
|
| 457 |
+
"num_tokens": 5912019.0,
|
| 458 |
+
"step": 900
|
| 459 |
+
},
|
| 460 |
+
{
|
| 461 |
+
"epoch": 9.0,
|
| 462 |
+
"eval_entropy": 0.10374558430451614,
|
| 463 |
+
"eval_loss": 0.13253994286060333,
|
| 464 |
+
"eval_mean_token_accuracy": 0.9727041262846726,
|
| 465 |
+
"eval_num_tokens": 5912019.0,
|
| 466 |
+
"eval_runtime": 13.9522,
|
| 467 |
+
"eval_samples_per_second": 7.167,
|
| 468 |
+
"eval_steps_per_second": 0.932,
|
| 469 |
+
"step": 900
|
| 470 |
+
},
|
| 471 |
+
{
|
| 472 |
+
"entropy": 0.05023659970611334,
|
| 473 |
+
"epoch": 9.25,
|
| 474 |
+
"grad_norm": 0.17937935888767242,
|
| 475 |
+
"learning_rate": 4.5441198487010574e-05,
|
| 476 |
+
"loss": 0.027511301040649413,
|
| 477 |
+
"mean_token_accuracy": 0.9921806335449219,
|
| 478 |
+
"num_tokens": 6076215.0,
|
| 479 |
+
"step": 925
|
| 480 |
+
},
|
| 481 |
+
{
|
| 482 |
+
"entropy": 0.04388071570545435,
|
| 483 |
+
"epoch": 9.5,
|
| 484 |
+
"grad_norm": 0.19000087678432465,
|
| 485 |
+
"learning_rate": 4.386541605653456e-05,
|
| 486 |
+
"loss": 0.023981449604034425,
|
| 487 |
+
"mean_token_accuracy": 0.9924513137340546,
|
| 488 |
+
"num_tokens": 6239231.0,
|
| 489 |
+
"step": 950
|
| 490 |
+
},
|
| 491 |
+
{
|
| 492 |
+
"entropy": 0.047076338082551954,
|
| 493 |
+
"epoch": 9.75,
|
| 494 |
+
"grad_norm": 0.32662832736968994,
|
| 495 |
+
"learning_rate": 4.228355243451073e-05,
|
| 496 |
+
"loss": 0.029095897674560545,
|
| 497 |
+
"mean_token_accuracy": 0.9913454830646515,
|
| 498 |
+
"num_tokens": 6404599.0,
|
| 499 |
+
"step": 975
|
| 500 |
+
},
|
| 501 |
+
{
|
| 502 |
+
"entropy": 0.043895740248262885,
|
| 503 |
+
"epoch": 10.0,
|
| 504 |
+
"grad_norm": 0.2123415768146515,
|
| 505 |
+
"learning_rate": 4.0698096257491346e-05,
|
| 506 |
+
"loss": 0.025430002212524415,
|
| 507 |
+
"mean_token_accuracy": 0.9922421896457672,
|
| 508 |
+
"num_tokens": 6568910.0,
|
| 509 |
+
"step": 1000
|
| 510 |
+
},
|
| 511 |
+
{
|
| 512 |
+
"epoch": 10.0,
|
| 513 |
+
"eval_entropy": 0.09539230053241436,
|
| 514 |
+
"eval_loss": 0.13063254952430725,
|
| 515 |
+
"eval_mean_token_accuracy": 0.972310735629155,
|
| 516 |
+
"eval_num_tokens": 6568910.0,
|
| 517 |
+
"eval_runtime": 13.8054,
|
| 518 |
+
"eval_samples_per_second": 7.244,
|
| 519 |
+
"eval_steps_per_second": 0.942,
|
| 520 |
+
"step": 1000
|
| 521 |
+
},
|
| 522 |
+
{
|
| 523 |
+
"entropy": 0.04173858344554901,
|
| 524 |
+
"epoch": 10.25,
|
| 525 |
+
"grad_norm": 0.2956293225288391,
|
| 526 |
+
"learning_rate": 3.911154181394682e-05,
|
| 527 |
+
"loss": 0.024647462368011474,
|
| 528 |
+
"mean_token_accuracy": 0.9924235260486602,
|
| 529 |
+
"num_tokens": 6732529.0,
|
| 530 |
+
"step": 1025
|
| 531 |
+
},
|
| 532 |
+
{
|
| 533 |
+
"entropy": 0.035978049002587796,
|
| 534 |
+
"epoch": 10.5,
|
| 535 |
+
"grad_norm": 0.19044138491153717,
|
| 536 |
+
"learning_rate": 3.7526385120174296e-05,
|
| 537 |
+
"loss": 0.019353010654449464,
|
| 538 |
+
"mean_token_accuracy": 0.9938314747810364,
|
| 539 |
+
"num_tokens": 6897778.0,
|
| 540 |
+
"step": 1050
|
| 541 |
+
},
|
| 542 |
+
{
|
| 543 |
+
"entropy": 0.03361038889735937,
|
| 544 |
+
"epoch": 10.75,
|
| 545 |
+
"grad_norm": 0.1833367496728897,
|
| 546 |
+
"learning_rate": 3.594511999348787e-05,
|
| 547 |
+
"loss": 0.018770104646682738,
|
| 548 |
+
"mean_token_accuracy": 0.9937199604511261,
|
| 549 |
+
"num_tokens": 7061058.0,
|
| 550 |
+
"step": 1075
|
| 551 |
+
},
|
| 552 |
+
{
|
| 553 |
+
"entropy": 0.0329610108025372,
|
| 554 |
+
"epoch": 11.0,
|
| 555 |
+
"grad_norm": 0.1862572729587555,
|
| 556 |
+
"learning_rate": 3.437023412886855e-05,
|
| 557 |
+
"loss": 0.01859575867652893,
|
| 558 |
+
"mean_token_accuracy": 0.9939316511154175,
|
| 559 |
+
"num_tokens": 7225801.0,
|
| 560 |
+
"step": 1100
|
| 561 |
+
},
|
| 562 |
+
{
|
| 563 |
+
"epoch": 11.0,
|
| 564 |
+
"eval_entropy": 0.07002563731601605,
|
| 565 |
+
"eval_loss": 0.11906072497367859,
|
| 566 |
+
"eval_mean_token_accuracy": 0.9767012825379005,
|
| 567 |
+
"eval_num_tokens": 7225801.0,
|
| 568 |
+
"eval_runtime": 13.7737,
|
| 569 |
+
"eval_samples_per_second": 7.26,
|
| 570 |
+
"eval_steps_per_second": 0.944,
|
| 571 |
+
"step": 1100
|
| 572 |
+
},
|
| 573 |
+
{
|
| 574 |
+
"entropy": 0.029755542390048503,
|
| 575 |
+
"epoch": 11.25,
|
| 576 |
+
"grad_norm": 0.13579222559928894,
|
| 577 |
+
"learning_rate": 3.280420518524585e-05,
|
| 578 |
+
"loss": 0.016698834896087648,
|
| 579 |
+
"mean_token_accuracy": 0.9944922685623169,
|
| 580 |
+
"num_tokens": 7392621.0,
|
| 581 |
+
"step": 1125
|
| 582 |
+
},
|
| 583 |
+
{
|
| 584 |
+
"entropy": 0.02796286340802908,
|
| 585 |
+
"epoch": 11.5,
|
| 586 |
+
"grad_norm": 0.14825628697872162,
|
| 587 |
+
"learning_rate": 3.124949688756846e-05,
|
| 588 |
+
"loss": 0.015717124938964842,
|
| 589 |
+
"mean_token_accuracy": 0.9946336007118225,
|
| 590 |
+
"num_tokens": 7556769.0,
|
| 591 |
+
"step": 1150
|
| 592 |
+
},
|
| 593 |
+
{
|
| 594 |
+
"entropy": 0.0260389681532979,
|
| 595 |
+
"epoch": 11.75,
|
| 596 |
+
"grad_norm": 0.1284666806459427,
|
| 597 |
+
"learning_rate": 2.9708555150796346e-05,
|
| 598 |
+
"loss": 0.016165239810943602,
|
| 599 |
+
"mean_token_accuracy": 0.9943143832683563,
|
| 600 |
+
"num_tokens": 7719224.0,
|
| 601 |
+
"step": 1175
|
| 602 |
+
},
|
| 603 |
+
{
|
| 604 |
+
"entropy": 0.03029863853007555,
|
| 605 |
+
"epoch": 12.0,
|
| 606 |
+
"grad_norm": 0.2951240837574005,
|
| 607 |
+
"learning_rate": 2.8183804231912004e-05,
|
| 608 |
+
"loss": 0.01682651400566101,
|
| 609 |
+
"mean_token_accuracy": 0.9942524111270905,
|
| 610 |
+
"num_tokens": 7882692.0,
|
| 611 |
+
"step": 1200
|
| 612 |
+
},
|
| 613 |
+
{
|
| 614 |
+
"epoch": 12.0,
|
| 615 |
+
"eval_entropy": 0.06255707159065284,
|
| 616 |
+
"eval_loss": 0.11945243179798126,
|
| 617 |
+
"eval_mean_token_accuracy": 0.9772726480777447,
|
| 618 |
+
"eval_num_tokens": 7882692.0,
|
| 619 |
+
"eval_runtime": 13.7991,
|
| 620 |
+
"eval_samples_per_second": 7.247,
|
| 621 |
+
"eval_steps_per_second": 0.942,
|
| 622 |
+
"step": 1200
|
| 623 |
+
},
|
| 624 |
+
{
|
| 625 |
+
"entropy": 0.02317443212494254,
|
| 626 |
+
"epoch": 12.25,
|
| 627 |
+
"grad_norm": 0.13226298987865448,
|
| 628 |
+
"learning_rate": 2.6677642916004772e-05,
|
| 629 |
+
"loss": 0.012957496643066406,
|
| 630 |
+
"mean_token_accuracy": 0.9950127184391022,
|
| 631 |
+
"num_tokens": 8049057.0,
|
| 632 |
+
"step": 1225
|
| 633 |
+
},
|
| 634 |
+
{
|
| 635 |
+
"entropy": 0.0250219264626503,
|
| 636 |
+
"epoch": 12.5,
|
| 637 |
+
"grad_norm": 0.11246883124113083,
|
| 638 |
+
"learning_rate": 2.519244074242825e-05,
|
| 639 |
+
"loss": 0.01474016547203064,
|
| 640 |
+
"mean_token_accuracy": 0.9946307218074799,
|
| 641 |
+
"num_tokens": 8212177.0,
|
| 642 |
+
"step": 1250
|
| 643 |
+
},
|
| 644 |
+
{
|
| 645 |
+
"entropy": 0.024466073755174875,
|
| 646 |
+
"epoch": 12.75,
|
| 647 |
+
"grad_norm": 0.12266653031110764,
|
| 648 |
+
"learning_rate": 2.3730534276968e-05,
|
| 649 |
+
"loss": 0.014008394479751586,
|
| 650 |
+
"mean_token_accuracy": 0.9946387422084808,
|
| 651 |
+
"num_tokens": 8376959.0,
|
| 652 |
+
"step": 1275
|
| 653 |
+
},
|
| 654 |
+
{
|
| 655 |
+
"entropy": 0.02350047092884779,
|
| 656 |
+
"epoch": 13.0,
|
| 657 |
+
"grad_norm": 0.10886064171791077,
|
| 658 |
+
"learning_rate": 2.2294223435884402e-05,
|
| 659 |
+
"loss": 0.014072380065917968,
|
| 660 |
+
"mean_token_accuracy": 0.9947505962848663,
|
| 661 |
+
"num_tokens": 8539583.0,
|
| 662 |
+
"step": 1300
|
| 663 |
+
},
|
| 664 |
+
{
|
| 665 |
+
"epoch": 13.0,
|
| 666 |
+
"eval_entropy": 0.05241998714896349,
|
| 667 |
+
"eval_loss": 0.12038982659578323,
|
| 668 |
+
"eval_mean_token_accuracy": 0.9782212514143723,
|
| 669 |
+
"eval_num_tokens": 8539583.0,
|
| 670 |
+
"eval_runtime": 13.7732,
|
| 671 |
+
"eval_samples_per_second": 7.26,
|
| 672 |
+
"eval_steps_per_second": 0.944,
|
| 673 |
+
"step": 1300
|
| 674 |
+
}
|
| 675 |
+
],
|
| 676 |
+
"logging_steps": 25,
|
| 677 |
+
"max_steps": 2000,
|
| 678 |
+
"num_input_tokens_seen": 0,
|
| 679 |
+
"num_train_epochs": 20,
|
| 680 |
+
"save_steps": 500,
|
| 681 |
+
"stateful_callbacks": {
|
| 682 |
+
"EarlyStoppingCallback": {
|
| 683 |
+
"args": {
|
| 684 |
+
"early_stopping_patience": 2,
|
| 685 |
+
"early_stopping_threshold": 0.0
|
| 686 |
+
},
|
| 687 |
+
"attributes": {
|
| 688 |
+
"early_stopping_patience_counter": 2
|
| 689 |
+
}
|
| 690 |
+
},
|
| 691 |
+
"TrainerControl": {
|
| 692 |
+
"args": {
|
| 693 |
+
"should_epoch_stop": false,
|
| 694 |
+
"should_evaluate": false,
|
| 695 |
+
"should_log": false,
|
| 696 |
+
"should_save": true,
|
| 697 |
+
"should_training_stop": true
|
| 698 |
+
},
|
| 699 |
+
"attributes": {}
|
| 700 |
+
}
|
| 701 |
+
},
|
| 702 |
+
"total_flos": 4.4872917665132544e+17,
|
| 703 |
+
"train_batch_size": 4,
|
| 704 |
+
"trial_name": null,
|
| 705 |
+
"trial_params": null
|
| 706 |
+
}
|
UM_Handbook/outputs/baseline2_rag_harness_agent/trainer_runs/checkpoint-1300/training_args.bin
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:20789f4b467b5ff6e691e6d696bc04daebaaa023439a4918e4927d5d9b849aee
|
| 3 |
+
size 5304
|
UM_Handbook/outputs/baseline2_rag_harness_agent/trainer_runs/hf_dataset_dict/dataset_dict.json
ADDED
|
@@ -0,0 +1 @@
|
|
|
|
|
|
|
| 1 |
+
{"splits": ["train", "validation", "test"]}
|
UM_Handbook/outputs/baseline2_rag_harness_agent/trainer_runs/hf_dataset_dict/test/data-00000-of-00001.arrow
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:f025159053af787b7124324d5700120792c277fbd717701fed35627723dd5c86
|
| 3 |
+
size 787088
|
UM_Handbook/outputs/baseline2_rag_harness_agent/trainer_runs/hf_dataset_dict/test/dataset_info.json
ADDED
|
@@ -0,0 +1,89 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
{
|
| 2 |
+
"citation": "",
|
| 3 |
+
"description": "",
|
| 4 |
+
"features": {
|
| 5 |
+
"qa_id": {
|
| 6 |
+
"dtype": "string",
|
| 7 |
+
"_type": "Value"
|
| 8 |
+
},
|
| 9 |
+
"index_id": {
|
| 10 |
+
"dtype": "string",
|
| 11 |
+
"_type": "Value"
|
| 12 |
+
},
|
| 13 |
+
"question": {
|
| 14 |
+
"dtype": "string",
|
| 15 |
+
"_type": "Value"
|
| 16 |
+
},
|
| 17 |
+
"answer": {
|
| 18 |
+
"dtype": "string",
|
| 19 |
+
"_type": "Value"
|
| 20 |
+
},
|
| 21 |
+
"scope_label": {
|
| 22 |
+
"dtype": "string",
|
| 23 |
+
"_type": "Value"
|
| 24 |
+
},
|
| 25 |
+
"source_doc": {
|
| 26 |
+
"dtype": "string",
|
| 27 |
+
"_type": "Value"
|
| 28 |
+
},
|
| 29 |
+
"section": {
|
| 30 |
+
"dtype": "string",
|
| 31 |
+
"_type": "Value"
|
| 32 |
+
},
|
| 33 |
+
"subsection": {
|
| 34 |
+
"dtype": "string",
|
| 35 |
+
"_type": "Value"
|
| 36 |
+
},
|
| 37 |
+
"retrieved_kb_ids": {
|
| 38 |
+
"feature": {
|
| 39 |
+
"dtype": "string",
|
| 40 |
+
"_type": "Value"
|
| 41 |
+
},
|
| 42 |
+
"_type": "Sequence"
|
| 43 |
+
},
|
| 44 |
+
"retrieved_sections": {
|
| 45 |
+
"feature": {
|
| 46 |
+
"dtype": "string",
|
| 47 |
+
"_type": "Value"
|
| 48 |
+
},
|
| 49 |
+
"_type": "Sequence"
|
| 50 |
+
},
|
| 51 |
+
"retrieved_scopes": {
|
| 52 |
+
"feature": {
|
| 53 |
+
"dtype": "string",
|
| 54 |
+
"_type": "Value"
|
| 55 |
+
},
|
| 56 |
+
"_type": "Sequence"
|
| 57 |
+
},
|
| 58 |
+
"prompt": [
|
| 59 |
+
{
|
| 60 |
+
"content": {
|
| 61 |
+
"dtype": "string",
|
| 62 |
+
"_type": "Value"
|
| 63 |
+
},
|
| 64 |
+
"role": {
|
| 65 |
+
"dtype": "string",
|
| 66 |
+
"_type": "Value"
|
| 67 |
+
}
|
| 68 |
+
}
|
| 69 |
+
],
|
| 70 |
+
"completion": [
|
| 71 |
+
{
|
| 72 |
+
"content": {
|
| 73 |
+
"dtype": "string",
|
| 74 |
+
"_type": "Value"
|
| 75 |
+
},
|
| 76 |
+
"role": {
|
| 77 |
+
"dtype": "string",
|
| 78 |
+
"_type": "Value"
|
| 79 |
+
}
|
| 80 |
+
}
|
| 81 |
+
],
|
| 82 |
+
"text": {
|
| 83 |
+
"dtype": "string",
|
| 84 |
+
"_type": "Value"
|
| 85 |
+
}
|
| 86 |
+
},
|
| 87 |
+
"homepage": "",
|
| 88 |
+
"license": ""
|
| 89 |
+
}
|
UM_Handbook/outputs/baseline2_rag_harness_agent/trainer_runs/hf_dataset_dict/test/state.json
ADDED
|
@@ -0,0 +1,13 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
{
|
| 2 |
+
"_data_files": [
|
| 3 |
+
{
|
| 4 |
+
"filename": "data-00000-of-00001.arrow"
|
| 5 |
+
}
|
| 6 |
+
],
|
| 7 |
+
"_fingerprint": "936c078af06a1d62",
|
| 8 |
+
"_format_columns": null,
|
| 9 |
+
"_format_kwargs": {},
|
| 10 |
+
"_format_type": null,
|
| 11 |
+
"_output_all_columns": false,
|
| 12 |
+
"_split": null
|
| 13 |
+
}
|
UM_Handbook/outputs/baseline2_rag_harness_agent/trainer_runs/hf_dataset_dict/train/data-00000-of-00001.arrow
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:a886ad01c88df3805ff1754bc09af8c94c3cf72fe5908edaff92b5b70943c8c7
|
| 3 |
+
size 6287752
|
UM_Handbook/outputs/baseline2_rag_harness_agent/trainer_runs/hf_dataset_dict/train/dataset_info.json
ADDED
|
@@ -0,0 +1,89 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
{
|
| 2 |
+
"citation": "",
|
| 3 |
+
"description": "",
|
| 4 |
+
"features": {
|
| 5 |
+
"qa_id": {
|
| 6 |
+
"dtype": "string",
|
| 7 |
+
"_type": "Value"
|
| 8 |
+
},
|
| 9 |
+
"index_id": {
|
| 10 |
+
"dtype": "string",
|
| 11 |
+
"_type": "Value"
|
| 12 |
+
},
|
| 13 |
+
"question": {
|
| 14 |
+
"dtype": "string",
|
| 15 |
+
"_type": "Value"
|
| 16 |
+
},
|
| 17 |
+
"answer": {
|
| 18 |
+
"dtype": "string",
|
| 19 |
+
"_type": "Value"
|
| 20 |
+
},
|
| 21 |
+
"scope_label": {
|
| 22 |
+
"dtype": "string",
|
| 23 |
+
"_type": "Value"
|
| 24 |
+
},
|
| 25 |
+
"source_doc": {
|
| 26 |
+
"dtype": "string",
|
| 27 |
+
"_type": "Value"
|
| 28 |
+
},
|
| 29 |
+
"section": {
|
| 30 |
+
"dtype": "string",
|
| 31 |
+
"_type": "Value"
|
| 32 |
+
},
|
| 33 |
+
"subsection": {
|
| 34 |
+
"dtype": "string",
|
| 35 |
+
"_type": "Value"
|
| 36 |
+
},
|
| 37 |
+
"retrieved_kb_ids": {
|
| 38 |
+
"feature": {
|
| 39 |
+
"dtype": "string",
|
| 40 |
+
"_type": "Value"
|
| 41 |
+
},
|
| 42 |
+
"_type": "Sequence"
|
| 43 |
+
},
|
| 44 |
+
"retrieved_sections": {
|
| 45 |
+
"feature": {
|
| 46 |
+
"dtype": "string",
|
| 47 |
+
"_type": "Value"
|
| 48 |
+
},
|
| 49 |
+
"_type": "Sequence"
|
| 50 |
+
},
|
| 51 |
+
"retrieved_scopes": {
|
| 52 |
+
"feature": {
|
| 53 |
+
"dtype": "string",
|
| 54 |
+
"_type": "Value"
|
| 55 |
+
},
|
| 56 |
+
"_type": "Sequence"
|
| 57 |
+
},
|
| 58 |
+
"prompt": [
|
| 59 |
+
{
|
| 60 |
+
"content": {
|
| 61 |
+
"dtype": "string",
|
| 62 |
+
"_type": "Value"
|
| 63 |
+
},
|
| 64 |
+
"role": {
|
| 65 |
+
"dtype": "string",
|
| 66 |
+
"_type": "Value"
|
| 67 |
+
}
|
| 68 |
+
}
|
| 69 |
+
],
|
| 70 |
+
"completion": [
|
| 71 |
+
{
|
| 72 |
+
"content": {
|
| 73 |
+
"dtype": "string",
|
| 74 |
+
"_type": "Value"
|
| 75 |
+
},
|
| 76 |
+
"role": {
|
| 77 |
+
"dtype": "string",
|
| 78 |
+
"_type": "Value"
|
| 79 |
+
}
|
| 80 |
+
}
|
| 81 |
+
],
|
| 82 |
+
"text": {
|
| 83 |
+
"dtype": "string",
|
| 84 |
+
"_type": "Value"
|
| 85 |
+
}
|
| 86 |
+
},
|
| 87 |
+
"homepage": "",
|
| 88 |
+
"license": ""
|
| 89 |
+
}
|
UM_Handbook/outputs/baseline2_rag_harness_agent/trainer_runs/hf_dataset_dict/train/state.json
ADDED
|
@@ -0,0 +1,13 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
{
|
| 2 |
+
"_data_files": [
|
| 3 |
+
{
|
| 4 |
+
"filename": "data-00000-of-00001.arrow"
|
| 5 |
+
}
|
| 6 |
+
],
|
| 7 |
+
"_fingerprint": "a4eeb3b4abeec226",
|
| 8 |
+
"_format_columns": null,
|
| 9 |
+
"_format_kwargs": {},
|
| 10 |
+
"_format_type": null,
|
| 11 |
+
"_output_all_columns": false,
|
| 12 |
+
"_split": null
|
| 13 |
+
}
|
UM_Handbook/outputs/baseline2_rag_harness_agent/trainer_runs/hf_dataset_dict/validation/data-00000-of-00001.arrow
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:f2e91442ff072666e35d8227c37e03637290a2bfbbff9b277b8fe61cff2b1862
|
| 3 |
+
size 782200
|
UM_Handbook/outputs/baseline2_rag_harness_agent/trainer_runs/hf_dataset_dict/validation/dataset_info.json
ADDED
|
@@ -0,0 +1,89 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
{
|
| 2 |
+
"citation": "",
|
| 3 |
+
"description": "",
|
| 4 |
+
"features": {
|
| 5 |
+
"qa_id": {
|
| 6 |
+
"dtype": "string",
|
| 7 |
+
"_type": "Value"
|
| 8 |
+
},
|
| 9 |
+
"index_id": {
|
| 10 |
+
"dtype": "string",
|
| 11 |
+
"_type": "Value"
|
| 12 |
+
},
|
| 13 |
+
"question": {
|
| 14 |
+
"dtype": "string",
|
| 15 |
+
"_type": "Value"
|
| 16 |
+
},
|
| 17 |
+
"answer": {
|
| 18 |
+
"dtype": "string",
|
| 19 |
+
"_type": "Value"
|
| 20 |
+
},
|
| 21 |
+
"scope_label": {
|
| 22 |
+
"dtype": "string",
|
| 23 |
+
"_type": "Value"
|
| 24 |
+
},
|
| 25 |
+
"source_doc": {
|
| 26 |
+
"dtype": "string",
|
| 27 |
+
"_type": "Value"
|
| 28 |
+
},
|
| 29 |
+
"section": {
|
| 30 |
+
"dtype": "string",
|
| 31 |
+
"_type": "Value"
|
| 32 |
+
},
|
| 33 |
+
"subsection": {
|
| 34 |
+
"dtype": "string",
|
| 35 |
+
"_type": "Value"
|
| 36 |
+
},
|
| 37 |
+
"retrieved_kb_ids": {
|
| 38 |
+
"feature": {
|
| 39 |
+
"dtype": "string",
|
| 40 |
+
"_type": "Value"
|
| 41 |
+
},
|
| 42 |
+
"_type": "Sequence"
|
| 43 |
+
},
|
| 44 |
+
"retrieved_sections": {
|
| 45 |
+
"feature": {
|
| 46 |
+
"dtype": "string",
|
| 47 |
+
"_type": "Value"
|
| 48 |
+
},
|
| 49 |
+
"_type": "Sequence"
|
| 50 |
+
},
|
| 51 |
+
"retrieved_scopes": {
|
| 52 |
+
"feature": {
|
| 53 |
+
"dtype": "string",
|
| 54 |
+
"_type": "Value"
|
| 55 |
+
},
|
| 56 |
+
"_type": "Sequence"
|
| 57 |
+
},
|
| 58 |
+
"prompt": [
|
| 59 |
+
{
|
| 60 |
+
"content": {
|
| 61 |
+
"dtype": "string",
|
| 62 |
+
"_type": "Value"
|
| 63 |
+
},
|
| 64 |
+
"role": {
|
| 65 |
+
"dtype": "string",
|
| 66 |
+
"_type": "Value"
|
| 67 |
+
}
|
| 68 |
+
}
|
| 69 |
+
],
|
| 70 |
+
"completion": [
|
| 71 |
+
{
|
| 72 |
+
"content": {
|
| 73 |
+
"dtype": "string",
|
| 74 |
+
"_type": "Value"
|
| 75 |
+
},
|
| 76 |
+
"role": {
|
| 77 |
+
"dtype": "string",
|
| 78 |
+
"_type": "Value"
|
| 79 |
+
}
|
| 80 |
+
}
|
| 81 |
+
],
|
| 82 |
+
"text": {
|
| 83 |
+
"dtype": "string",
|
| 84 |
+
"_type": "Value"
|
| 85 |
+
}
|
| 86 |
+
},
|
| 87 |
+
"homepage": "",
|
| 88 |
+
"license": ""
|
| 89 |
+
}
|
UM_Handbook/outputs/baseline2_rag_harness_agent/trainer_runs/hf_dataset_dict/validation/state.json
ADDED
|
@@ -0,0 +1,13 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
{
|
| 2 |
+
"_data_files": [
|
| 3 |
+
{
|
| 4 |
+
"filename": "data-00000-of-00001.arrow"
|
| 5 |
+
}
|
| 6 |
+
],
|
| 7 |
+
"_fingerprint": "6cb74bcfac21e0ef",
|
| 8 |
+
"_format_columns": null,
|
| 9 |
+
"_format_kwargs": {},
|
| 10 |
+
"_format_type": null,
|
| 11 |
+
"_output_all_columns": false,
|
| 12 |
+
"_split": null
|
| 13 |
+
}
|
UM_Handbook/outputs/baseline2_rag_harness_agent/trainer_runs/train_metrics.json
ADDED
|
@@ -0,0 +1,7 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
{
|
| 2 |
+
"train_runtime": 3233.2916,
|
| 3 |
+
"train_samples_per_second": 4.949,
|
| 4 |
+
"train_steps_per_second": 0.619,
|
| 5 |
+
"total_flos": 4.4872917665132544e+17,
|
| 6 |
+
"train_loss": 0.2747548096684309
|
| 7 |
+
}
|