BtB-ExpC commited on
Commit
9682381
Β·
1 Parent(s): 5b78005

proompted (fluster A)

Browse files
app/ui/write_fluster_tab.py CHANGED
@@ -36,11 +36,11 @@ def build_write_fluster_tab():
36
  # row 1 => (box_0, box_1)
37
  # row 2 => (box_2, box_3)
38
  with gr.Row():
39
- box_0 = gr.Textbox(label="Prompt A + LLM 1", interactive=False)
40
- box_1 = gr.Textbox(label="Prompt A + LLM 1", interactive=False)
41
  with gr.Row():
42
- box_2 = gr.Textbox(label="Prompt A + LLM 2", interactive=False)
43
- box_3 = gr.Textbox(label="Prompt A + LLM 2", interactive=False)
44
 
45
 
46
 
 
36
  # row 1 => (box_0, box_1)
37
  # row 2 => (box_2, box_3)
38
  with gr.Row():
39
+ box_0 = gr.Textbox(label="Prompt A + LLM 1", interactive=False, lines=14)
40
+ box_1 = gr.Textbox(label="Prompt A + LLM 1", interactive=False, lines=14)
41
  with gr.Row():
42
+ box_2 = gr.Textbox(label="Prompt A + LLM 2", interactive=False, lines=14)
43
+ box_3 = gr.Textbox(label="Prompt A + LLM 2", interactive=False, lines=14)
44
 
45
 
46
 
config/system_prompt_texts.py CHANGED
@@ -495,15 +495,41 @@ template_sanitize_learning_objectives_text = """
495
 
496
  template_write_fluster_a_text= """
497
  # Task outline
498
- Given a learning objective, your goal is to write an exercise set of 3 high-quality multiple choice exercises that all test the exact same knowledge that's stated in the learning objective.
 
 
499
 
500
  # Concepts
 
501
  ## Learning objective
502
- Tests a specific fact. For example: "De student weet dat de Wet Bopz sinds 1994 in gebruik was (tot hij in 2020 werd opgevolgd door de Wvggz).
503
- All exercises must test the very same specific key part of the given learning objective, focusing only on info that's not in parentheses. Any text between parentheses must only be used in the Theory or Explanation sections of the exercises (Theory if it's important for understanding the exercise beforehand, explanation if it's .
 
 
 
 
 
 
 
 
 
504
 
505
  ## Exercise set
506
- Comprises 3 exercises that all test the same single learning objective: one bigger multiple choice exercise and two smaller true/false statements. See this example:
 
 
 
 
 
 
 
 
 
 
 
 
 
 
507
  <exercise_set>
508
  <multiple_choice_exercise>
509
  <prompt>
@@ -546,11 +572,11 @@ Comprises 3 exercises that all test the same single learning objective: one bigg
546
  2. Deze stelling is niet correct
547
  </choices>
548
  <correct_answer>2</correct_answer>
549
- <explanation>De wet Bopz was sinds **1994** in gebruik. Tot 2020, toen hij werd opgevolgd door de Wvggz.</explanation>
550
  </false_statement>
551
  </exercise_set>
552
 
553
- Here's another example of an exercise set, this time for the learning objective: "De student weet dat je dagelijks oefent om zo objectief (zonder je eigen mening) mogelijk te observeren."
554
  <exercise_set>
555
  <multiple_choice_exercise>
556
  <prompt>
@@ -605,24 +631,41 @@ Here's another example of an exercise set, this time for the learning objective:
605
  </exercise_set>
606
 
607
 
 
 
 
 
 
 
 
 
 
 
 
 
608
  ## Distractors
609
- The alternative answer options of the multiple choice exercise that are not the correct answer are called distractors. These are the most important part of the exercise. Effective distractors strike an optimal balance between "very plausible to someone who doesn't know the answer to the question" and "clearly wrong to someone who does know the answer to the question".
 
 
 
 
 
 
610
 
611
- ## Theory
612
- Optional. Sometimes there's additional knowledge present in the learning objective (often between parentheses) that is not the direct focus to test, but useful to know for the student beforehand to better understand the question. This is then added as Theory in the prompt. The student gets to see this as part of the exercise prompt.
613
 
614
- # Explanation
615
- Optional. Sometimes there's additional knowledge present in the learning objective (often between parentheses, or as a subclause) that is not the direct focus to know, nor is it necessary to clarify the prompt. If this is useful related, additional info, add it to the explanation, so that the student gets to see this after they pick their answer. The false statement always needs an explanation, to tell the student why the statement is incorrect (explaining what the true statement would have been). Other exercises should only get an explanation if the learning objective contains appropriate info for this.
616
 
617
  # Approach
618
- Think long and hard about the ideal three exercises to test the given learning objective. Especially spend a lot of time picking good distractors for the first multiple choice exercise.
619
- Imagine the typical slightly more stupid target student among the target audience for this learning objective. Would they sometimes find each distractor sound appealing if they hadn't studied for the test? We want to avoid the possibility that they can too easily dismiss and eliminate a distractor as clearly not a serious option, just on the basis of it looking weird to them. Imagine whether very stupid students with limited general knowledge, and no knowledge of the topic of the exercise, might find the distractor plausible. That's the goal.
620
- At the same time, a distractor must not be too close to the truth either. That would be misleading. Imagine asking 10 domain experts to judge this. All of them should agree that the correct answer is the clearly best answer in this exercise. If there's any doubt, rephrase the distractor to be a bit less true, and imagine again.
621
- After lots of iterative prep and reasoning, considering a wide range of options, weighing what would be the best, finally return a complete exercise set of 1 multiple choice exercise and 2 statements.
622
-
623
- ## Pointers
624
- - Try to exactly match the content and language level in the learning objective. If it's stated in simple words, use equally simple words in the exercises as well.
625
- - Output format doesn't matter: prioritize careful reasoning.
626
  """
627
 
628
  uitgangspunt_template_for_writing_a_fluster = """
@@ -631,7 +674,7 @@ Given a learning objective, your goal is to write an exercise set of 3 high-qual
631
 
632
  # Concepts
633
  ## Learning objective
634
- Tests a specific fact. For example: "De student weet dat de Wet Bopz sinds 1994 in gebruik was (tot hij in 2020 werd opgevolgd door de Wvggz).
635
  All exercises must test the very same specific key part of the given learning objective, focusing only on info that's not in parentheses. Any text between parentheses must only be used in the Theory or Explanation sections of the exercises (Theory if it's important for understanding the exercise beforehand, explanation if it's .
636
 
637
  ## Exercise set
@@ -642,7 +685,7 @@ Comprises 3 exercises that all test the same single learning objective: one bigg
642
  Stelling:
643
  De wet Bopz was sinds ..... in gebruik.
644
  </prompt>
645
-
646
  <choices>
647
  1. 1984
648
  2. 1999
@@ -652,13 +695,13 @@ Comprises 3 exercises that all test the same single learning objective: one bigg
652
  <correct_answer>2</correct_answer>
653
  <explanation>In 2020 werd de wet Bopz opgevolgd door de Wvggz.</explanation>
654
  </multiple_choice_exercise>
655
-
656
  <true_statement>
657
  <prompt>
658
  Stelling:
659
  De wet Bopz was sinds 1994 in gebruik.
660
  </prompt>
661
-
662
  <choices>
663
  1. Deze stelling is correct
664
  2. Deze stelling is niet correct
@@ -672,7 +715,7 @@ Comprises 3 exercises that all test the same single learning objective: one bigg
672
  Stelling:
673
  De wet Bopz was sinds 1984 in gebruik.
674
  </prompt>
675
-
676
  <choices>
677
  1. Deze stelling is correct
678
  2. Deze stelling is niet correct
@@ -688,11 +731,11 @@ Here's another example of an exercise set, this time for the learning objective:
688
  <prompt>
689
  Theorie:
690
  Objectief betekent "zonder je eigen mening".
691
-
692
  Vraag:
693
  Wat moet je doen om zo objectief mogelijk te observeren?
694
  </prompt>
695
-
696
  <choices>
697
  1. Je intuΓ―tie volgen
698
  2. Veel theorie leren
@@ -701,16 +744,16 @@ Here's another example of an exercise set, this time for the learning objective:
701
  </choices>
702
  <correct_answer>4</correct_answer>
703
  </multiple_choice_exercise>
704
-
705
  <true_statement>
706
  <prompt>
707
  Theorie:
708
  Objectief betekent "zonder je eigen mening".
709
-
710
  Stelling:
711
  Om zo objectief mogelijk te observeren moet je dagelijks oefenen.
712
  </prompt>
713
-
714
  <choices>
715
  1. Deze stelling is correct
716
  2. Deze stelling is niet correct
@@ -722,11 +765,11 @@ Here's another example of an exercise set, this time for the learning objective:
722
  <prompt>
723
  Theorie:
724
  Objectief betekent "zonder je eigen mening".
725
-
726
  Stelling:
727
  Om zo objectief mogelijk te observeren moet je een keer per jaar oefenen.
728
  </prompt>
729
-
730
  <choices>
731
  1. Deze stelling is correct
732
  2. Deze stelling is niet correct
@@ -743,23 +786,18 @@ The alternative answer options of the multiple choice exercise that are not the
743
  ## Theory
744
  Optional. Sometimes there's additional knowledge present in the learning objective (often between parentheses) that is not the direct focus to test, but useful to know for the student beforehand to better understand the question. This is then added as Theory in the prompt. The student gets to see this as part of the exercise prompt.
745
 
746
- # Explanation
747
  Optional. Sometimes there's additional knowledge present in the learning objective (often between parentheses, or as a subclause) that is not the direct focus to know, nor is it necessary to clarify the prompt. If this is useful related, additional info, add it to the explanation, so that the student gets to see this after they pick their answer. The false statement always needs an explanation, to tell the student why the statement is incorrect (explaining what the true statement would have been). Other exercises should only get an explanation if the learning objective contains appropriate info for this.
748
 
749
  # Approach
750
- Think long and hard about the ideal three exercises to test the given learning objective.
751
- ## Distractors
752
- Especially spend a lot of time picking good distractors for the first multiple choice exercise.
753
- Imagine the typical Dumb Student among the target audience for the given learning objective, bottom of their class. Would they sometimes find each distractor sound appealing if they hadn't studied for the test? We want to avoid the possibility that they can too easily dismiss and eliminate a distractor as clearly not a serious option, just on the basis of it looking weird to them. Imagine whether very stupid students with limited general knowledge, and no knowledge of the topic of the exercise, might find the distractor plausible. That's the goal.
754
- At the same time, a distractor must not be too close to the truth either, because that would be misleading.
755
- Imagine asking an Expert Panel of 10 domain experts to judge this. All of them should agree that the correct answer is the clearly best answer in this exercise.
756
- Really carry out these thought experiments of how the Dumb Student and the Expert Panel would look at the exercise. If there's any doubt the experts would agree on the best answer, rephrase the offending distractor to be a bit less true, and imagine again. If there's
757
  After lots of iterative prep and reasoning, considering a wide range of options, weighing what would be the best, finally return a complete exercise set of 1 multiple choice exercise and 2 statements.
758
-
759
  ## Pointers
760
  - Try to exactly match the content and language level in the learning objective. If it's stated in simple words, use equally simple words in the exercises as well.
761
- - Avoid the use of unnecessarily strong false statements or distractors using words like "all", "never" or "exclusively" etc., unless the correct answer is equally extreme. Otherwise, distractors using such strong claims are too easy to dismiss. For example: instead of your false statement being "De *enige* factor die slaapkwaliteit beΓ―nvloedt, is consistent naar bed gaan", it is better to give a less extreme (and therefore more plausible-sounding) statement, like: "De hoofdfactor die slaapkwaliteit beΓ―nvloedt, is consistent naar bed gaan".
762
- - Output format doesn't matter, prioritize careful reasoning.
763
  """
764
 
765
 
 
495
 
496
  template_write_fluster_a_text= """
497
  # Task outline
498
+
499
+ Given a learning objective, your goal is to write an exercise set of 3 high-quality multiple choice exercises that all test the exact same key fact that's stated in the learning objective.
500
+
501
 
502
  # Concepts
503
+
504
  ## Learning objective
505
+ A learning objective states a specific fact. For example: "De student weet dat de Wet Bopz sinds 1994 in gebruik was (tot hij in 2020 werd opgevolgd door de Wvggz)." It consist of the fact of focus, and sometimes some additional context between parentheses.
506
+
507
+ ## Exercise
508
+ An exercise tests the fact that is stated in the learning objective. It consists of:
509
+ 1. A prompt, posing to the student:
510
+ - A question or statement
511
+ - (Optional) Theory, additional information to clarify the question or statement
512
+ 2. Choices, which are the multiple answer options that are presented to the student as potential answers to the prompt.
513
+ 3. Correct answer, which indicates which of the choices is the correct answer to the prompt.
514
+ 4. (Optional) Explanation, explaining or expanding on the answer to the student to facilitate increased learning.
515
+ The student is always first presented with 1 and 2 (prompt and choices), and then, after they've picked an answer, they get to see 3 and 4 (the correct answer and optionally an explanation).
516
 
517
  ## Exercise set
518
+ An exercise set comprises 3 exercises that all test the same single learning objective in three different ways: one bigger multiple choice exercise and two smaller true/false statements.
519
+
520
+ ## Distractors
521
+ Distractors are the alternative answer option choices of the exercises that are not the correct answer. The false statement can also be considered a distractor (tempting the student to thing it is correct). Distractors are in fact the most important part of the exercises, because they often either make or break it. This is because distractors are difficult to get right, because in order to be effective they need to strike a precarious balance between "plausible-sounding" and yet "not too close to the truth", both at the same time. More on that in the requirements section.
522
+
523
+ ## Theory (optional)
524
+ Theory is sometimes shown before answering the exercise, as an optional part of the prompt to clarify the question.
525
+
526
+ ## Explanation (optional)
527
+ An explanation should sometimes be presented to the student after they've answered the exercise, as an optional part of the correct answer reveal to better facilitate learning.
528
+
529
+
530
+ # Examples
531
+
532
+ ## Exercise set for the learning objective: "De student weet dat de Wet Bopz sinds 1994 in gebruik was (tot hij in 2020 werd opgevolgd door de Wvggz)".
533
  <exercise_set>
534
  <multiple_choice_exercise>
535
  <prompt>
 
572
  2. Deze stelling is niet correct
573
  </choices>
574
  <correct_answer>2</correct_answer>
575
+ <explanation>De wet Bopz was sinds **1994** in gebruik. Tot 2020, toen hij werd opgevolgd door de Wvggz.</explanation>
576
  </false_statement>
577
  </exercise_set>
578
 
579
+ ## Exercise set for the learning objective: "De student weet dat je dagelijks oefent om zo objectief (zonder je eigen mening) mogelijk te observeren."
580
  <exercise_set>
581
  <multiple_choice_exercise>
582
  <prompt>
 
631
  </exercise_set>
632
 
633
 
634
+ # Requirements
635
+
636
+ ## Exercise
637
+ Each of the 3 exercises must test the very same key fact in the given learning objective (the info that's not in parentheses). Any text between parentheses must only be used in the Theory or Explanation sections of the exercises.
638
+
639
+ ## Theory & Explanation (optional)
640
+ Theory or Explanation should only be added to all 3 exercises if there's additional info present in the learning objective (often between parentheses, or as a subclause) that is outside of the main fact that's to be tested.
641
+ ### Theory
642
+ Put any info here that is useful for the student to know before answering the question, as context to clarify the question or statement. The student is prompted with this together with the posing of the rest of the exercise.
643
+ ### Explanation
644
+ Put any info there that is not necessary to clarify the prompt beforehand (or that might in fact spoil the answer). The explanation section is for supplemental info that'll help the student to facilitate learning after they've already seen the correct answer. The false statement should always get an explanation, to tell the student why the statement is incorrect (explaining what the true statement would have been).
645
+
646
  ## Distractors
647
+ A good distractor makes a student pause and consider it, separating those who understand the material from those who do not. A bad distractor fails to do this; it can either:
648
+ 1. Confuse or trick even well-prepared students into believing it might be correct (β€œtoo close to the truth”)
649
+ 2. Be so obviously wrong that no one would reasonably choose it, not even the least knowledgeable student (β€œtoo obviously false”).
650
+ To be effective, distractors must therefore look "very plausible to someone who doesn't know the topic" and yet remai n "clearly wrong to someone who knows the topic well", all at the same time.
651
+ Distractors are too close to the truth, when they are so similar to the correct answer that experts might debate whether they're also valid. They create unnecessary ambiguity and frustrate knowledgeable test-takers, for example by containing partial truths.
652
+ Distractors are too obviously false, when they are clearly ridiculous or fantastical to even the dumbest student.
653
+ The ideal distractor falls in the middle of this spectrum - plausible enough to tempt those with incomplete knowledge, but clearly incorrect to those who understand the material.
654
 
655
+ ## Language level
656
+ Try to exactly match the terminology and language difficulty level from the learning objective. If it's stated in simple words, use equally simple words in the exercises as well.
657
 
658
+ ## Output format
659
+ Output format doesn't matter. Only prioritize thorough reasoning to arrive at high-quality exercises that satisfy all of the above requirements.
660
 
661
  # Approach
662
+
663
+ Think long and hard about the ideal three exercises to test the given learning objective. Especially spend a lot of time iteratively coming up with good distractors and a good "false statement", to make sure they optimally satisfy the distractors requirements.
664
+ As intuition pumps:
665
+ - To guard against picking distractors that are too obviously false: Really try to imagine a relatively dumb student among the target audience for the specific given learning objective. Would they feasible find each distractor at least somewhat plausible?
666
+ - To guard against picking distractors that are too close to the truth: Try to imagine a panel of experts judging the distractors. All of them should agree that the correct answer is clearly the best answer for this exercise, and none of them should doubt that any other choices would also be kinda correct.
667
+ If you're unsure about any of your distractors or "false statements" one way or the other, adapt them accordingly, and run the thought experiment again, until you get all distractors just right.
668
+ After lots of iterative prep, trying out different things and reasoning through a wide range of potential options, finally return a complete exercise set of 1 bigger multiple choice exercise and 2 smaller True/False statements.
 
669
  """
670
 
671
  uitgangspunt_template_for_writing_a_fluster = """
 
674
 
675
  # Concepts
676
  ## Learning objective
677
+ Tests a specific fact. For example: "De student weet dat de Wet Bopz sinds 1994 in gebruik was (tot hij in 2020 werd opgevolgd door de Wvggz)."
678
  All exercises must test the very same specific key part of the given learning objective, focusing only on info that's not in parentheses. Any text between parentheses must only be used in the Theory or Explanation sections of the exercises (Theory if it's important for understanding the exercise beforehand, explanation if it's .
679
 
680
  ## Exercise set
 
685
  Stelling:
686
  De wet Bopz was sinds ..... in gebruik.
687
  </prompt>
688
+
689
  <choices>
690
  1. 1984
691
  2. 1999
 
695
  <correct_answer>2</correct_answer>
696
  <explanation>In 2020 werd de wet Bopz opgevolgd door de Wvggz.</explanation>
697
  </multiple_choice_exercise>
698
+
699
  <true_statement>
700
  <prompt>
701
  Stelling:
702
  De wet Bopz was sinds 1994 in gebruik.
703
  </prompt>
704
+
705
  <choices>
706
  1. Deze stelling is correct
707
  2. Deze stelling is niet correct
 
715
  Stelling:
716
  De wet Bopz was sinds 1984 in gebruik.
717
  </prompt>
718
+
719
  <choices>
720
  1. Deze stelling is correct
721
  2. Deze stelling is niet correct
 
731
  <prompt>
732
  Theorie:
733
  Objectief betekent "zonder je eigen mening".
734
+
735
  Vraag:
736
  Wat moet je doen om zo objectief mogelijk te observeren?
737
  </prompt>
738
+
739
  <choices>
740
  1. Je intuΓ―tie volgen
741
  2. Veel theorie leren
 
744
  </choices>
745
  <correct_answer>4</correct_answer>
746
  </multiple_choice_exercise>
747
+
748
  <true_statement>
749
  <prompt>
750
  Theorie:
751
  Objectief betekent "zonder je eigen mening".
752
+
753
  Stelling:
754
  Om zo objectief mogelijk te observeren moet je dagelijks oefenen.
755
  </prompt>
756
+
757
  <choices>
758
  1. Deze stelling is correct
759
  2. Deze stelling is niet correct
 
765
  <prompt>
766
  Theorie:
767
  Objectief betekent "zonder je eigen mening".
768
+
769
  Stelling:
770
  Om zo objectief mogelijk te observeren moet je een keer per jaar oefenen.
771
  </prompt>
772
+
773
  <choices>
774
  1. Deze stelling is correct
775
  2. Deze stelling is niet correct
 
786
  ## Theory
787
  Optional. Sometimes there's additional knowledge present in the learning objective (often between parentheses) that is not the direct focus to test, but useful to know for the student beforehand to better understand the question. This is then added as Theory in the prompt. The student gets to see this as part of the exercise prompt.
788
 
789
+ ## Explanation
790
  Optional. Sometimes there's additional knowledge present in the learning objective (often between parentheses, or as a subclause) that is not the direct focus to know, nor is it necessary to clarify the prompt. If this is useful related, additional info, add it to the explanation, so that the student gets to see this after they pick their answer. The false statement always needs an explanation, to tell the student why the statement is incorrect (explaining what the true statement would have been). Other exercises should only get an explanation if the learning objective contains appropriate info for this.
791
 
792
  # Approach
793
+ Think long and hard about the ideal three exercises to test the given learning objective. Especially spend a lot of time picking good distractors for the first multiple choice exercise.
794
+ Imagine the typical slightly more stupid target student among the target audience for this learning objective. Would they sometimes find each distractor sound appealing if they hadn't studied for the test? We want to avoid the possibility that they can too easily dismiss and eliminate a distractor as clearly not a serious option, just on the basis of it looking weird to them. Imagine whether very stupid students with limited general knowledge, and no knowledge of the topic of the exercise, might find the distractor plausible. That's the goal.
795
+ At the same time, a distractor must not be too close to the truth either. That would be misleading. Imagine asking 10 domain experts to judge this. All of them should agree that the correct answer is the clearly best answer in this exercise. If there's any doubt, rephrase the distractor to be a bit less true, and imagine again.
 
 
 
 
796
  After lots of iterative prep and reasoning, considering a wide range of options, weighing what would be the best, finally return a complete exercise set of 1 multiple choice exercise and 2 statements.
797
+
798
  ## Pointers
799
  - Try to exactly match the content and language level in the learning objective. If it's stated in simple words, use equally simple words in the exercises as well.
800
+ - Output format doesn't matter: prioritize careful reasoning.
 
801
  """
802
 
803
 
config/templates.py CHANGED
@@ -99,27 +99,27 @@ diagnose_scorecard_template = ChatPromptTemplate(
99
  1. Double negatives (if the exercise contains something like 'to not not do something', this is undesirable)
100
  2. The correct multiple choice answer option stands out from the rest (this is a hint for the student)
101
  3. A distractors answer option is too obviously false (it's useless, no student would ever pick it)
102
- 4. A distractors answer option is actually also kinda correct (it's misleading, if a student picks it they're not 100% wrong)
103
  Use these two icons:
104
  - βœ… means the diagnosis of the issue came back negative, so the issue is not present.
105
  - ❌ means the diagnosis of the issue came back positive, so the issue is present.
106
  (and a third icon if need be: - ❔ means the diagnosis is unclear)
107
  The scorecard should always look like this:
108
  <template>
109
- 1. The exercise does not contain/contains a double negative: βœ…/❌ -- 2. The correct answer does not/does stand out: βœ…/❌ -- 3. None/Some of the distractors are too obviously false: βœ…/❌ -- 4. None/Some of the distractors are actually also kinda correct: βœ…/❌
110
  </template>
111
  <example 1>
112
- 1. The exercise doesn't contain a double negative: βœ… -- 2. The correct answer does not stand out: βœ… -- 3. None of the distractors are too obviously false: βœ… -- 4. None of the distractors are actually also kinda correct: βœ…
113
  </example 1>
114
  <example 2>
115
- 1. The exercise doesn't contain a double negative: βœ… -- 2. The correct answer does stand out: ❌ -- 3. None of the distractors are too obviously false: βœ… -- 4. Some of the distractors are actually also kinda correct: ❌
116
  </example 2>
117
  <example 3>
118
- 1. The exercise contains a double negative: ❌ -- 2. The correct answer does not stand out: βœ… -- 3. Some of the distractors are too obviously false: ❌ -- 4. None of the distractors are actually also kinda correct: βœ…
119
  </example 3>
120
- Oftentimes, diagnoses will be elaborate and quite nuanced, first viewing the issue from different angles, considering both scenarios of passing and failing equally. For this reason, when deciding on your binary classification, you should focus only on the very last concluding sentences of each diagnosis to determine a pass or fail.
121
  """),
122
- ("human", "For context, here is the exercise that's being diagnose:\n"
123
  "{standardized_exercise}\n\n"
124
  "Here are the diagnoses:\n"
125
  "{combined_diagnosis}")
 
99
  1. Double negatives (if the exercise contains something like 'to not not do something', this is undesirable)
100
  2. The correct multiple choice answer option stands out from the rest (this is a hint for the student)
101
  3. A distractors answer option is too obviously false (it's useless, no student would ever pick it)
102
+ 4. A distractors answer option is too close to the truth (it's misleading, if a student picks it some experts might agree they are correct)
103
  Use these two icons:
104
  - βœ… means the diagnosis of the issue came back negative, so the issue is not present.
105
  - ❌ means the diagnosis of the issue came back positive, so the issue is present.
106
  (and a third icon if need be: - ❔ means the diagnosis is unclear)
107
  The scorecard should always look like this:
108
  <template>
109
+ 1. The exercise does not contain/contains a double negative: βœ…/❌ -- 2. The correct answer does not/does stand out: βœ…/❌ -- 3. None/Some of the distractors are too close to the truth: βœ…/❌ -- 4. None/Some of the distractors are too close to the truth: βœ…/❌
110
  </template>
111
  <example 1>
112
+ 1. The exercise doesn't contain a double negative: βœ… -- 2. The correct answer does not stand out: βœ… -- 3. None of the distractors are too obviously false: βœ… -- 4. None of the distractors are too close to the truth: βœ…
113
  </example 1>
114
  <example 2>
115
+ 1. The exercise doesn't contain a double negative: βœ… -- 2. The correct answer does stand out: ❌ -- 3. None of the distractors are too obviously false: βœ… -- 4. Some of the distractors are too close to the truth: ❌
116
  </example 2>
117
  <example 3>
118
+ 1. The exercise contains a double negative: ❌ -- 2. The correct answer does not stand out: βœ… -- 3. Some of the distractors are too obviously false: ❌ -- 4. None of the distractors are too close to the truth: βœ…
119
  </example 3>
120
+ Oftentimes, diagnoses will be elaborate and quite nuanced, first viewing the issue from different angles, considering both scenarios of passing and failing equally. For this reason, when deciding on your binary classification, you should focus only on the very last concluding sentences of each diagnosis to determine an ultimate pass or fail.
121
  """),
122
+ ("human", "For context, here is the exercise that's being diagnosed:\n"
123
  "{standardized_exercise}\n\n"
124
  "Here are the diagnoses:\n"
125
  "{combined_diagnosis}")
utils/pending_issues.md β†’ pending_issues.md RENAMED
File without changes