Adherence commited on
Commit
c434c76
·
verified ·
1 Parent(s): 174cad1

Upload README.md with huggingface_hub

Browse files
Files changed (1) hide show
  1. README.md +46 -8
README.md CHANGED
@@ -12,25 +12,55 @@ license: mit
12
 
13
  # Nuremberg Trials AI
14
 
15
- An AI-powered question-answering system for the Nuremberg Trials (1945-1946).
16
 
17
  ## Features
18
 
19
- - **RAG-powered answers**: Retrieves relevant passages from 12,000+ document chunks
20
  - **Source citations**: Every answer shows the original source documents
21
- - **Comprehensive data**: Covers IMT transcripts, judgments, charter, and defendant biographies
22
 
23
  ## Data Sources
24
 
25
- - **Harvard Law School Nuremberg Trials Project**: Full IMT transcript (17,268 pages)
26
- - **Yale Avalon Project**: Judgments, indictments, London Charter
27
- - **Wikipedia**: Defendant biographies
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
28
 
29
  ## Technology
30
 
31
  - **Embeddings**: all-MiniLM-L6-v2 (sentence-transformers)
32
- - **Vector Search**: FAISS
33
- - **LLM**: Mistral-7B-Instruct (HuggingFace Inference API)
34
  - **UI**: Gradio
35
 
36
  ## Example Questions
@@ -39,3 +69,11 @@ An AI-powered question-answering system for the Nuremberg Trials (1945-1946).
39
  - What were the four counts in the Nuremberg indictment?
40
  - Who was the chief prosecutor for the United States?
41
  - What happened to Hermann Goering?
 
 
 
 
 
 
 
 
 
12
 
13
  # Nuremberg Trials AI
14
 
15
+ The most comprehensive AI-powered question-answering system for the Nuremberg Trials (1945-1949).
16
 
17
  ## Features
18
 
19
+ - **RAG-powered answers**: Searches through 120,000+ document chunks
20
  - **Source citations**: Every answer shows the original source documents
21
+ - **Complete coverage**: All 13 trials (IMT + 12 NMT), judgments, charter, and biographies
22
 
23
  ## Data Sources
24
 
25
+ ### Harvard Law School Nuremberg Trials Project (518MB)
26
+ Complete transcripts from all 13 trials:
27
+ - Trial 1: Medical Case (Karl Brandt et al.)
28
+ - Trial 2: Milch Case
29
+ - Trial 3: Justice Case
30
+ - Trial 4: Hostages Case
31
+ - Trial 5: Pohl Case
32
+ - Trial 6: Einsatzgruppen Case
33
+ - Trial 7: IMT - Main Trial (Goering et al.)
34
+ - Trial 8: Flick Case
35
+ - Trial 9: IG Farben Case
36
+ - Trial 10: RuSHA Case
37
+ - Trial 11: Krupp Case
38
+ - Trial 12: Ministries Case
39
+ - Trial 13: High Command Case
40
+
41
+ **Total: 153,010 pages**
42
+
43
+ ### Yale Avalon Project (192MB)
44
+ - London Charter & IMT Constitution
45
+ - Indictments and Judgments (all defendants)
46
+ - Trial Proceedings (22 volumes)
47
+ - Nazi Conspiracy & Aggression (4 volumes)
48
+
49
+ **Total: 857 documents, 11.3 million words**
50
+
51
+ ### Wikipedia
52
+ - 24 defendant biographies
53
+ - 11 trial overview pages
54
+ - 8 key prosecutors/judges
55
+ - 6 Nazi organization pages
56
+
57
+ **Total: 49 pages**
58
 
59
  ## Technology
60
 
61
  - **Embeddings**: all-MiniLM-L6-v2 (sentence-transformers)
62
+ - **Vector Search**: FAISS with 120,385 vectors
63
+ - **LLM**: Zephyr-7B (HuggingFace Inference API)
64
  - **UI**: Gradio
65
 
66
  ## Example Questions
 
69
  - What were the four counts in the Nuremberg indictment?
70
  - Who was the chief prosecutor for the United States?
71
  - What happened to Hermann Goering?
72
+ - What were the Nazi medical experiments?
73
+ - What was the Einsatzgruppen trial about?
74
+ - Who was prosecuted in the IG Farben trial?
75
+
76
+ ## Related Resources
77
+
78
+ - **Raw Dataset**: [Adherence/nuremberg-trials-complete](https://huggingface.co/datasets/Adherence/nuremberg-trials-complete)
79
+ - **RAG Dataset**: [Adherence/nuremberg-trials-rag](https://huggingface.co/datasets/Adherence/nuremberg-trials-rag)