Adhitya122 commited on
Commit
79d9284
·
verified ·
1 Parent(s): 8bf28d8

Upload folder using huggingface_hub

Browse files
Files changed (1) hide show
  1. index.html +69 -12
index.html CHANGED
@@ -102,10 +102,10 @@
102
  <div class="prose prose-slate prose-lg max-w-none mb-24 leading-relaxed text-slate-700 border-b pb-20">
103
  <h2 class="text-3xl font-bold text-slate-900 mb-6">Introduction</h2>
104
  <p>
105
- The challenge of modern AI in drug discovery is often reduced to a static prediction task: "Does this molecule bind to this target?" But in a real-world biotech lab, the answer is never that simple. A lead chemist must balance <strong>potency</strong> against <strong>toxicity</strong>, <strong>synthesizability</strong>, and a rapidly depleting <strong>assay budget</strong>.
106
  </p>
107
  <p class="mt-4">
108
- <strong>MolForge</strong> is a verifier-driven reinforcement learning environment that replicates this complexity. We don't trust the model to "guess" the answer. Instead, we force it to execute a sequence of actions—edits, assays, and specialist reviews—until it can justify a final submission with verifiable evidence.
109
  </p>
110
 
111
  <div class="mt-12 p-8 bg-slate-900 text-slate-200 rounded-2xl shadow-lg">
@@ -116,7 +116,67 @@
116
  </div>
117
  </div>
118
 
119
- <!-- Scientific Architecture -->
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
120
  <section class="mb-32">
121
  <div class="flex items-center gap-3 mb-8">
122
  <div class="w-10 h-10 rounded-lg bg-indigo-600 flex items-center justify-center text-white font-bold">1</div>
@@ -127,7 +187,7 @@
127
  MolForge is built as a <strong>Partially Observable Markov Decision Process (POMDP)</strong>. This means the agent never sees the "hidden truth" of the receptor. It only sees what its budget allows it to assay.
128
  </p>
129
 
130
- <div class="shadcn-card p-4 bg-slate-50 mb-12">
131
  <img src="assets/molforge_architecture.png" alt="Architecture" class="rounded-lg w-full">
132
  <p class="mt-4 text-center text-xs text-slate-400 font-medium tracking-wide">THE SCIENTIFIC FEEDBACK LOOP: VERIFIER-FIRST DESIGN</p>
133
  </div>
@@ -230,24 +290,21 @@
230
 
231
  <div class="grid md:grid-cols-3 gap-6 mb-12">
232
  <div class="shadcn-card p-6 bg-slate-50 border-t-4 border-t-indigo-500">
233
- <h4 class="font-bold mb-2">Coarse Shaping</h4>
234
  <p class="text-xs text-slate-500">Edit feedback avoids exact hidden deltas, forcing the model to rely on empirical assays.</p>
235
  </div>
236
  <div class="shadcn-card p-6 bg-slate-50 border-t-4 border-t-emerald-500">
237
- <h4 class="font-bold mb-2">Evidence Multipliers</h4>
238
  <p class="text-xs text-slate-500">Submissions without current potency, toxicity, and synthesis support receive massive penalties.</p>
239
  </div>
240
  <div class="shadcn-card p-6 bg-slate-50 border-t-4 border-t-orange-500">
241
- <h4 class="font-bold mb-2">Budget Efficiency</h4>
242
  <p class="text-xs text-slate-500">Small credits for valid evidence-backed submissions that use less than the allocated budget.</p>
243
  </div>
244
  </div>
245
 
246
  <div class="p-6 bg-indigo-50 border border-indigo-100 rounded-xl text-sm">
247
- <p class="font-bold text-indigo-700 mb-2">The Curriculum Mode Advantage</p>
248
- <p class="text-indigo-900 leading-relaxed">
249
- For early RL, we add <strong>"Partial Credit Breadcrumbs"</strong>. If a model fails to submit but showed good scientific behavior (gathering evidence, designing promising molecules), it receives bounded warmup rewards. This solves the sparse reward problem and teaches the model how to explore before it discovers the terminal submission bonus.
250
- </p>
251
  </div>
252
  </section>
253
 
@@ -306,7 +363,7 @@
306
  <!-- Final Takeaway -->
307
  <section class="mb-32 pt-20 border-t text-center">
308
  <h2 class="text-4xl font-black mb-6 tracking-tight">Final Takeaway</h2>
309
- <p class="text-slate-500 max-w-2xl mx-auto mb-12 text-lg leading-relaxed">
310
  MolForge proves that scientific AI should not be built as a single-shot generator. By grounding the LLM in a <strong>closed-loop scientific environment</strong>, we can train models that respect budget, coordinate with specialists, and base their discoveries on verifiable evidence.
311
  </p>
312
  <div class="flex flex-wrap justify-center gap-4">
 
102
  <div class="prose prose-slate prose-lg max-w-none mb-24 leading-relaxed text-slate-700 border-b pb-20">
103
  <h2 class="text-3xl font-bold text-slate-900 mb-6">Introduction</h2>
104
  <p>
105
+ In traditional drug discovery tasks, LLMs are often asked to "generate a molecule" in a single shot. But science doesn't happen in a vacuum. It happens in the loop—through trial, error, and verification.
106
  </p>
107
  <p class="mt-4">
108
+ <strong>MolForge</strong> is a reinforcement learning environment that simulates a medical oncology discovery lab. It forces the model to navigate real-world constraints: limited budget, molecular toxicity, and synthesis complexity.
109
  </p>
110
 
111
  <div class="mt-12 p-8 bg-slate-900 text-slate-200 rounded-2xl shadow-lg">
 
116
  </div>
117
  </div>
118
 
119
+ <!-- The Scientific Verifier Stack -->
120
+ <section class="mb-32">
121
+ <div class="flex items-center gap-3 mb-8">
122
+ <div class="w-10 h-10 rounded-lg bg-emerald-600 flex items-center justify-center text-white font-bold">🧪</div>
123
+ <h2 class="text-3xl font-bold tracking-tight">The Scientific Verifier Stack</h2>
124
+ </div>
125
+
126
+ <p class="text-slate-600 mb-10 text-lg leading-relaxed">
127
+ MolForge doesn't just predict outcomes; it utilizes multiple simulation layers to ground the model's decisions in chemical and biological reality.
128
+ </p>
129
+
130
+ <div class="grid md:grid-cols-3 gap-6 mb-12">
131
+ <div class="shadcn-card p-8 bg-white hover:border-emerald-500 transition-all cursor-default">
132
+ <div class="w-12 h-12 bg-emerald-50 text-emerald-600 rounded-xl flex items-center justify-center mb-6 text-xl">🧬</div>
133
+ <h4 class="font-bold text-lg mb-3">RDKit</h4>
134
+ <p class="text-sm text-slate-500 leading-relaxed italic">"Keeping molecules physically possible"</p>
135
+ <p class="text-sm text-slate-600 mt-4 leading-relaxed">
136
+ RDKit acts as the fundamental chemistry ruleset. It checks for molecular valency, ensures every edit is chemically plausible, and calculates core descriptors like Lipophilicity and TPSA.
137
+ </p>
138
+ </div>
139
+ <div class="shadcn-card p-8 bg-white hover:border-blue-500 transition-all cursor-default">
140
+ <div class="w-12 h-12 bg-blue-50 text-blue-600 rounded-xl flex items-center justify-center mb-6 text-xl">💊</div>
141
+ <h4 class="font-bold text-lg mb-3">TDC Oracles</h4>
142
+ <p class="text-sm text-slate-500 leading-relaxed italic">"Predicting biomedical fate"</p>
143
+ <p class="text-sm text-slate-600 mt-4 leading-relaxed">
144
+ Utilizing the Therapeutics Data Commons, MolForge predicts real-world ADMET properties, toxicity risks, and synthesizability scores (SA_Score) for every candidate.
145
+ </p>
146
+ </div>
147
+ <div class="shadcn-card p-8 bg-white hover:border-indigo-500 transition-all cursor-default">
148
+ <div class="w-12 h-12 bg-indigo-50 text-indigo-600 rounded-xl flex items-center justify-center mb-6 text-xl">🎯</div>
149
+ <h4 class="font-bold text-lg mb-3">Heuristic Docking</h4>
150
+ <p class="text-sm text-slate-500 leading-relaxed italic">"Simulating receptor-drug fit"</p>
151
+ <p class="text-sm text-slate-600 mt-4 leading-relaxed">
152
+ A fast, physics-inspired simulation that updates potency in milliseconds based on structural pocket matching and receptor complementarity.
153
+ </p>
154
+ </div>
155
+ </div>
156
+
157
+ <div class="p-8 bg-slate-50 border border-slate-200 rounded-2xl">
158
+ <h5 class="font-bold text-slate-900 mb-6 flex items-center gap-2">
159
+ <span class="w-2 h-2 bg-indigo-500 rounded-full"></span>
160
+ The 3 Rules of Potency Simulation
161
+ </h5>
162
+ <div class="grid md:grid-cols-3 gap-8">
163
+ <div class="space-y-2">
164
+ <p class="font-bold text-sm text-slate-800">1. Pocket Matching</p>
165
+ <p class="text-xs text-slate-500 leading-relaxed">Structural fit of the fragment (e.g., azaindole) into the KRAS G12C target pocket.</p>
166
+ </div>
167
+ <div class="space-y-2">
168
+ <p class="font-bold text-sm text-slate-800">2. Lipophilic Match</p>
169
+ <p class="text-xs text-slate-500 leading-relaxed">Targeting the ideal LogP of <strong>3.0</strong> for optimal binding without repulsive clashes.</p>
170
+ </div>
171
+ <div class="space-y-2">
172
+ <p class="font-bold text-sm text-slate-800">3. Polarity Match</p>
173
+ <p class="text-xs text-slate-500 leading-relaxed">Optimizing TPSA toward the ideal <strong>85.0</strong> to avoid polar clashes in hydrophobic pockets.</p>
174
+ </div>
175
+ </div>
176
+ </div>
177
+ </section>
178
+
179
+ <!-- The POMDP Architecture -->
180
  <section class="mb-32">
181
  <div class="flex items-center gap-3 mb-8">
182
  <div class="w-10 h-10 rounded-lg bg-indigo-600 flex items-center justify-center text-white font-bold">1</div>
 
187
  MolForge is built as a <strong>Partially Observable Markov Decision Process (POMDP)</strong>. This means the agent never sees the "hidden truth" of the receptor. It only sees what its budget allows it to assay.
188
  </p>
189
 
190
+ <div class="shadcn-card p-4 bg-slate-50 mb-12 border-dashed">
191
  <img src="assets/molforge_architecture.png" alt="Architecture" class="rounded-lg w-full">
192
  <p class="mt-4 text-center text-xs text-slate-400 font-medium tracking-wide">THE SCIENTIFIC FEEDBACK LOOP: VERIFIER-FIRST DESIGN</p>
193
  </div>
 
290
 
291
  <div class="grid md:grid-cols-3 gap-6 mb-12">
292
  <div class="shadcn-card p-6 bg-slate-50 border-t-4 border-t-indigo-500">
293
+ <h4 class="font-bold mb-2 text-sm uppercase tracking-wider text-slate-500">Coarse Shaping</h4>
294
  <p class="text-xs text-slate-500">Edit feedback avoids exact hidden deltas, forcing the model to rely on empirical assays.</p>
295
  </div>
296
  <div class="shadcn-card p-6 bg-slate-50 border-t-4 border-t-emerald-500">
297
+ <h4 class="font-bold mb-2 text-sm uppercase tracking-wider text-slate-500">Evidence Multipliers</h4>
298
  <p class="text-xs text-slate-500">Submissions without current potency, toxicity, and synthesis support receive massive penalties.</p>
299
  </div>
300
  <div class="shadcn-card p-6 bg-slate-50 border-t-4 border-t-orange-500">
301
+ <h4 class="font-bold mb-2 text-sm uppercase tracking-wider text-slate-500">Budget Efficiency</h4>
302
  <p class="text-xs text-slate-500">Small credits for valid evidence-backed submissions that use less than the allocated budget.</p>
303
  </div>
304
  </div>
305
 
306
  <div class="p-6 bg-indigo-50 border border-indigo-100 rounded-xl text-sm">
307
+ <p class="font-bold text-indigo-700 mb-2 italic">"Curriculum mode is the RL warm-up engine—providing the breadcrumbs needed for the model to discover the submission bonus."</p>
 
 
 
308
  </div>
309
  </section>
310
 
 
363
  <!-- Final Takeaway -->
364
  <section class="mb-32 pt-20 border-t text-center">
365
  <h2 class="text-4xl font-black mb-6 tracking-tight">Final Takeaway</h2>
366
+ <p class="text-slate-500 max-w-2xl mx-auto mb-12 text-lg leading-relaxed text-justify">
367
  MolForge proves that scientific AI should not be built as a single-shot generator. By grounding the LLM in a <strong>closed-loop scientific environment</strong>, we can train models that respect budget, coordinate with specialists, and base their discoveries on verifiable evidence.
368
  </p>
369
  <div class="flex flex-wrap justify-center gap-4">