Spaces:
Running
Running
Upload folder using huggingface_hub
Browse files- index.html +69 -12
index.html
CHANGED
|
@@ -102,10 +102,10 @@
|
|
| 102 |
<div class="prose prose-slate prose-lg max-w-none mb-24 leading-relaxed text-slate-700 border-b pb-20">
|
| 103 |
<h2 class="text-3xl font-bold text-slate-900 mb-6">Introduction</h2>
|
| 104 |
<p>
|
| 105 |
-
|
| 106 |
</p>
|
| 107 |
<p class="mt-4">
|
| 108 |
-
<strong>MolForge</strong> is a
|
| 109 |
</p>
|
| 110 |
|
| 111 |
<div class="mt-12 p-8 bg-slate-900 text-slate-200 rounded-2xl shadow-lg">
|
|
@@ -116,7 +116,67 @@
|
|
| 116 |
</div>
|
| 117 |
</div>
|
| 118 |
|
| 119 |
-
<!-- Scientific
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 120 |
<section class="mb-32">
|
| 121 |
<div class="flex items-center gap-3 mb-8">
|
| 122 |
<div class="w-10 h-10 rounded-lg bg-indigo-600 flex items-center justify-center text-white font-bold">1</div>
|
|
@@ -127,7 +187,7 @@
|
|
| 127 |
MolForge is built as a <strong>Partially Observable Markov Decision Process (POMDP)</strong>. This means the agent never sees the "hidden truth" of the receptor. It only sees what its budget allows it to assay.
|
| 128 |
</p>
|
| 129 |
|
| 130 |
-
<div class="shadcn-card p-4 bg-slate-50 mb-12">
|
| 131 |
<img src="assets/molforge_architecture.png" alt="Architecture" class="rounded-lg w-full">
|
| 132 |
<p class="mt-4 text-center text-xs text-slate-400 font-medium tracking-wide">THE SCIENTIFIC FEEDBACK LOOP: VERIFIER-FIRST DESIGN</p>
|
| 133 |
</div>
|
|
@@ -230,24 +290,21 @@
|
|
| 230 |
|
| 231 |
<div class="grid md:grid-cols-3 gap-6 mb-12">
|
| 232 |
<div class="shadcn-card p-6 bg-slate-50 border-t-4 border-t-indigo-500">
|
| 233 |
-
<h4 class="font-bold mb-2">Coarse Shaping</h4>
|
| 234 |
<p class="text-xs text-slate-500">Edit feedback avoids exact hidden deltas, forcing the model to rely on empirical assays.</p>
|
| 235 |
</div>
|
| 236 |
<div class="shadcn-card p-6 bg-slate-50 border-t-4 border-t-emerald-500">
|
| 237 |
-
<h4 class="font-bold mb-2">Evidence Multipliers</h4>
|
| 238 |
<p class="text-xs text-slate-500">Submissions without current potency, toxicity, and synthesis support receive massive penalties.</p>
|
| 239 |
</div>
|
| 240 |
<div class="shadcn-card p-6 bg-slate-50 border-t-4 border-t-orange-500">
|
| 241 |
-
<h4 class="font-bold mb-2">Budget Efficiency</h4>
|
| 242 |
<p class="text-xs text-slate-500">Small credits for valid evidence-backed submissions that use less than the allocated budget.</p>
|
| 243 |
</div>
|
| 244 |
</div>
|
| 245 |
|
| 246 |
<div class="p-6 bg-indigo-50 border border-indigo-100 rounded-xl text-sm">
|
| 247 |
-
<p class="font-bold text-indigo-700 mb-2">
|
| 248 |
-
<p class="text-indigo-900 leading-relaxed">
|
| 249 |
-
For early RL, we add <strong>"Partial Credit Breadcrumbs"</strong>. If a model fails to submit but showed good scientific behavior (gathering evidence, designing promising molecules), it receives bounded warmup rewards. This solves the sparse reward problem and teaches the model how to explore before it discovers the terminal submission bonus.
|
| 250 |
-
</p>
|
| 251 |
</div>
|
| 252 |
</section>
|
| 253 |
|
|
@@ -306,7 +363,7 @@
|
|
| 306 |
<!-- Final Takeaway -->
|
| 307 |
<section class="mb-32 pt-20 border-t text-center">
|
| 308 |
<h2 class="text-4xl font-black mb-6 tracking-tight">Final Takeaway</h2>
|
| 309 |
-
<p class="text-slate-500 max-w-2xl mx-auto mb-12 text-lg leading-relaxed">
|
| 310 |
MolForge proves that scientific AI should not be built as a single-shot generator. By grounding the LLM in a <strong>closed-loop scientific environment</strong>, we can train models that respect budget, coordinate with specialists, and base their discoveries on verifiable evidence.
|
| 311 |
</p>
|
| 312 |
<div class="flex flex-wrap justify-center gap-4">
|
|
|
|
| 102 |
<div class="prose prose-slate prose-lg max-w-none mb-24 leading-relaxed text-slate-700 border-b pb-20">
|
| 103 |
<h2 class="text-3xl font-bold text-slate-900 mb-6">Introduction</h2>
|
| 104 |
<p>
|
| 105 |
+
In traditional drug discovery tasks, LLMs are often asked to "generate a molecule" in a single shot. But science doesn't happen in a vacuum. It happens in the loop—through trial, error, and verification.
|
| 106 |
</p>
|
| 107 |
<p class="mt-4">
|
| 108 |
+
<strong>MolForge</strong> is a reinforcement learning environment that simulates a medical oncology discovery lab. It forces the model to navigate real-world constraints: limited budget, molecular toxicity, and synthesis complexity.
|
| 109 |
</p>
|
| 110 |
|
| 111 |
<div class="mt-12 p-8 bg-slate-900 text-slate-200 rounded-2xl shadow-lg">
|
|
|
|
| 116 |
</div>
|
| 117 |
</div>
|
| 118 |
|
| 119 |
+
<!-- The Scientific Verifier Stack -->
|
| 120 |
+
<section class="mb-32">
|
| 121 |
+
<div class="flex items-center gap-3 mb-8">
|
| 122 |
+
<div class="w-10 h-10 rounded-lg bg-emerald-600 flex items-center justify-center text-white font-bold">🧪</div>
|
| 123 |
+
<h2 class="text-3xl font-bold tracking-tight">The Scientific Verifier Stack</h2>
|
| 124 |
+
</div>
|
| 125 |
+
|
| 126 |
+
<p class="text-slate-600 mb-10 text-lg leading-relaxed">
|
| 127 |
+
MolForge doesn't just predict outcomes; it utilizes multiple simulation layers to ground the model's decisions in chemical and biological reality.
|
| 128 |
+
</p>
|
| 129 |
+
|
| 130 |
+
<div class="grid md:grid-cols-3 gap-6 mb-12">
|
| 131 |
+
<div class="shadcn-card p-8 bg-white hover:border-emerald-500 transition-all cursor-default">
|
| 132 |
+
<div class="w-12 h-12 bg-emerald-50 text-emerald-600 rounded-xl flex items-center justify-center mb-6 text-xl">🧬</div>
|
| 133 |
+
<h4 class="font-bold text-lg mb-3">RDKit</h4>
|
| 134 |
+
<p class="text-sm text-slate-500 leading-relaxed italic">"Keeping molecules physically possible"</p>
|
| 135 |
+
<p class="text-sm text-slate-600 mt-4 leading-relaxed">
|
| 136 |
+
RDKit acts as the fundamental chemistry ruleset. It checks for molecular valency, ensures every edit is chemically plausible, and calculates core descriptors like Lipophilicity and TPSA.
|
| 137 |
+
</p>
|
| 138 |
+
</div>
|
| 139 |
+
<div class="shadcn-card p-8 bg-white hover:border-blue-500 transition-all cursor-default">
|
| 140 |
+
<div class="w-12 h-12 bg-blue-50 text-blue-600 rounded-xl flex items-center justify-center mb-6 text-xl">💊</div>
|
| 141 |
+
<h4 class="font-bold text-lg mb-3">TDC Oracles</h4>
|
| 142 |
+
<p class="text-sm text-slate-500 leading-relaxed italic">"Predicting biomedical fate"</p>
|
| 143 |
+
<p class="text-sm text-slate-600 mt-4 leading-relaxed">
|
| 144 |
+
Utilizing the Therapeutics Data Commons, MolForge predicts real-world ADMET properties, toxicity risks, and synthesizability scores (SA_Score) for every candidate.
|
| 145 |
+
</p>
|
| 146 |
+
</div>
|
| 147 |
+
<div class="shadcn-card p-8 bg-white hover:border-indigo-500 transition-all cursor-default">
|
| 148 |
+
<div class="w-12 h-12 bg-indigo-50 text-indigo-600 rounded-xl flex items-center justify-center mb-6 text-xl">🎯</div>
|
| 149 |
+
<h4 class="font-bold text-lg mb-3">Heuristic Docking</h4>
|
| 150 |
+
<p class="text-sm text-slate-500 leading-relaxed italic">"Simulating receptor-drug fit"</p>
|
| 151 |
+
<p class="text-sm text-slate-600 mt-4 leading-relaxed">
|
| 152 |
+
A fast, physics-inspired simulation that updates potency in milliseconds based on structural pocket matching and receptor complementarity.
|
| 153 |
+
</p>
|
| 154 |
+
</div>
|
| 155 |
+
</div>
|
| 156 |
+
|
| 157 |
+
<div class="p-8 bg-slate-50 border border-slate-200 rounded-2xl">
|
| 158 |
+
<h5 class="font-bold text-slate-900 mb-6 flex items-center gap-2">
|
| 159 |
+
<span class="w-2 h-2 bg-indigo-500 rounded-full"></span>
|
| 160 |
+
The 3 Rules of Potency Simulation
|
| 161 |
+
</h5>
|
| 162 |
+
<div class="grid md:grid-cols-3 gap-8">
|
| 163 |
+
<div class="space-y-2">
|
| 164 |
+
<p class="font-bold text-sm text-slate-800">1. Pocket Matching</p>
|
| 165 |
+
<p class="text-xs text-slate-500 leading-relaxed">Structural fit of the fragment (e.g., azaindole) into the KRAS G12C target pocket.</p>
|
| 166 |
+
</div>
|
| 167 |
+
<div class="space-y-2">
|
| 168 |
+
<p class="font-bold text-sm text-slate-800">2. Lipophilic Match</p>
|
| 169 |
+
<p class="text-xs text-slate-500 leading-relaxed">Targeting the ideal LogP of <strong>3.0</strong> for optimal binding without repulsive clashes.</p>
|
| 170 |
+
</div>
|
| 171 |
+
<div class="space-y-2">
|
| 172 |
+
<p class="font-bold text-sm text-slate-800">3. Polarity Match</p>
|
| 173 |
+
<p class="text-xs text-slate-500 leading-relaxed">Optimizing TPSA toward the ideal <strong>85.0</strong> to avoid polar clashes in hydrophobic pockets.</p>
|
| 174 |
+
</div>
|
| 175 |
+
</div>
|
| 176 |
+
</div>
|
| 177 |
+
</section>
|
| 178 |
+
|
| 179 |
+
<!-- The POMDP Architecture -->
|
| 180 |
<section class="mb-32">
|
| 181 |
<div class="flex items-center gap-3 mb-8">
|
| 182 |
<div class="w-10 h-10 rounded-lg bg-indigo-600 flex items-center justify-center text-white font-bold">1</div>
|
|
|
|
| 187 |
MolForge is built as a <strong>Partially Observable Markov Decision Process (POMDP)</strong>. This means the agent never sees the "hidden truth" of the receptor. It only sees what its budget allows it to assay.
|
| 188 |
</p>
|
| 189 |
|
| 190 |
+
<div class="shadcn-card p-4 bg-slate-50 mb-12 border-dashed">
|
| 191 |
<img src="assets/molforge_architecture.png" alt="Architecture" class="rounded-lg w-full">
|
| 192 |
<p class="mt-4 text-center text-xs text-slate-400 font-medium tracking-wide">THE SCIENTIFIC FEEDBACK LOOP: VERIFIER-FIRST DESIGN</p>
|
| 193 |
</div>
|
|
|
|
| 290 |
|
| 291 |
<div class="grid md:grid-cols-3 gap-6 mb-12">
|
| 292 |
<div class="shadcn-card p-6 bg-slate-50 border-t-4 border-t-indigo-500">
|
| 293 |
+
<h4 class="font-bold mb-2 text-sm uppercase tracking-wider text-slate-500">Coarse Shaping</h4>
|
| 294 |
<p class="text-xs text-slate-500">Edit feedback avoids exact hidden deltas, forcing the model to rely on empirical assays.</p>
|
| 295 |
</div>
|
| 296 |
<div class="shadcn-card p-6 bg-slate-50 border-t-4 border-t-emerald-500">
|
| 297 |
+
<h4 class="font-bold mb-2 text-sm uppercase tracking-wider text-slate-500">Evidence Multipliers</h4>
|
| 298 |
<p class="text-xs text-slate-500">Submissions without current potency, toxicity, and synthesis support receive massive penalties.</p>
|
| 299 |
</div>
|
| 300 |
<div class="shadcn-card p-6 bg-slate-50 border-t-4 border-t-orange-500">
|
| 301 |
+
<h4 class="font-bold mb-2 text-sm uppercase tracking-wider text-slate-500">Budget Efficiency</h4>
|
| 302 |
<p class="text-xs text-slate-500">Small credits for valid evidence-backed submissions that use less than the allocated budget.</p>
|
| 303 |
</div>
|
| 304 |
</div>
|
| 305 |
|
| 306 |
<div class="p-6 bg-indigo-50 border border-indigo-100 rounded-xl text-sm">
|
| 307 |
+
<p class="font-bold text-indigo-700 mb-2 italic">"Curriculum mode is the RL warm-up engine—providing the breadcrumbs needed for the model to discover the submission bonus."</p>
|
|
|
|
|
|
|
|
|
|
| 308 |
</div>
|
| 309 |
</section>
|
| 310 |
|
|
|
|
| 363 |
<!-- Final Takeaway -->
|
| 364 |
<section class="mb-32 pt-20 border-t text-center">
|
| 365 |
<h2 class="text-4xl font-black mb-6 tracking-tight">Final Takeaway</h2>
|
| 366 |
+
<p class="text-slate-500 max-w-2xl mx-auto mb-12 text-lg leading-relaxed text-justify">
|
| 367 |
MolForge proves that scientific AI should not be built as a single-shot generator. By grounding the LLM in a <strong>closed-loop scientific environment</strong>, we can train models that respect budget, coordinate with specialists, and base their discoveries on verifiable evidence.
|
| 368 |
</p>
|
| 369 |
<div class="flex flex-wrap justify-center gap-4">
|