Spaces:
Running
Running
File size: 2,321 Bytes
e4defca ecabc19 | 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 | <!DOCTYPE html>
<html lang="en">
<head>
<meta charset="utf-8">
<meta name="viewport" content="width=device-width, initial-scale=1">
<title>SpecPrefill on Unified Memory</title>
<style>
* { margin: 0; padding: 0; box-sizing: border-box; }
body { font-family: -apple-system, BlinkMacSystemFont, "Segoe UI", Roboto, sans-serif; background: #f8f9fa; color: #1a1a2e; }
header { background: #1a1a2e; color: #fff; padding: 2rem; text-align: center; }
header h1 { font-size: 1.6rem; margin-bottom: 0.5rem; }
header p { opacity: 0.8; font-size: 0.95rem; max-width: 700px; margin: 0 auto; line-height: 1.5; }
.links { display: flex; gap: 1rem; justify-content: center; margin-top: 1rem; flex-wrap: wrap; }
.links a { color: #a8d8ea; text-decoration: none; font-size: 0.9rem; padding: 0.4rem 0.8rem; border: 1px solid #a8d8ea; border-radius: 4px; }
.links a:hover { background: rgba(168,216,234,0.15); }
.pdf-container { width: 100%; max-width: 900px; margin: 2rem auto; padding: 0 1rem; }
.pdf-container iframe { width: 100%; height: 85vh; border: 1px solid #ddd; border-radius: 6px; background: #fff; }
.download { text-align: center; margin: 1rem 0 2rem; }
.download a { color: #1a1a2e; font-weight: 500; }
</style>
</head>
<body>
<header>
<h1>SpecPrefill on Unified Memory</h1>
<p>Cross-Architecture Sparse Prefill for Large Language Models on Apple Silicon. 3.71–5.45× TTFT reduction on Qwen3.5-122B with a 1.4 GB draft model.</p>
<div class="links">
<a href="specprefill.pdf">PDF (v1)</a>
<a href="specprefill-v2.pdf">PDF (v2)</a>
<a href="specprefill.tex">LaTeX source</a>
<a href="https://doi.org/10.5281/zenodo.19120919">Zenodo DOI</a>
<a href="https://github.com/waybarrios/vllm-mlx/pull/180">PR #180 (upstream)</a>
<a href="https://github.com/Thump604">GitHub</a>
</div>
</header>
<div class="pdf-container">
<iframe src="specprefill-v2.pdf" title="SpecPrefill paper"></iframe>
</div>
<div class="download">
<a href="specprefill-v2.pdf" download>Download PDF</a> · <a href="specprefill.tex" download>Download LaTeX</a>
</div>
</body>
</html>
|