File size: 2,321 Bytes
e4defca
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
ecabc19
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
<!DOCTYPE html>
<html lang="en">
<head>
    <meta charset="utf-8">
    <meta name="viewport" content="width=device-width, initial-scale=1">
    <title>SpecPrefill on Unified Memory</title>
    <style>
        * { margin: 0; padding: 0; box-sizing: border-box; }
        body { font-family: -apple-system, BlinkMacSystemFont, "Segoe UI", Roboto, sans-serif; background: #f8f9fa; color: #1a1a2e; }
        header { background: #1a1a2e; color: #fff; padding: 2rem; text-align: center; }
        header h1 { font-size: 1.6rem; margin-bottom: 0.5rem; }
        header p { opacity: 0.8; font-size: 0.95rem; max-width: 700px; margin: 0 auto; line-height: 1.5; }
        .links { display: flex; gap: 1rem; justify-content: center; margin-top: 1rem; flex-wrap: wrap; }
        .links a { color: #a8d8ea; text-decoration: none; font-size: 0.9rem; padding: 0.4rem 0.8rem; border: 1px solid #a8d8ea; border-radius: 4px; }
        .links a:hover { background: rgba(168,216,234,0.15); }
        .pdf-container { width: 100%; max-width: 900px; margin: 2rem auto; padding: 0 1rem; }
        .pdf-container iframe { width: 100%; height: 85vh; border: 1px solid #ddd; border-radius: 6px; background: #fff; }
        .download { text-align: center; margin: 1rem 0 2rem; }
        .download a { color: #1a1a2e; font-weight: 500; }
    </style>
</head>
<body>
    <header>
        <h1>SpecPrefill on Unified Memory</h1>
        <p>Cross-Architecture Sparse Prefill for Large Language Models on Apple Silicon. 3.71&ndash;5.45&times; TTFT reduction on Qwen3.5-122B with a 1.4 GB draft model.</p>
        <div class="links">
            <a href="specprefill.pdf">PDF (v1)</a>
            <a href="specprefill-v2.pdf">PDF (v2)</a>
            <a href="specprefill.tex">LaTeX source</a>
            <a href="https://doi.org/10.5281/zenodo.19120919">Zenodo DOI</a>
            <a href="https://github.com/waybarrios/vllm-mlx/pull/180">PR #180 (upstream)</a>
            <a href="https://github.com/Thump604">GitHub</a>
        </div>
    </header>
    <div class="pdf-container">
        <iframe src="specprefill-v2.pdf" title="SpecPrefill paper"></iframe>
    </div>
    <div class="download">
        <a href="specprefill-v2.pdf" download>Download PDF</a> &middot; <a href="specprefill.tex" download>Download LaTeX</a>
    </div>
</body>
</html>