File size: 5,136 Bytes
f866820
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
# RAG Document Assistant

> **Search your documents with AI. Your data stays yours.**

[![Live Demo](https://img.shields.io/badge/Live%20Demo-Try%20It-blue)](https://rag-document-assistant.vercel.app/)
[![Privacy](https://img.shields.io/badge/Privacy-Zero%20Storage-green)](#zero-storage-guarantee)

---

## The Problem

Organizations need AI-powered document search but face a critical barrier: **data privacy**.

Traditional solutions require uploading sensitive documents to third-party servers, creating:
- Compliance risks (GDPR, HIPAA, SOX)
- Data breach exposure
- Loss of control over proprietary information

---

## Our Solution: Zero-Storage Architecture

**Your documents stay in YOUR cloud storage. Always.**

```
β”Œβ”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”
β”‚              INDEXING (one-time setup)                       β”‚
β”œβ”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€
β”‚   YOUR BROWSER                         OUR SERVER            β”‚
β”‚                                                              β”‚
β”‚   1. Files loaded from YOUR Dropbox                          β”‚
β”‚              β”‚                                               β”‚
β”‚              β–Ό                                               β”‚
β”‚   2. Text chunked locally ─────────→  3. Only embeddings +   β”‚
β”‚              β”‚                           file positions      β”‚
β”‚              β–Ό                           stored              β”‚
β”‚   4. Original text PURGED                                    β”‚
β”‚      (no trace remains)                                      β”‚
β””β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”˜

β”Œβ”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”
β”‚              QUERY TIME (every search)                       β”‚
β”œβ”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€
β”‚                                                              β”‚
β”‚   Your Question ──→ Find matching ──→ Re-fetch text from    β”‚
β”‚                     embeddings        YOUR Dropbox           β”‚
β”‚                          β”‚                   β”‚               β”‚
β”‚                          β–Ό                   β–Ό               β”‚
β”‚                     File paths ───→ Generate answer          β”‚
β”‚                     + positions      (text never stored)     β”‚
β”‚                                                              β”‚
β””β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”˜
```

### Zero-Storage Guarantee

| What Happens | Where | Stored? |
|--------------|-------|---------|
| File reading | Your browser | No |
| Text chunking | Your browser | No |
| Original text | Never | No |
| Embeddings | Server | Yes (irreversible) |
| File paths + positions | Server | Yes (for re-fetch) |
| Text at query time | Re-fetched from YOUR Dropbox | No |

**Why this matters:** Your document content is NEVER stored on our servers. At query time, we use the file paths and positions to re-fetch the exact text from YOUR Dropbox. Disconnect Dropbox = queries stop working = complete data control.

---

## Why This Matters

### For Compliance Teams
- No data leaves user devices = No data breach risk
- Embeddings are not personal data = GDPR-friendly
- Audit-ready architecture

### For Security Teams
- Zero attack surface on document content
- No sensitive data in transit or at rest
- Client-side processing eliminates server vulnerabilities

### For Business Leaders
- Use AI document search without compliance reviews
- No vendor lock-in on your data
- Deploy with confidence

---

## How It Works

1. **Connect Dropbox** - OAuth authentication (we never see credentials)
2. **Select Files** - Choose .txt, .md, or .pdf files (up to 5 MB)
3. **Index** - Text processed in your browser, only embeddings sent to server
4. **Search** - Ask questions in natural language
5. **Get Answers** - Receive cited responses from your indexed content

---

## Use Cases

- **Compliance**: Search regulatory documents without data exposure
- **Legal**: Query contracts while maintaining privilege
- **HR**: Access policy documents securely
- **Research**: Search proprietary research without leakage

---

## Try It Now

**[Launch Demo](https://rag-document-assistant.vercel.app/)**

Your documents. Your device. Your privacy.