Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Log In
Sign Up
Buckets:
nielsr
/
arxiv-chandra-ocr-full-merged-20260406
Files
xet
nielsr/arxiv-chandra-ocr-full-merged-20260406
/
data
1.14 GB
2,774 files
Updated 17 days ago
Ctrl+K
Name
Size
Uploaded
Xet hash
part-00000.jsonl.gz
284 kB
xet
17 days ago
5112464a
part-00001.jsonl.gz
381 kB
xet
17 days ago
40eb6bae
part-00002.jsonl.gz
242 kB
xet
17 days ago
91e10c66
part-00003.jsonl.gz
371 kB
xet
17 days ago
a3d62321
part-00004.jsonl.gz
320 kB
xet
17 days ago
1081c313
part-00005.jsonl.gz
312 kB
xet
17 days ago
ac7f1549
part-00006.jsonl.gz
397 kB
xet
17 days ago
d7c1d150
part-00007.jsonl.gz
315 kB
xet
17 days ago
781149e4
part-00008.jsonl.gz
342 kB
xet
17 days ago
62ce4b5f
part-00009.jsonl.gz
385 kB
xet
17 days ago
83b07d7e
part-00010.jsonl.gz
347 kB
xet
17 days ago
30379d46
part-00011.jsonl.gz
421 kB
xet
17 days ago
87785bf0
part-00012.jsonl.gz
305 kB
xet
17 days ago
16f3a874
part-00013.jsonl.gz
309 kB
xet
17 days ago
b1395b55
part-00014.jsonl.gz
181 kB
xet
17 days ago
1e3a1cb8
part-00015.jsonl.gz
277 kB
xet
17 days ago
0ed18f94
part-00016.jsonl.gz
456 kB
xet
17 days ago
cf79a14b
part-00017.jsonl.gz
372 kB
xet
17 days ago
b710d15e
part-00018.jsonl.gz
413 kB
xet
17 days ago
e407968e
part-00019.jsonl.gz
320 kB
xet
17 days ago
d30fb22f
part-00020.jsonl.gz
253 kB
xet
17 days ago
d2a460df
part-00021.jsonl.gz
271 kB
xet
17 days ago
13054a6e
part-00022.jsonl.gz
445 kB
xet
17 days ago
f5cab456
part-00023.jsonl.gz
362 kB
xet
17 days ago
b7bd3b2d
part-00024.jsonl.gz
189 kB
xet
17 days ago
de5e01f8
part-00025.jsonl.gz
305 kB
xet
17 days ago
9999557b
part-00026.jsonl.gz
239 kB
xet
17 days ago
f825f79e
part-00027.jsonl.gz
390 kB
xet
17 days ago
9875f8af
part-00028.jsonl.gz
240 kB
xet
17 days ago
20319e60
part-00029.jsonl.gz
310 kB
xet
17 days ago
6950042b
part-00030.jsonl.gz
285 kB
xet
17 days ago
db953b9d
part-00031.jsonl.gz
285 kB
xet
17 days ago
5088c48f
part-00032.jsonl.gz
321 kB
xet
17 days ago
c0f10de6
part-00033.jsonl.gz
384 kB
xet
17 days ago
415a8527
part-00034.jsonl.gz
304 kB
xet
17 days ago
15d3ae1e
part-00035.jsonl.gz
327 kB
xet
17 days ago
e0d204ef
part-00036.jsonl.gz
328 kB
xet
17 days ago
ec7c2365
part-00037.jsonl.gz
374 kB
xet
17 days ago
602e2b23
part-00038.jsonl.gz
268 kB
xet
17 days ago
5e96ee3c
part-00039.jsonl.gz
410 kB
xet
17 days ago
685fc6b2
part-00040.jsonl.gz
422 kB
xet
17 days ago
cc760440
part-00041.jsonl.gz
420 kB
xet
17 days ago
87bfa9cb
part-00042.jsonl.gz
331 kB
xet
17 days ago
adc3b46c
part-00043.jsonl.gz
431 kB
xet
17 days ago
0e4d4527
part-00044.jsonl.gz
418 kB
xet
17 days ago
5476ff4d
part-00045.jsonl.gz
352 kB
xet
17 days ago
aec207ae
part-00046.jsonl.gz
413 kB
xet
17 days ago
677b62fe
part-00047.jsonl.gz
303 kB
xet
17 days ago
c2c56846
part-00048.jsonl.gz
305 kB
xet
17 days ago
3dd93709
part-00049.jsonl.gz
334 kB
xet
17 days ago
65104e85
part-00050.jsonl.gz
347 kB
xet
17 days ago
ff53d380
part-00051.jsonl.gz
283 kB
xet
17 days ago
4314f1b7
part-00052.jsonl.gz
379 kB
xet
17 days ago
d68a1896
part-00053.jsonl.gz
418 kB
xet
17 days ago
4cc6d20d
part-00054.jsonl.gz
253 kB
xet
17 days ago
17165452
part-00055.jsonl.gz
344 kB
xet
17 days ago
a6e2b52a
part-00056.jsonl.gz
318 kB
xet
17 days ago
c9cc660b
part-00057.jsonl.gz
306 kB
xet
17 days ago
ee5e2965
part-00058.jsonl.gz
248 kB
xet
17 days ago
445d49b2
part-00059.jsonl.gz
339 kB
xet
17 days ago
c40d7602
part-00060.jsonl.gz
413 kB
xet
17 days ago
854a92fb
part-00061.jsonl.gz
385 kB
xet
17 days ago
21143268
part-00062.jsonl.gz
338 kB
xet
17 days ago
ffab68fe
part-00063.jsonl.gz
378 kB
xet
17 days ago
cb130613
part-00064.jsonl.gz
302 kB
xet
17 days ago
618a829e
part-00065.jsonl.gz
322 kB
xet
17 days ago
c5677d1f
part-00066.jsonl.gz
319 kB
xet
17 days ago
3a036eab
part-00067.jsonl.gz
333 kB
xet
17 days ago
07443655
part-00068.jsonl.gz
342 kB
xet
17 days ago
1f84f7e9
part-00069.jsonl.gz
310 kB
xet
17 days ago
be82d741
part-00070.jsonl.gz
203 kB
xet
17 days ago
73b8bfef
part-00071.jsonl.gz
345 kB
xet
17 days ago
fe8164f4
part-00072.jsonl.gz
346 kB
xet
17 days ago
cb3be411
part-00073.jsonl.gz
378 kB
xet
17 days ago
5a98ebe9
part-00074.jsonl.gz
288 kB
xet
17 days ago
2679da11
part-00075.jsonl.gz
302 kB
xet
17 days ago
c69fba45
part-00076.jsonl.gz
417 kB
xet
17 days ago
0d1708ce
part-00077.jsonl.gz
285 kB
xet
17 days ago
55d9160d
part-00078.jsonl.gz
235 kB
xet
17 days ago
a14f375d
part-00079.jsonl.gz
315 kB
xet
17 days ago
fb785778
part-00080.jsonl.gz
291 kB
xet
17 days ago
ce635c3f
part-00081.jsonl.gz
347 kB
xet
17 days ago
957fee0f
part-00082.jsonl.gz
314 kB
xet
17 days ago
028c3614
part-00083.jsonl.gz
323 kB
xet
17 days ago
baf25280
part-00084.jsonl.gz
350 kB
xet
17 days ago
5c9e7ada
part-00085.jsonl.gz
316 kB
xet
17 days ago
3c589121
part-00086.jsonl.gz
326 kB
xet
17 days ago
acb7d14e
part-00087.jsonl.gz
286 kB
xet
17 days ago
7ace3eed
part-00088.jsonl.gz
328 kB
xet
17 days ago
67a79490
part-00089.jsonl.gz
321 kB
xet
17 days ago
4094e66f
part-00090.jsonl.gz
308 kB
xet
17 days ago
e82314f0
part-00091.jsonl.gz
280 kB
xet
17 days ago
157c6d30
part-00092.jsonl.gz
278 kB
xet
17 days ago
2e336764
part-00093.jsonl.gz
400 kB
xet
17 days ago
2326b5f9
part-00094.jsonl.gz
279 kB
xet
17 days ago
3152ee3f
part-00095.jsonl.gz
311 kB
xet
17 days ago
f5669965
part-00096.jsonl.gz
296 kB
xet
17 days ago
8977718d
part-00097.jsonl.gz
334 kB
xet
17 days ago
4051704e
part-00098.jsonl.gz
378 kB
xet
17 days ago
a3e10beb
part-00099.jsonl.gz
296 kB
xet
17 days ago
f2fc05ca
Load more
Use this bucket
Total size
1.14 GB
Files
2,774
Last updated
Apr 6
Pre-warmed CDN
US
EU
US
EU
Contributors