Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Log In
Sign Up
Buckets:
nielsr
/
arxiv-chandra-ocr-full-20260402-l40sx1-s14
Files
xet
nielsr/arxiv-chandra-ocr-full-20260402-l40sx1-s14
/
data
81.2 MB
181 files
Updated 19 days ago
Ctrl+K
Name
Size
Uploaded
Xet hash
part-00000.jsonl.gz
271 kB
xet
21 days ago
b1e4797b
part-00001.jsonl.gz
502 kB
xet
21 days ago
87e05dc0
part-00002.jsonl.gz
430 kB
xet
21 days ago
ef92e5d5
part-00003.jsonl.gz
498 kB
xet
21 days ago
0b14b6b0
part-00004.jsonl.gz
321 kB
xet
21 days ago
c32c0033
part-00005.jsonl.gz
407 kB
xet
21 days ago
4db5b17c
part-00006.jsonl.gz
379 kB
xet
21 days ago
a1ec3995
part-00007.jsonl.gz
461 kB
xet
21 days ago
519b34ab
part-00008.jsonl.gz
543 kB
xet
21 days ago
075be034
part-00009.jsonl.gz
513 kB
xet
21 days ago
909fd2db
part-00010.jsonl.gz
512 kB
xet
21 days ago
c8d508e7
part-00011.jsonl.gz
378 kB
xet
21 days ago
fa80a9c3
part-00012.jsonl.gz
508 kB
xet
21 days ago
b4f9df25
part-00013.jsonl.gz
450 kB
xet
21 days ago
f307cc65
part-00014.jsonl.gz
445 kB
xet
21 days ago
d42000f0
part-00015.jsonl.gz
404 kB
xet
21 days ago
f4b0b633
part-00016.jsonl.gz
394 kB
xet
21 days ago
d1392003
part-00017.jsonl.gz
442 kB
xet
21 days ago
c98409bd
part-00018.jsonl.gz
464 kB
xet
21 days ago
296b4f6c
part-00019.jsonl.gz
483 kB
xet
21 days ago
a1a10cc7
part-00020.jsonl.gz
421 kB
xet
21 days ago
018410b4
part-00021.jsonl.gz
406 kB
xet
21 days ago
8400cbbd
part-00022.jsonl.gz
478 kB
xet
21 days ago
5daddb72
part-00023.jsonl.gz
419 kB
xet
21 days ago
0098e8e9
part-00024.jsonl.gz
553 kB
xet
21 days ago
33ed3329
part-00025.jsonl.gz
466 kB
xet
21 days ago
af8a8d72
part-00026.jsonl.gz
358 kB
xet
21 days ago
d91f6d14
part-00027.jsonl.gz
391 kB
xet
21 days ago
7b66d1f3
part-00028.jsonl.gz
436 kB
xet
21 days ago
e1d9c46a
part-00029.jsonl.gz
479 kB
xet
21 days ago
8f8d906b
part-00030.jsonl.gz
536 kB
xet
21 days ago
63986c61
part-00031.jsonl.gz
327 kB
xet
21 days ago
3854370f
part-00032.jsonl.gz
431 kB
xet
21 days ago
a1ea2147
part-00033.jsonl.gz
538 kB
xet
21 days ago
76abae47
part-00034.jsonl.gz
527 kB
xet
21 days ago
5f92f5bb
part-00035.jsonl.gz
433 kB
xet
21 days ago
dce138a6
part-00036.jsonl.gz
501 kB
xet
21 days ago
1fb9206d
part-00037.jsonl.gz
344 kB
xet
21 days ago
c5f86b5a
part-00038.jsonl.gz
415 kB
xet
21 days ago
5316f13f
part-00039.jsonl.gz
324 kB
xet
21 days ago
51c8c36a
part-00040.jsonl.gz
370 kB
xet
21 days ago
3fc720ee
part-00041.jsonl.gz
352 kB
xet
21 days ago
0cdaac12
part-00042.jsonl.gz
483 kB
xet
21 days ago
b9907f76
part-00043.jsonl.gz
349 kB
xet
21 days ago
11d6c125
part-00044.jsonl.gz
300 kB
xet
21 days ago
6aaeb6e5
part-00045.jsonl.gz
504 kB
xet
21 days ago
28072c29
part-00046.jsonl.gz
449 kB
xet
21 days ago
3bb28b5b
part-00047.jsonl.gz
375 kB
xet
21 days ago
7295161c
part-00048.jsonl.gz
524 kB
xet
21 days ago
333b09a7
part-00049.jsonl.gz
362 kB
xet
20 days ago
afebf43b
part-00050.jsonl.gz
386 kB
xet
20 days ago
bd3086b3
part-00051.jsonl.gz
448 kB
xet
20 days ago
9faaf716
part-00052.jsonl.gz
455 kB
xet
20 days ago
1583d00b
part-00053.jsonl.gz
472 kB
xet
20 days ago
9d071567
part-00054.jsonl.gz
454 kB
xet
20 days ago
c98c7337
part-00055.jsonl.gz
454 kB
xet
20 days ago
51cfced2
part-00056.jsonl.gz
363 kB
xet
20 days ago
1bd15790
part-00057.jsonl.gz
412 kB
xet
20 days ago
44b6b17a
part-00058.jsonl.gz
404 kB
xet
20 days ago
7eacb964
part-00059.jsonl.gz
418 kB
xet
20 days ago
95e84bc5
part-00060.jsonl.gz
382 kB
xet
20 days ago
bd88e3c5
part-00061.jsonl.gz
395 kB
xet
20 days ago
898ee275
part-00062.jsonl.gz
535 kB
xet
20 days ago
613ea345
part-00063.jsonl.gz
489 kB
xet
20 days ago
4330936d
part-00064.jsonl.gz
448 kB
xet
20 days ago
2cbffc22
part-00065.jsonl.gz
409 kB
xet
20 days ago
118d0c29
part-00066.jsonl.gz
452 kB
xet
20 days ago
39d24b7c
part-00067.jsonl.gz
495 kB
xet
20 days ago
66a8696d
part-00068.jsonl.gz
391 kB
xet
20 days ago
d62ad372
part-00069.jsonl.gz
291 kB
xet
20 days ago
9a963280
part-00070.jsonl.gz
433 kB
xet
20 days ago
2a42bf06
part-00071.jsonl.gz
336 kB
xet
20 days ago
664f6424
part-00072.jsonl.gz
446 kB
xet
20 days ago
0753e956
part-00073.jsonl.gz
380 kB
xet
20 days ago
54b00c7d
part-00074.jsonl.gz
432 kB
xet
20 days ago
fbca61f1
part-00075.jsonl.gz
388 kB
xet
20 days ago
f8d1fd1b
part-00076.jsonl.gz
524 kB
xet
20 days ago
aee7de16
part-00077.jsonl.gz
347 kB
xet
20 days ago
062be1a8
part-00078.jsonl.gz
400 kB
xet
20 days ago
f64e6a9e
part-00079.jsonl.gz
498 kB
xet
20 days ago
ec1c097b
part-00080.jsonl.gz
435 kB
xet
20 days ago
d0e2fba8
part-00081.jsonl.gz
468 kB
xet
20 days ago
04c5bc17
part-00082.jsonl.gz
404 kB
xet
20 days ago
5d8c3d94
part-00083.jsonl.gz
562 kB
xet
20 days ago
a54722bc
part-00084.jsonl.gz
467 kB
xet
20 days ago
d0f9eac9
part-00085.jsonl.gz
451 kB
xet
20 days ago
b9684d68
part-00086.jsonl.gz
334 kB
xet
20 days ago
8e5e1fde
part-00087.jsonl.gz
483 kB
xet
20 days ago
48a8b934
part-00088.jsonl.gz
450 kB
xet
20 days ago
a978a40c
part-00089.jsonl.gz
519 kB
xet
20 days ago
9314ac9a
part-00090.jsonl.gz
504 kB
xet
20 days ago
02ea3127
part-00091.jsonl.gz
466 kB
xet
20 days ago
3a677571
part-00092.jsonl.gz
376 kB
xet
20 days ago
4192135f
part-00093.jsonl.gz
555 kB
xet
20 days ago
72f0b9d4
part-00094.jsonl.gz
580 kB
xet
20 days ago
fa0c3774
part-00095.jsonl.gz
413 kB
xet
20 days ago
312d1c96
part-00096.jsonl.gz
505 kB
xet
20 days ago
242953e9
part-00097.jsonl.gz
557 kB
xet
20 days ago
43eaf812
part-00098.jsonl.gz
448 kB
xet
20 days ago
f429c586
part-00099.jsonl.gz
513 kB
xet
20 days ago
443d3ab4
Load more
Use this bucket
Total size
81.2 MB
Files
181
Last updated
Apr 4
Pre-warmed CDN
US
EU
US
EU
Contributors