Bucket: nielsr/arxiv-chandra-ocr-full-20260402-l40sx1-s10
Path: data
74.5 MB, 181 files, updated 19 days ago
Name                 Size    Uploaded     Xet hash
part-00000.jsonl.gz  363 kB  21 days ago  6ca8e6b9
part-00001.jsonl.gz  347 kB  21 days ago  e9a31e0b
part-00002.jsonl.gz  454 kB  21 days ago  077e6a35
part-00003.jsonl.gz  444 kB  21 days ago  04c89a02
part-00004.jsonl.gz  433 kB  21 days ago  63c1187f
part-00005.jsonl.gz  520 kB  21 days ago  e90639bd
part-00006.jsonl.gz  355 kB  21 days ago  6016c522
part-00007.jsonl.gz  408 kB  21 days ago  b3eaef52
part-00008.jsonl.gz  521 kB  21 days ago  f43eadaf
part-00009.jsonl.gz  474 kB  21 days ago  0f52479e
part-00010.jsonl.gz  421 kB  21 days ago  50635d20
part-00011.jsonl.gz  464 kB  21 days ago  c9e86894
part-00012.jsonl.gz  392 kB  21 days ago  2866bfc9
part-00013.jsonl.gz  324 kB  21 days ago  ecea0d72
part-00014.jsonl.gz  428 kB  21 days ago  dd70f2bb
part-00015.jsonl.gz  457 kB  21 days ago  7f364f6e
part-00016.jsonl.gz  433 kB  21 days ago  c995d220
part-00017.jsonl.gz  419 kB  21 days ago  cf671f4f
part-00018.jsonl.gz  323 kB  21 days ago  e443d651
part-00019.jsonl.gz  345 kB  21 days ago  7e2fe1e4
part-00020.jsonl.gz  466 kB  21 days ago  ad760d7e
part-00021.jsonl.gz  413 kB  21 days ago  c4c87b4c
part-00022.jsonl.gz  479 kB  21 days ago  2a5245d9
part-00023.jsonl.gz  449 kB  21 days ago  4beb6034
part-00024.jsonl.gz  356 kB  21 days ago  e2079577
part-00025.jsonl.gz  375 kB  21 days ago  f9e6c228
part-00026.jsonl.gz  460 kB  21 days ago  cc98a190
part-00027.jsonl.gz  448 kB  21 days ago  4195fd93
part-00028.jsonl.gz  388 kB  21 days ago  6898de55
part-00029.jsonl.gz  421 kB  21 days ago  a691d2e8
part-00030.jsonl.gz  374 kB  21 days ago  f13bcfcb
part-00031.jsonl.gz  443 kB  21 days ago  8e05307f
part-00032.jsonl.gz  445 kB  21 days ago  dfc2c88f
part-00033.jsonl.gz  516 kB  21 days ago  f197841d
part-00034.jsonl.gz  363 kB  21 days ago  d164083f
part-00035.jsonl.gz  472 kB  21 days ago  482d8f2c
part-00036.jsonl.gz  556 kB  21 days ago  3443cbe7
part-00037.jsonl.gz  441 kB  21 days ago  acdf0a03
part-00038.jsonl.gz  461 kB  21 days ago  cfe4fb24
part-00039.jsonl.gz  379 kB  21 days ago  00a9ab7c
part-00040.jsonl.gz  430 kB  21 days ago  f48c1011
part-00041.jsonl.gz  400 kB  21 days ago  23d2c641
part-00042.jsonl.gz  345 kB  21 days ago  227e7271
part-00043.jsonl.gz  442 kB  21 days ago  a069ad22
part-00044.jsonl.gz  459 kB  21 days ago  7acf19a2
part-00045.jsonl.gz  408 kB  21 days ago  f3b93d9e
part-00046.jsonl.gz  437 kB  21 days ago  3ca5ed80
part-00047.jsonl.gz  403 kB  21 days ago  dd089852
part-00048.jsonl.gz  419 kB  21 days ago  5960b0ea
part-00049.jsonl.gz  372 kB  21 days ago  d010aefb
part-00050.jsonl.gz  416 kB  21 days ago  3c44c498
part-00051.jsonl.gz  349 kB  21 days ago  63093ee8
part-00052.jsonl.gz  271 kB  21 days ago  dedb3cd9
part-00053.jsonl.gz  411 kB  21 days ago  3a260fa3
part-00054.jsonl.gz  468 kB  21 days ago  abd5f99c
part-00055.jsonl.gz  410 kB  21 days ago  73e6a1b2
part-00056.jsonl.gz  395 kB  21 days ago  68081560
part-00057.jsonl.gz  409 kB  20 days ago  87941592
part-00058.jsonl.gz  445 kB  20 days ago  1f86d6eb
part-00059.jsonl.gz  428 kB  20 days ago  4871a1e3
part-00060.jsonl.gz  437 kB  20 days ago  8660c8cc
part-00061.jsonl.gz  426 kB  20 days ago  3aecdc09
part-00062.jsonl.gz  350 kB  20 days ago  66b185dd
part-00063.jsonl.gz  440 kB  20 days ago  63ee517f
part-00064.jsonl.gz  455 kB  20 days ago  e5285c8e
part-00065.jsonl.gz  474 kB  20 days ago  126c5136
part-00066.jsonl.gz  433 kB  20 days ago  d9863920
part-00067.jsonl.gz  429 kB  20 days ago  574f5f22
part-00068.jsonl.gz  375 kB  20 days ago  e9110db7
part-00069.jsonl.gz  454 kB  20 days ago  d966fe23
part-00070.jsonl.gz  492 kB  20 days ago  68625d3c
part-00071.jsonl.gz  383 kB  20 days ago  086b47a5
part-00072.jsonl.gz  432 kB  20 days ago  93fa0e8f
part-00073.jsonl.gz  453 kB  20 days ago  799fc12a
part-00074.jsonl.gz  446 kB  20 days ago  570c5422
part-00075.jsonl.gz  414 kB  20 days ago  de9f0cbc
part-00076.jsonl.gz  461 kB  20 days ago  7fad3baf
part-00077.jsonl.gz  355 kB  20 days ago  77498799
part-00078.jsonl.gz  356 kB  20 days ago  edf4b6a9
part-00079.jsonl.gz  407 kB  20 days ago  6895c566
part-00080.jsonl.gz  383 kB  20 days ago  43fbeef0
part-00081.jsonl.gz  463 kB  20 days ago  ab1bf7dd
part-00082.jsonl.gz  501 kB  20 days ago  c22bffc9
part-00083.jsonl.gz  367 kB  20 days ago  f1c5989d
part-00084.jsonl.gz  333 kB  20 days ago  2d2057ff
part-00085.jsonl.gz  469 kB  20 days ago  6fb9e273
part-00086.jsonl.gz  376 kB  20 days ago  8373582b
part-00087.jsonl.gz  424 kB  20 days ago  fcd54c48
part-00088.jsonl.gz  330 kB  20 days ago  03e0ae1d
part-00089.jsonl.gz  508 kB  20 days ago  bfe2c9d7
part-00090.jsonl.gz  408 kB  20 days ago  d9a59f83
part-00091.jsonl.gz  397 kB  20 days ago  5263057f
part-00092.jsonl.gz  473 kB  20 days ago  742748ee
part-00093.jsonl.gz  471 kB  20 days ago  3cde4d31
part-00094.jsonl.gz  375 kB  20 days ago  db11f29a
part-00095.jsonl.gz  496 kB  20 days ago  1ff3b825
part-00096.jsonl.gz  417 kB  20 days ago  8def5676
part-00097.jsonl.gz  373 kB  20 days ago  01e6db99
part-00098.jsonl.gz  425 kB  20 days ago  e860c5a4
part-00099.jsonl.gz  369 kB  20 days ago  c0dcb2b9
Total size: 74.5 MB
Files: 181
Last updated: Apr 4
Pre-warmed CDN
US
EU