Interesting Datasets A collection of datasets that I come across LDJnr/Capybara Viewer • Updated Jun 7, 2024 • 16k • 1.93k • 248 teknium/openhermes Viewer • Updated Sep 7, 2023 • 243k • 1.37k • 219 VMware/open-instruct Viewer • Updated Jul 12, 2023 • 143k • 279 • 44 euclaise/WritingPrompts_preferences Viewer • Updated Dec 25, 2023 • 265k • 155 • 10
Requires Filtering Datasets that HAS TO BE FILTERED. Squish42/bluemoon-fandom-1-1-rp-cleaned Updated Jul 9, 2023 • 207 • 78
Augmentable A collection of datasets that should be augmented further with gpt-4 allenai/ai2_arc Viewer • Updated Dec 21, 2023 • 7.79k • 395k • 325 codeparrot/apps Updated Oct 20, 2022 • 18.4k • 201 facebook/belebele Viewer • Updated Aug 12, 2024 • 110k • 17.3k • 126 google/boolq Viewer • Updated Jan 22, 2024 • 12.7k • 37.8k • 100
Interesting Datasets A collection of datasets that I come across LDJnr/Capybara Viewer • Updated Jun 7, 2024 • 16k • 1.93k • 248 teknium/openhermes Viewer • Updated Sep 7, 2023 • 243k • 1.37k • 219 VMware/open-instruct Viewer • Updated Jul 12, 2023 • 143k • 279 • 44 euclaise/WritingPrompts_preferences Viewer • Updated Dec 25, 2023 • 265k • 155 • 10
Augmentable A collection of datasets that should be augmented further with gpt-4 allenai/ai2_arc Viewer • Updated Dec 21, 2023 • 7.79k • 395k • 325 codeparrot/apps Updated Oct 20, 2022 • 18.4k • 201 facebook/belebele Viewer • Updated Aug 12, 2024 • 110k • 17.3k • 126 google/boolq Viewer • Updated Jan 22, 2024 • 12.7k • 37.8k • 100
Requires Filtering Datasets that HAS TO BE FILTERED. Squish42/bluemoon-fandom-1-1-rp-cleaned Updated Jul 9, 2023 • 207 • 78