Hugging Face
Sir Doge
PlanetDOGE
4 followers · 22 following
DogeCodez
AI & ML interests
Artificial intelligence research
Recent Activity
reacted to ajibawa-2023's post with 👍 · 3 days ago
PHP-Code-Large Dataset: https://huggingface.co/datasets/ajibawa-2023/PHP-Code-Large
PHP-Code-Large is a large-scale corpus of PHP source code comprising more than 12 million lines of PHP code. The dataset is designed to support research in large language model (LLM) pretraining, code intelligence, software engineering automation, and static program analysis for the PHP ecosystem. By providing a high-volume, language-specific corpus, PHP-Code-Large enables systematic experimentation in PHP-focused model training, domain adaptation, and downstream code understanding tasks. PHP-Code-Large addresses the need for a dedicated PHP-only dataset at substantial scale, enabling focused research across backend systems, CMS platforms, APIs, and full-stack PHP environments.
reacted to ajibawa-2023's post with 🔥 · 3 days ago
Python-Code-Large Dataset: https://huggingface.co/datasets/ajibawa-2023/Python-Code-Large
Python-Code-Large is a large-scale corpus of Python source code comprising more than 2 million rows of Python code. The dataset is designed to support research in large language model (LLM) pretraining, code intelligence, software engineering automation, and program analysis for the Python ecosystem. By providing a high-volume, language-specific corpus, Python-Code-Large enables systematic experimentation in Python-focused model training, domain adaptation, and downstream code understanding tasks. Python-Code-Large addresses the need for a dedicated Python-only dataset at substantial scale, enabling focused research across data science, backend systems, automation, scientific computing, and AI-driven Python environments.
reacted to ajibawa-2023's post with 🔥 · 3 days ago
Cpp-Code-Large Dataset: https://huggingface.co/datasets/ajibawa-2023/Cpp-Code-Large
Cpp-Code-Large is a large-scale corpus of C++ source code comprising more than 5 million lines of C++ code. The dataset is designed to support research in large language model (LLM) pretraining, code intelligence, software engineering automation, and static program analysis for the C++ ecosystem. By providing a high-volume, language-specific corpus, Cpp-Code-Large enables systematic experimentation in C++-focused model training, domain adaptation, and downstream code understanding tasks. Cpp-Code-Large addresses the need for a dedicated C++-only dataset at substantial scale, enabling focused research across systems programming, performance-critical applications, embedded systems, game engines, and large-scale native software projects.
PlanetDOGE's activity
liked a Space · over 1 year ago
KAI 7B Chat 💻 • Build error • 1
liked a model · over 1 year ago
Keynote-Technology/TIGMaN-text-to-image • Text-to-Image • Updated Aug 21, 2024 • 7 • 1
liked 2 Spaces · over 2 years ago
Convert to Safetensors 🐶 • Runtime error • 4
Convert to Safetensors 🐶 • Paused • 263 • Convert models to Safetensors and open a PR
liked 3 models · over 2 years ago
Writer/palmyra-3B • Text Generation • Updated Aug 17, 2024 • 2.77k • 9
deepseek-ai/deepseek-coder-1.3b-instruct • Text Generation • Updated Mar 7, 2024 • 68.7k • 160
sd-concepts-library/cute-game-style • Updated Oct 26, 2022 • 23
liked a Space · over 2 years ago
Model Evaluator 📊 • Runtime error • Featured • 174
liked a dataset · over 2 years ago
Keynote-Technology/PLANE-2K • Viewer • Updated Nov 11, 2023 • 2k • 11 • 3
liked a model · over 2 years ago
Keynote-Technology/KAI-7B-Instruct-v0.1 • Text Generation • 7B • Updated Aug 23, 2024 • 9 • 10
liked a Space · over 2 years ago
More Image Models 😻 • Build error • 7
liked 2 models · over 2 years ago
afrideva/TinyKAI-3B-beta-GGUF • Text Generation • 3B • Updated Nov 10, 2023 • 135 • 1
Keynote-Technology/TinyKAI-0.7B-v0.1 • Text Generation • 0.1B • Updated Aug 24, 2024 • 9 • 3
liked a dataset · over 2 years ago
togethercomputer/RedPajama-Data-V2 • Updated Nov 21, 2024 • 4.23k • 400
liked 6 models · over 2 years ago
cloudqi/cqi_text_to_image_pt_v0 • Text-to-Image • Updated May 20, 2025 • 69 • 22
openai/imagegpt-small • Updated Jun 12, 2023 • 1.85k • 28
lighteternal/fact-or-opinion-xlmr-el • Text Classification • Updated Feb 27, 2022 • 3.32k • 25
mistralai/Mistral-7B-Instruct-v0.1 • Text Generation • Updated Jul 24, 2025 • 382k • 1.83k
Keynote-Technology/TinyKAI-3B-v0.1 • Text Generation • 3B • Updated Nov 18, 2023 • 7 • 2
openlm-research/open_llama_3b_v2 • Text Generation • Updated Jul 16, 2023 • 16.7k • 160