SKT-SURYA-H (2.544T) is officially out! Heterogeneous MoE | 131K Context | 3.76TB Weights (898 shards). Massive respect to the team for keeping it open for the community!
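A quick back-of-the-envelope check on the numbers above (a sketch only, assuming decimal units, i.e. 1 TB = 10^12 bytes; the actual shard layout and precision are not stated in the post):

```python
TB = 10**12  # assuming decimal terabytes; binary TiB would shift the results slightly
total_bytes = 3.76 * TB
num_shards = 898
params = 2.544 * 10**12

# Average bytes per shard, from the stated totals
per_shard_gb = total_bytes / num_shards / 10**9
print(f"{per_shard_gb:.2f} GB per shard")  # roughly 4.19 GB

# Average storage per parameter; ~2.0 would suggest BF16/FP16,
# ~1.0 FP8 -- a value in between hints at mixed precision or compression
bytes_per_param = total_bytes / params
print(f"{bytes_per_param:.2f} bytes per parameter")  # roughly 1.48
```

At about 1.48 bytes per parameter, the checkpoint is smaller than a uniform 16-bit dump of 2.544T parameters would be, though the post itself does not say which precision or packing is used.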
After two years of research and hard work, we've crossed the 2.5T barrier! SKT-SURYA-H is now live: 2.544 trillion parameters, powered by our unique Weight Manifold Fusion (WMF) technology. Sovereign AI for Bharat is no longer a dream.
We are thrilled to announce the launch of SKT-OMNI-CORPUS-146T-V1, a massive-scale, high-quality dataset designed to power the next generation of foundation models (LLMs) from scratch. Developed at SKT AI LABS, this corpus is not just a collection of data; it's a mission to decentralize high-grade AI training for regional languages and global knowledge.
Key Highlights:
• Massive Scale: Targeting a multi-terabyte architecture for 146T-level tokenization.
• Pure Quality: Curated from 500+ elite sources.
• Structured for MoE: Perfectly sharded into standardized 3.5GB units (SKT series) for seamless distributed training.
Open for Collaboration!
We are looking for AI researchers, CUDA engineers, and data scientists to join us on this journey of building Project Surya and the ST-X Series models. Whether it's optimization, custom tokenization, or architecture design: let's build the future together.