AI & ML interests

Org page for Safetensors: Simple, safe way to store and distribute tensors

Recent Activity

julien-c  new activity about 1 month ago
safetensors/convert:Reopen the Space
lysandre  new activity about 1 month ago
safetensors/convert:Reopen the Space
julien-c  new activity about 1 month ago
safetensors/convert:Update README.md
View all activity

victor 
posted an update 7 days ago
view post
Post
4701
Want to share my enthusiasm for zai-org/GLM-5.1 here too 🔥

I think we have it: our open source Claude Code = GLM-5.1 + Pi (https://pi.dev/) - Built a Three.js racing game to eval and it's extremely impressive. Thoughts:

- One-shot car physics with real drift mechanics (this is hard)

- My fav part: Awesome at self iterating (with no vision!) created 20+ Bun.WebView debugging tools to drive the car programmatically and read game state. Proved a winding bug with vector math without ever seeing the screen

- 531-line racing AI in a single write: 4 personalities, curvature map, racing lines, tactical drifting. Built telemetry tools to compare player vs AI speed curves and data-tuned parameters

- All assets from scratch: 3D models, procedural textures, sky shader, engine sounds, spatial AI audio!

- Can do hard math: proved road normals pointed DOWN via vector cross products, computed track curvature normalized by arc length to tune AI cornering speed

You are going to hear about this model a lot in the next months - open source let's go - and thanks z-ai🚀🚀
  • 4 replies
·
julien-c 
in safetensors/convert about 1 month ago

Reopen the Space

5
#43 opened about 2 months ago by
laiking
lysandre 
in safetensors/convert about 1 month ago

Reopen the Space

5
#43 opened about 2 months ago by
laiking
julien-c 
in safetensors/convert about 1 month ago

Update README.md

1
#45 opened about 2 months ago by deleted

Update README.md

1
#46 opened about 2 months ago by deleted
victor 
posted an update 3 months ago
view post
Post
2726
Interesting article: use Claude Code to help open models write CUDA kernels (for eg) by turning CC traces into Skills. They made a library out of it 👀

https://huggingface.co/blog/upskill
danieldk 
posted an update 3 months ago
view post
Post
2816
kernels 0.12 is out! 🎉

Changes:

* Support for kernel version branches to gracefully roll out kernel API changes.
* Support for PyTorch 2.10.
* kernel-builder is now merged into the kernels repo.
* Initial support for standardized kernel benchmarks.

https://github.com/huggingface/kernels/releases/tag/v0.12.0
victor 
posted an update 4 months ago
danieldk 
posted an update 6 months ago
lysandre 
posted an update 7 months ago
view post
Post
8553
We're kick-starting the process of Transformers v5, with @ArthurZ and @cyrilvallez !

v5 should be significant: we're using it as a milestone for performance optimizations, saner defaults, and a much cleaner code base worthy of 2025.

Fun fact: v4.0.0-rc-1 came out on Nov 19, 2020, nearly five years ago!
  • 6 replies
·
danieldk 
posted an update 9 months ago
view post
Post
2069
kernels 0.8.0 is out: https://github.com/huggingface/kernels/releases/tag/v0.8.0

This release refines kernel selection in the kernelize function:

• You can now register kernels for certain CUDA capability ranges.
• Rather than doing exact mating of modes, fall back to other compatible modes. If you are kernelizing for inference, but you only registered a training + torch.compile kernel, it will use that kernel since it is compatible with inference as well.
  • 1 reply
·
danieldk 
posted an update 9 months ago
danieldk 
posted an update 9 months ago
view post
Post
385
Kernels 0.7.0 is out: https://github.com/huggingface/kernels/releases/tag/v0.7.0 🚀

This release makes it possible to register multiple kernels for a layer. Do you have a super-fast kernel for inference and another kernel for training? Register them both and kernelize will pick the kernel depending on whether you are going to do training or inference.