Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Log In
Sign Up
1
Ifigeneia Apostolopoulou
ifaposto
Follow
ifaposto
ifiaposto
AI & ML interests
approximate inference, probabilistic models, uncertainty quantification, generative models
Recent Activity
posted
an
update
25 days ago
I’ve uploaded a minimal, self-contained implementation of manual autograd for a transformer-based classifier in PyTorch. It can help build intuition for what autograd is doing under the hood and is a useful hands-on reference for low-level differentiation in Transformer models, such as writing custom backward passes and tracing how gradients flow through attention blocks. 🐙 GitHub: https://github.com/ifiaposto/transformer_custom_autograd/tree/main 📓 Colab: https://colab.research.google.com/drive/1Lt7JDYG44p7YHJ76eRH_8QFOPkkoIwhn
new
activity
10 months ago
Qwen/Qwen2.5-7B-Instruct:
ValueError: Unrecognized model in Qwen/Qwen2.5-7B-Instruct. Should have a `model_type` key in its config.json,
published
an
article
about 1 year ago
Online Batch Size Adaptation in Hugging Face Trainer
View all activity
Organizations
None yet
ifaposto
's activity
All
Models
Datasets
Spaces
Buckets
Papers
Collections
Community
Posts
Upvotes
Likes
Articles
authored
a paper
over 1 year ago
A Rate-Distortion View of Uncertainty Quantification
Paper
•
2406.10775
•
Published
Jun 16, 2024
•
1