Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Log In
Sign Up
6
14
Rummy
yang31210999
Follow
zunhai's profile picture
John6666's profile picture
wanng's profile picture
5 followers
ยท
1 following
AI & ML interests
None yet
Recent Activity
upvoted
a
paper
3 days ago
Attention Sink in Transformers: A Survey on Utilization, Interpretation, and Mitigation
updated
a model
16 days ago
yang31210999/result-weight-similarity-0327_ICML_Rebuttal
published
a model
16 days ago
yang31210999/result-weight-similarity-0327_ICML_Rebuttal
View all activity
Organizations
yang31210999
's activity
All
Models
Datasets
Spaces
Buckets
Papers
Collections
Community
Posts
Upvotes
Likes
Articles
New activity in
ISTA-DASLab/Meta-Llama-3.1-8B-Instruct-AQLM-PV-2Bit-1x16-hf
9 months ago
Tokenizer Config: incorrect EOS token and missing chat template
๐
1
1
#1 opened over 1 year ago by
av-codes
New activity in
mistralai/Mistral-Small-3.1-24B-Instruct-2503
about 1 year ago
Mistral3ForConditionalGeneration has no vLLM implementation and the Transformers implementation is not compatible with vLLM. Try setting VLLM_USE_V1=0.
๐
4
3
#16 opened about 1 year ago by
pedrojfb99
New activity in
google/gemma-3-27b-it
about 1 year ago
evals (PT vs IT)
๐
2
1
#30 opened about 1 year ago by
erichartford
New activity in
yang31210999/Llama3.1-1B-Neo-BAAI-1000k
about 1 year ago
Add library name, pipeline tag, paper link, and Github link
#1 opened about 1 year ago by
nielsr
New activity in
yang31210999/Llama-3.1-Minitron-4B-Depth-Neo-BAAI-100k
about 1 year ago
Enhance model card with metadata, paper link, and basic usage
#1 opened about 1 year ago by
nielsr
New activity in
yang31210999/Llama-3.2-1B-Instruct-Neo-BAAI-10k
about 1 year ago
Add pipeline tag, library name and link to Github repository
#1 opened about 1 year ago by
nielsr