Bidirectional Qwen
Collection
Qwen models with custom class for bidirectional attention • 2 items • Updated
YAML Metadata Warning:empty or missing yaml metadata in repo card
Check out the documentation for more information.
Original Qwen2.5-0.5B with a custom loading class which turns the model bidirectional (causal=False).