Bidirectional Qwen
Collection
Qwen models with custom class for bidirectional attention • 2 items • Updated
YAML Metadata Warning:empty or missing yaml metadata in repo card
Check out the documentation for more information.
Qwen2.5-0.5 with bidirectional attention continually pre-trained on MS-MARCO documents for masked next token prediction, as per LLM2VEC.