Engine Phase 1b: AVX2 vectorized matmul + RMSNorm kernels bb40248 verified ticketguy committed 1 day ago