TRI-ML / Foundry-LLM-1.2B-1T (Text Generation)

Foundry-LLM-1.2B-1T / config_model.yaml

Last commit: f2cb618 by jmercat, "release: initial squashed history" (about 1 month ago). File size: 385 Bytes.

attn_name: torch_attn
cast_output_to_float32: false
ffn_type: swiglu
freeze: false
hidden_dim: 2048
is_causal: true
max_seq_len: 2048
n_heads: 16
n_layers: 24
norm_eps: 1.0e-05
norm_type: lp_layer_norm
positional_embedding_type: rotary
post_embed_norm: false
qk_norm: true
resume_from_checkpoint: null
resume_weights_only: false
type: transformer
vocab_size: 49280
weight_tying: false
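
The settings above fix the model's shape, so a few derived quantities can be sketched from them. The Python below is an illustration only, not code from this repository: the names `CONFIG`, `head_dim`, `assumed_swiglu_width`, and `estimate_params` are hypothetical. The config does not state the SwiGLU inner width, so the sketch assumes the common convention of 8/3 times `hidden_dim` rounded up to a multiple of 256. It also assumes bias-free linear layers and ignores the small normalization parameters (including those added by `qk_norm`).

```python
# Values copied from config_model.yaml; everything else here is illustrative.
CONFIG = {
    "hidden_dim": 2048,
    "n_heads": 16,
    "n_layers": 24,
    "vocab_size": 49280,
    "ffn_type": "swiglu",
    "weight_tying": False,
}


def head_dim(cfg):
    """Per-head width; hidden_dim must split evenly across the heads."""
    if cfg["hidden_dim"] % cfg["n_heads"] != 0:
        raise ValueError("hidden_dim must be divisible by n_heads")
    return cfg["hidden_dim"] // cfg["n_heads"]


def assumed_swiglu_width(dim, multiple_of=256):
    """ASSUMPTION (not in the config): 8/3 * dim, rounded up to a multiple of 256."""
    raw = int(8 * dim / 3)
    return multiple_of * ((raw + multiple_of - 1) // multiple_of)


def estimate_params(cfg):
    """Rough weight count, assuming no biases and ignoring norm parameters."""
    d = cfg["hidden_dim"]
    ffn = assumed_swiglu_width(d)
    attn = 4 * d * d   # Q, K, V, and output projections
    mlp = 3 * d * ffn  # SwiGLU uses three weight matrices
    blocks = cfg["n_layers"] * (attn + mlp)
    embed = cfg["vocab_size"] * d
    # With weight_tying false, the output projection is a second vocab x d matrix.
    embed_total = embed if cfg["weight_tying"] else 2 * embed
    return {"blocks": blocks, "embeddings": embed_total, "total": blocks + embed_total}


if __name__ == "__main__":
    print("head_dim:", head_dim(CONFIG))
    print(estimate_params(CONFIG))
```

Under these assumptions each head is 128 wide, the 24 transformer blocks hold 1,233,125,376 weights, and the untied input and output embeddings add 201,850,880 more. A different FFN width or the presence of biases would change these figures.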