Roblox Luau Mistral 7B - SFT (Deprecated)
This model is deprecated and should not be used. It suffers from output collapse and produces degenerate/repetitive outputs. Use the RFT version instead.
A LoRA adapter trained with supervised fine-tuning (SFT) on Mistral-7B-Instruct-v0.3 for Roblox Luau code generation. This was an early experiment that did not produce stable outputs.
Do Not Use
This model frequently collapses into repetitive or degenerate outputs. The successor RFT model fixes this through reinforcement fine-tuning with hybrid reward scoring.
Use instead: squaredcuber/roblox-luau-mistral-7b-rft
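If you want to check whether generated text shows the repetitive collapse described above, a rough heuristic is to measure how many word n-grams repeat earlier ones. The sketch below is only an illustration: the function names, the n-gram size of 4, and the 0.5 threshold are assumptions chosen for this example, not part of either model's training or evaluation.

```python
from collections import Counter


def repeated_ngram_fraction(text: str, n: int = 4) -> float:
    """Return the fraction of word n-grams that duplicate an earlier n-gram.

    0.0 means every n-gram is unique; values near 1.0 mean the text
    is dominated by a repeating loop.
    """
    tokens = text.split()
    if len(tokens) < n:
        # Too short to form a single n-gram, so nothing can repeat.
        return 0.0
    ngrams = [tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1)]
    counts = Counter(ngrams)
    # Each n-gram seen c times contributes c - 1 repeats.
    repeated = sum(c - 1 for c in counts.values())
    return repeated / len(ngrams)


def looks_collapsed(text: str, n: int = 4, threshold: float = 0.5) -> bool:
    """Flag text whose repeated n-gram fraction meets an assumed threshold."""
    return repeated_ngram_fraction(text, n) >= threshold


if __name__ == "__main__":
    looping = "local x = 1 " * 20
    healthy = "local part = Instance.new('Part') part.Parent = workspace"
    print(looks_collapsed(looping))  # a looping completion
    print(looks_collapsed(healthy))  # a short, non-repeating completion
```

Splitting on whitespace is a deliberate simplification; a tokenizer-level check would be stricter, and the threshold should be tuned on real completions before relying on it.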
Model tree for squaredcuber/roblox-luau-mistral-7b-2
Base model: mistralai/Mistral-7B-v0.3
Finetuned: mistralai/Mistral-7B-Instruct-v0.3