Text Generation
Transformers
Safetensors
llama
thinking
reasoning
Gemini Flash
creative
creative writing
fiction writing
plot generation
sub-plot generation
story generation
scene continue
storytelling
fiction story
science fiction
romance
all genres
story
writing
vivid prosing
vivid writing
fiction
roleplaying
bfloat16
role play
128k context
llama3.3
llama-3
llama-3.3
unsloth
finetune
conversational
text-generation-inference
using with VLLM
#1
by prudant - opened
can be used with some reasoning parser in VLLM in order to use in production environments on a reasoning transparent way?
Yes ; however keep in mind this is still a Llama 3.3 8B , some data will be out of date.
Likewise, it may still have L 3.3 8B "quirks".
During the tuning some data was updated; but I would not consider this a full and complete update.
Please test carefully before using in production.
You might want to consider new Mistral Nemo Instructs (now Claude Opus 4.5 High reasoning) just uploaded today ; which have reasoning
as well and perform very well.
thanks!