Convert checkpoint files to float16
#6
by mkardas - opened
No description provided.
mkardas changed pull request status to open
How can I implement this?
What are you trying to achieve?
The 1.3b model uses most of my 8 GB of VRAM, so large requests push it over pretty quickly. I was hoping this would cut memory use down.
You can load your model with:
```python
import torch
from transformers import OPTForCausalLM

model = OPTForCausalLM.from_pretrained(
    "facebook/galactica-1.3b",
    torch_dtype=torch.float16,
    device_map="auto",
)
```
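For a back-of-the-envelope check on why half precision helps: each float32 weight takes 4 bytes and each float16 weight takes 2, so loading the checkpoint in fp16 roughly halves the weight memory. A quick sketch (using NumPy dtypes as a stand-in, and 1.3B as a rough parameter count):

```python
import numpy as np

# float32 stores each value in 4 bytes; float16 in 2 bytes,
# so loading the weights in half precision roughly halves memory use.
n_params = 1_300_000_000  # rough parameter count of the 1.3b model

fp32_bytes = n_params * np.dtype(np.float32).itemsize
fp16_bytes = n_params * np.dtype(np.float16).itemsize

print(f"float32: {fp32_bytes / 1e9:.1f} GB")  # ~5.2 GB
print(f"float16: {fp16_bytes / 1e9:.1f} GB")  # ~2.6 GB
```

Note this only covers the weights themselves; activations and the KV cache during generation add on top, which is why long requests can still run out of VRAM.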
mkardas changed pull request status to merged