Quantized llama2-70b-chat-hf model for use with NVIDIA's optimized MLPerf Inference implementations.

Format: Safetensors
Model size: 39B params
Tensor types: F16, F8_E4M3
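
The tensor types listed above can be checked against a downloaded weight file. The following is a minimal sketch, assuming the weights use the standard safetensors layout (an 8-byte little-endian header length followed by a JSON header that records each tensor's dtype and shape). The helper names and the toy file are hypothetical and are not part of this repository.

```python
import json
import struct
from collections import Counter
from math import prod


def read_safetensors_header(path):
    """Return the JSON header of a safetensors file as a dict."""
    with open(path, "rb") as f:
        # The first 8 bytes hold the header length as an unsigned little-endian integer.
        (header_len,) = struct.unpack("<Q", f.read(8))
        return json.loads(f.read(header_len).decode("utf-8"))


def params_by_dtype(header):
    """Sum element counts per dtype string, skipping the optional metadata entry."""
    counts = Counter()
    for name, info in header.items():
        if name == "__metadata__":
            continue
        counts[info["dtype"]] += prod(info["shape"])
    return dict(counts)


def write_toy_safetensors(path):
    """Write a tiny, hypothetical two-tensor file for demonstration only."""
    header = {
        "toy.weight": {"dtype": "F8_E4M3", "shape": [4, 8], "data_offsets": [0, 32]},
        "toy.scale": {"dtype": "F16", "shape": [4], "data_offsets": [32, 40]},
    }
    blob = json.dumps(header).encode("utf-8")
    with open(path, "wb") as f:
        f.write(struct.pack("<Q", len(blob)))
        f.write(blob)
        f.write(bytes(40))  # zero-filled tensor data: 32 one-byte values + 4 two-byte values
```

Running `params_by_dtype(read_safetensors_header(path))` on each shard of a real checkpoint and summing the results gives a per-dtype element count without loading any tensor data into memory.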
