1024m committed on
Commit dd7a3bb · verified · 1 parent: c4bc681

Delete README.md

Files changed (1)
  1. README.md +0 -48
README.md DELETED
@@ -1,48 +0,0 @@
- ---
- language:
- - en
- license: apache-2.0
- tags:
- - pretrained
- pipeline_tag: text-generation
- inference:
-   parameters:
-     temperature: 0.7
-
- extra_gated_description: If you want to learn more about how we process your personal data, please read our <a href="https://mistral.ai/terms/">Privacy Policy</a>.
- ---
-
- # Model Card for Mistral-7B-v0.1
-
- The Mistral-7B-v0.1 Large Language Model (LLM) is a pretrained generative text model with 7 billion parameters.
- Mistral-7B-v0.1 outperforms Llama 2 13B on all benchmarks we tested.
-
- For full details of this model, please read our [paper](https://arxiv.org/abs/2310.06825) and [release blog post](https://mistral.ai/news/announcing-mistral-7b/).
-
- ## Model Architecture
-
- Mistral-7B-v0.1 is a transformer model with the following architecture choices:
- - Grouped-Query Attention
- - Sliding-Window Attention
- - Byte-fallback BPE tokenizer
-
- ## Troubleshooting
-
- If you see the following error:
- ```
- KeyError: 'mistral'
- ```
- or this one:
- ```
- NotImplementedError: Cannot copy out of meta tensor; no data!
- ```
-
- ensure you are using a stable version of Transformers, 4.34.0 or newer.
-
- ## Notice
-
- Mistral 7B is a pretrained base model and therefore does not have any moderation mechanisms.
-
- ## The Mistral AI Team
-
- Albert Jiang, Alexandre Sablayrolles, Arthur Mensch, Chris Bamford, Devendra Singh Chaplot, Diego de las Casas, Florian Bressand, Gianna Lengyel, Guillaume Lample, Lélio Renard Lavaud, Lucile Saulnier, Marie-Anne Lachaux, Pierre Stock, Teven Le Scao, Thibaut Lavril, Thomas Wang, Timothée Lacroix, William El Sayed.
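The Transformers version requirement from the deleted README's Troubleshooting section (the `mistral` architecture landed in release 4.34.0; older releases raise `KeyError: 'mistral'`) can be sketched as a small pre-flight check. The helper names below (`version_tuple`, `supports_mistral`, `MIN_TRANSFORMERS`) are illustrative, not part of any library:

```python
# Sketch: verify the installed Transformers release is new enough to load
# Mistral-7B-v0.1. The "mistral" model type was added in 4.34.0; older
# releases fail with KeyError: 'mistral'. Helper names are illustrative.

def version_tuple(v: str) -> tuple:
    """Parse '4.34.0' into (4, 34, 0), ignoring rc/dev suffixes."""
    parts = []
    for piece in v.split("."):
        digits = ""
        for ch in piece:
            if ch.isdigit():
                digits += ch
            else:
                break  # stop at suffixes like 'rc1' or 'dev0'
        parts.append(int(digits) if digits else 0)
    return tuple(parts)

MIN_TRANSFORMERS = "4.34.0"  # first release with the "mistral" architecture

def supports_mistral(installed_version: str) -> bool:
    """True if this Transformers release can load Mistral-7B-v0.1."""
    return version_tuple(installed_version) >= version_tuple(MIN_TRANSFORMERS)
```

With a new-enough install, the model loads through the standard `AutoModelForCausalLM.from_pretrained("mistralai/Mistral-7B-v0.1")` call, so this check is only useful as a clearer failure message before attempting the download.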