Instructions to use thekraftors/text-to-sql-beta-v1-q8 with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- Transformers
How to use thekraftors/text-to-sql-beta-v1-q8 with Transformers:
# Use a pipeline as a high-level helper from transformers import pipeline pipe = pipeline("text-generation", model="thekraftors/text-to-sql-beta-v1-q8")# Load model directly from transformers import AutoModelForCausalLM model = AutoModelForCausalLM.from_pretrained("thekraftors/text-to-sql-beta-v1-q8", dtype="auto") - Notebooks
- Google Colab
- Kaggle
- Local Apps
- vLLM
How to use thekraftors/text-to-sql-beta-v1-q8 with vLLM:
Install from pip and serve model
# Install vLLM from pip: pip install vllm # Start the vLLM server: vllm serve "thekraftors/text-to-sql-beta-v1-q8" # Call the server using curl (OpenAI-compatible API): curl -X POST "http://localhost:8000/v1/completions" \ -H "Content-Type: application/json" \ --data '{ "model": "thekraftors/text-to-sql-beta-v1-q8", "prompt": "Once upon a time,", "max_tokens": 512, "temperature": 0.5 }'Use Docker
docker model run hf.co/thekraftors/text-to-sql-beta-v1-q8
- SGLang
How to use thekraftors/text-to-sql-beta-v1-q8 with SGLang:
Install from pip and serve model
# Install SGLang from pip: pip install sglang # Start the SGLang server: python3 -m sglang.launch_server \ --model-path "thekraftors/text-to-sql-beta-v1-q8" \ --host 0.0.0.0 \ --port 30000 # Call the server using curl (OpenAI-compatible API): curl -X POST "http://localhost:30000/v1/completions" \ -H "Content-Type: application/json" \ --data '{ "model": "thekraftors/text-to-sql-beta-v1-q8", "prompt": "Once upon a time,", "max_tokens": 512, "temperature": 0.5 }'Use Docker images
docker run --gpus all \ --shm-size 32g \ -p 30000:30000 \ -v ~/.cache/huggingface:/root/.cache/huggingface \ --env "HF_TOKEN=<secret>" \ --ipc=host \ lmsysorg/sglang:latest \ python3 -m sglang.launch_server \ --model-path "thekraftors/text-to-sql-beta-v1-q8" \ --host 0.0.0.0 \ --port 30000 # Call the server using curl (OpenAI-compatible API): curl -X POST "http://localhost:30000/v1/completions" \ -H "Content-Type: application/json" \ --data '{ "model": "thekraftors/text-to-sql-beta-v1-q8", "prompt": "Once upon a time,", "max_tokens": 512, "temperature": 0.5 }' - Docker Model Runner
How to use thekraftors/text-to-sql-beta-v1-q8 with Docker Model Runner:
docker model run hf.co/thekraftors/text-to-sql-beta-v1-q8
license: apache-2.0
language:
- en
Your Project Name
Description
Briefly describe your project and its purpose.
Files
.gitattributes: Git configuration file.README.md: This file, containing information about the project.initial commit: Description of the initial commit.checkpoint: Checkpoint file.counter: File related to counting.encoder.json: JSON file for the encoder.events.out.tfevents.1705309987.5f53e38a8413: TensorFlow events file.hparams.json: JSON file containing hyperparameters.model-5000.data-00000-of-00001: Model data file.model-5000.index: Model index file.model-5000.meta: Model meta file.vocab.bpe: Vocabulary file.
Usage
Explain how to use or run your project. Include any dependencies and setup instructions.
Model Training
If applicable, provide information on how the model was fine-tuned and any specific training steps.
License
Specify the license under which your project is released.
Acknowledgments
Give credit to any resources, libraries, or individuals you used or were inspired by during the development.
Contact
Provide contact information for the maintainer or contributors, in case someone wants to reach out.
Feel free to customize this template based on the specific details of your project.