baichuan-vicuna-chinese-7b
baichuan-vicuna-chinese-7b is a chat model obtained by full-parameter supervised fine-tuning on Vicuna-style ShareGPT data in both English and Chinese.
- Foundation model: baichuan-7B, a commercially available Chinese-English bilingual language model pre-trained on 1.2T tokens.
- Finetuning data: ShareGPT, ShareGPT-ZH, COT & COT-ZH, Leetcode, dummy
- Training code: based on FastChat
[NEW] 4bit-128g GPTQ quantized version: baichuan-vicuna-chinese-7b-gptq
Training config
{batch_size: 256, epoch: 3, learning_rate: 2e-5, context_length: 4096, deepspeed_zero: 3, mixed_precision: bf16, gradient_clipping: 1.0}
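The `batch_size: 256` above is presumably the global batch size. As a rough sketch of how such a value is usually reached with data-parallel training, the helper below derives the number of gradient-accumulation steps from a hardware setup. The GPU count and per-device batch size are assumptions chosen for illustration, not figures from this card, and `grad_accum_steps` is a hypothetical helper name.

```python
def grad_accum_steps(global_batch: int, num_gpus: int, per_device_batch: int) -> int:
    """Return the accumulation steps needed to reach the global batch size."""
    per_step = num_gpus * per_device_batch  # sequences processed per forward/backward pass
    if global_batch % per_step != 0:
        raise ValueError("global batch must be a multiple of num_gpus * per_device_batch")
    return global_batch // per_step


# Hypothetical setup (assumption): 8 GPUs, 4 sequences per GPU per pass.
steps = grad_accum_steps(global_batch=256, num_gpus=8, per_device_batch=4)
print(steps)  # 8
```

With a different number of GPUs or a different per-device batch size, only the accumulation steps change; the effective batch size seen by the optimizer stays at 256.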
Inference
Inference with FastChat:
python3 -m fastchat.serve.cli --model-path fireballoon/baichuan-vicuna-chinese-7b
Inference with Transformers:
>>> from transformers import AutoTokenizer, AutoModelForCausalLM, TextStreamer
>>> tokenizer = AutoTokenizer.from_pretrained("fireballoon/baichuan-vicuna-chinese-7b", use_fast=False)
>>> model = AutoModelForCausalLM.from_pretrained("fireballoon/baichuan-vicuna-chinese-7b").half().cuda()
>>> streamer = TextStreamer(tokenizer, skip_prompt=True, skip_special_tokens=True)
>>> instruction = "A chat between a curious user and an artificial intelligence assistant. The assistant gives helpful, detailed, and polite answers to the user's questions. USER: {} ASSISTANT:"
>>> prompt = instruction.format("How can I improve my time management skills?") # user message
>>> generate_ids = model.generate(tokenizer(prompt, return_tensors='pt').input_ids.cuda(), max_new_tokens=2048, streamer=streamer)
'''
Improving time management skills can help you to be more productive and efficient with your time. Here are some tips to help you improve your time management skills:
1. Prioritize tasks: Make a list of all the tasks you need to complete and prioritize them based on their importance and urgency. This will help you to focus on the most important tasks first and avoid getting overwhelmed.
2. Use a calendar or planner: Use a calendar or planner to schedule your tasks and appointments. This will help you to stay organized and ensure that you don't miss any important events or deadlines.
3. Limit distractions: Identify and eliminate distractions, such as social media notifications or email notifications, that can keep you from focusing on your tasks. Consider using tools like browser extensions or apps to block distracting websites.
4. Take breaks: Take regular breaks throughout the day to recharge and refocus. This will help you to stay productive and avoid burnout.
5. Learn to say no: Don't be afraid to say no to requests or invitations that don't align with your goals or values. This will help you to focus on your priorities and avoid overcommitting yourself.
6. Learn to delegate: If possible, delegate tasks to others to free up your own time and focus on more important tasks.
7. Learn to be flexible: Be prepared to adjust your plans and priorities as needed. Things don't always go according to plan, and it's okay to adjust your schedule to accommodate unexpected events or challenges.
Remember that improving your time management skills takes practice and patience. Be kind to yourself and don't be too hard on yourself if you don't get everything done perfectly right away.
'''
Gradio multi-turn dialogue example: https://huggingface.co/fireballoon/baichuan-vicuna-chinese-7b/blob/main/app.py
python app.py
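For multi-turn use outside the Gradio app, the single-turn template shown earlier has to be extended with the previous turns. The sketch below assumes the Vicuna v1.1 convention (a space after the system prompt and an `</s>` token closing each assistant reply); `build_prompt` is a hypothetical helper, and the linked `app.py` remains the reference for the exact format.

```python
SYSTEM = (
    "A chat between a curious user and an artificial intelligence assistant. "
    "The assistant gives helpful, detailed, and polite answers to the user's questions."
)


def build_prompt(history, user_message, eos="</s>"):
    """Build a prompt from past (user, assistant) turns plus the new user message.

    The separator convention (space after the system prompt, `eos` after each
    assistant reply) is an assumption based on the Vicuna v1.1 format.
    """
    prompt = SYSTEM + " "
    for user, assistant in history:
        prompt += f"USER: {user} ASSISTANT: {assistant}{eos}"
    # Leave the final assistant slot open so the model continues from here.
    prompt += f"USER: {user_message} ASSISTANT:"
    return prompt


history = [("Hello", "Hi, how can I help you?")]
print(build_prompt(history, "How can I improve my time management skills?"))
```

With an empty `history`, the result is identical to the single-turn `instruction.format(...)` prompt above.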
Test cases on Ziya-Eval-Chinese
Q: Xiao Ming has 200 yuan. He spent 66 yuan on books and 12 yuan on stationery. How much less money does Xiao Ming have than he started with?
A: Xiao Ming has 66+12=78 yuan less than he started with.
Q: Please translate this English passage: Timmy was fined a large sum of money by the judge for drunken driving·
A: Timmy 因醉驾被处以高额罚金 (in English: "Timmy was given a heavy fine for drunk driving")
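Q: Are there any stylistic differences between Beethoven and Tchaikovsky?
A: Beethoven and Tchaikovsky are both among the most important composers in the history of classical music, and their styles differ in some respects. Beethoven's works are usually characterized by a serious, inward quality, and his music often conveys deep and rich emotion. His melodies are usually simple and graceful, with an emphasis on harmonic change and musical effect. Tchaikovsky's works, by contrast, often have a more ornate and dreamlike quality, and his music is often dramatic and songlike. Tchaikovsky's melodies are often more complex, emphasizing the contrast between harmony and melody, and his music is often full of color and ornamentation. Overall, the musical styles of Beethoven and Tchaikovsky differ clearly in introspection and lyricism, but both composers hold an important place in the history of classical music, and their works are still widely performed and appreciated.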
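Q: Write a story about an innocent high-school romance. The male lead is "I" and the female lead is called Xiaomei. I fell for Xiaomei in my first year of high school. Start writing.
A: My name is [illegible], and I am a first-year high-school student. On the first day of school I fell in love at first sight with a girl. Her name is Xiaomei, and she is in my class. It was a bright, sunny morning. I was in the corridor outside the classroom, watching my classmates walk in chatting and laughing, when a girl suddenly caught my eye. She was wearing a white dress that sparkled in the sunlight. She seemed to be looking at me, and I smiled at her too. That is how we got to know each other.
In the days that followed, I could not help stealing glances at her. In class I kept thinking of her without meaning to, and I even began to look forward to every chance of seeing her. I started writing her love letters; every day after school I wrote her a letter expressing how much I missed and loved her.
But Xiaomei did not seem to care much about me. She was always chatting and laughing with her close friends and seemed to ignore me. I felt very let down, but I still could not help wanting to get close to her. I would steal glances at her during breaks, pretend to bump into her by accident at lunch in the canteen, and say hello when I met her on the way.
So I kept writing to her every day, expressing how much I missed and loved her, until in the second year Xiaomei finally gave me a response. We started going to and from school together, having lunch together, and doing many things together. I had finally won her love, and I felt like the happiest person in the world.
However, our happiness did not last long. In the third year Xiaomei became busy; she had to attend many cram classes and exams. I worried that she would wear herself out, so I started arguing with her. In the end we broke up. I was very sad and felt that I had lost the person I loved most.
Now I graduated many years ago. Whenever I see Xiaomei, I still think of the time we spent together, and I know those are the most beautiful memories of my life.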
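Q: I am learning to program from scratch; please give me a three-month study plan.
A: Here is a three-month study plan for your reference:
1. Weeks 1-2: Python basics
* Learn Python fundamentals such as variables and data types, loops, conditional statements, functions, and methods.
* Practice basic programming exercises such as reading and writing data from the keyboard, control-flow statements, and function calls.
1. Weeks 3-4: Object-oriented programming in Python
* Learn the basics of object-oriented programming in Python, such as classes, methods, attributes and methods, inheritance, and polymorphism.
* Practice object-oriented programming exercises such as class creation, class invocation, and class composition.
1. Weeks 5-6: Advanced Python topics
* Learn advanced Python topics such as exceptions, memory management, debugging techniques, and performance optimization.
* Practice advanced-topic exercises such as exception handling, memory management, and performance optimization.
1. Weeks 7-8: Python web development
* Learn the basics of Python web development, such as the Django framework, the Flask framework, and HTML, CSS, and JavaScript.
* Practice web development exercises such as creating a Django or Flask application, HTTP requests and responses, and user interface design.
1. Weeks 9-10: Project practice
* Choose a project that interests you and work on it, such as a web crawler, data analysis, machine learning, or web development.
* Complete the project, then demonstrate and explain it.
1. Weeks 11-12: Review and reinforcement
* Review what you have learned and do reinforcement exercises to deepen your understanding.
* Take an exam to assess your learning outcomes.
Note: this study plan is for reference only; you can adjust it to your own learning pace and interests.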
Q: Implement a Python function that uses dynamic programming to find the longest common subsequence of two input strings.
A: Here is a Python function that uses dynamic programming to find the longest common subsequence (LCS) of two input strings:
def lcs(s1, s2):
    n = len(s1)
    m = len(s2)
    dp = [s1[0]]
    max_length = 0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            if s1[i-1] == s2[j-1]:
                dp.append(s1[i-1])
                max_length = max(max_length, j-i+1)
    return max(max_length, len(s1))
The function uses dynamic programming to compute the LCS. It starts from the first character of the first string and adds it to the `dp` list that stores the subsequence. It then loops over the characters of the two strings; if they match, it appends the corresponding character to `dp` and stores the maximum length of the current string in `max_length`.
Finally, the function returns the maximum of `max_length` and the lengths of the two input strings, which represents the length of the LCS. Note that the function assumes the inputs are strings rather than character arrays. To handle arrays, the function would need to be modified accordingly.
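The generated `lcs` above is reproduced as the model wrote it. It never fills a DP table and always returns at least `len(s1)`, so it does not compute the LCS length in general. For comparison, here is a minimal sketch of the textbook dynamic-programming solution; the name `lcs_length` is illustrative and is not part of the model's output.

```python
def lcs_length(s1: str, s2: str) -> int:
    """Return the length of the longest common subsequence of s1 and s2."""
    n, m = len(s1), len(s2)
    # dp[i][j] holds the LCS length of the prefixes s1[:i] and s2[:j].
    dp = [[0] * (m + 1) for _ in range(n + 1)]
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            if s1[i - 1] == s2[j - 1]:
                # Matching characters extend the LCS of the shorter prefixes.
                dp[i][j] = dp[i - 1][j - 1] + 1
            else:
                # Otherwise drop one character from either string.
                dp[i][j] = max(dp[i - 1][j], dp[i][j - 1])
    return dp[n][m]


print(lcs_length("ABCBDAB", "BDCABA"))  # 4
```

The table uses O(n·m) time and memory; keeping only the previous row would reduce memory to O(m) if just the length is needed.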