Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Buckets new
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
jishengpeng's picture
4 4 6

jishengpeng

novateur
questiontwo's profile picture cocoa2525's profile picture EgorV4x's profile picture
·
https://novateurjsp.github.io
  • jishengpeng

AI & ML interests

speech language model, discrete codec, text to speech

Recent Activity

published a model about 1 month ago
novateur/fff
liked a model 7 months ago
tencent/HunyuanImage-3.0
updated a model 11 months ago
novateur/eee
View all activity

Organizations

Zhejiang University's profile picture

upvoted a paper 11 months ago

WavReward: Spoken Dialogue Models With Generalist Reward Evaluators

Paper • 2505.09558 • Published May 14, 2025 • 10
upvoted a paper about 1 year ago

LLMVoX: Autoregressive Streaming Text-to-Speech Model for Any LLM

Paper • 2503.04724 • Published Mar 6, 2025 • 72
upvoted a collection over 1 year ago

WavTokenizer-Medium-Large

Collection
https://arxiv.org/abs/2408.16532 • 4 items • Updated Feb 25, 2025 • 11
upvoted a paper over 1 year ago

WavTokenizer: an Efficient Acoustic Discrete Codec Tokenizer for Audio Language Modeling

Paper • 2408.16532 • Published Aug 29, 2024 • 50
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs