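Kokoro is an open-weight TTS model with 82 million parameters. Despite its lightweight architecture, it delivers quality comparable to larger models while being faster and more cost-efficient. With Apache-licensed weights, Kokoro can be deployed anywhere from production environments to personal projects.

Official page: hexgrad/kokoro: https://hf.co/hexgrad/Kokoro-82M

Now let's try Kokoro in practice.

Installing and using on Linux

Install the libraries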

pip install -q "kokoro>=0.8.2" "misaki[zh]>=0.8.2" soundfile

One-step install and run

For simplicity, follow the official example: paste the statements below into a Kaggle or Colab notebook and run them:

!pip install -q "kokoro>=0.9.4" soundfile
!apt-get -qq -y install espeak-ng > /dev/null 2>&1
from kokoro import KPipeline
from IPython.display import display, Audio
import soundfile as sf
import torch
pipeline = KPipeline(lang_code='a')
text = '''
[Kokoro](/kˈOkəɹO/) is an open-weight TTS model with 82 million parameters. Despite its lightweight architecture, it delivers comparable quality to larger models while being significantly faster and more cost-efficient. With Apache-licensed weights, [Kokoro](/kˈOkəɹO/) can be deployed anywhere from production environments to personal projects.
'''
generator = pipeline(text, voice='af_heart')
for i, (gs, ps, audio) in enumerate(generator):
    print(i, gs, ps)
    display(Audio(data=audio, rate=24000, autoplay=i==0))
    sf.write(f'{i}.wav', audio, 24000)

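These statements install the two Python packages kokoro and soundfile with pip, and install the espeak-ng package with apt (on Ubuntu).

Assign the text you want converted to speech to the text variable, then run the conversion. The loop prints each segment's index, text, and phonemes, plays the audio, and saves it as a numbered WAV file.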
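
Because the loop saves one file per segment (0.wav, 1.wav, and so on), it can be handy to join them afterwards. The following is a minimal sketch, not part of Kokoro, that uses only the standard-library wave module. It assumes the files are PCM WAV files with identical channel count, sample width, and sample rate, which should hold when they all come from the same sf.write call with default settings. The helper name merge_wavs is made up for this illustration.

```python
import wave


def merge_wavs(paths, out_path):
    """Concatenate PCM WAV files that share one format into a single file."""
    fmt = None
    chunks = []
    for path in paths:
        with wave.open(str(path), "rb") as src:
            this_fmt = (src.getnchannels(), src.getsampwidth(), src.getframerate())
            if fmt is None:
                fmt = this_fmt
            elif this_fmt != fmt:
                raise ValueError(f"{path} has format {this_fmt}, expected {fmt}")
            chunks.append(src.readframes(src.getnframes()))
    if fmt is None:
        raise ValueError("no input files given")
    with wave.open(str(out_path), "wb") as dst:
        dst.setnchannels(fmt[0])
        dst.setsampwidth(fmt[1])
        dst.setframerate(fmt[2])
        for chunk in chunks:
            dst.writeframes(chunk)
    return out_path
```

For example, if the loop produced three segments, merge_wavs(["0.wav", "1.wav", "2.wav"], "combined.wav") would write them back to back into combined.wav.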

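English output is acceptable, but digits mixed into the text are not handled well, and synthesis fails if the text contains Chinese.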
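
If digits cause trouble in your setup, one possible workaround is to spell them out before passing the text to the pipeline. The sketch below is a deliberately naive assumption-based example, not a Kokoro feature: it reads numbers digit by digit ("42" becomes "four two"), does not handle decimals, dates, or ordinals, and the helper name spell_out_digits is hypothetical.

```python
import re

DIGIT_WORDS = ["zero", "one", "two", "three", "four",
               "five", "six", "seven", "eight", "nine"]


def spell_out_digits(text):
    """Replace each run of ASCII digits with its digits spelled out in English."""
    def replace_run(match):
        return " ".join(DIGIT_WORDS[int(ch)] for ch in match.group())
    return re.sub(r"[0-9]+", replace_run, text)


print(spell_out_digits("Room 42 is on floor 3."))
# prints: Room four two is on floor three.
```

Line breaks in the text are left untouched, so splitting on newlines in the pipeline behaves the same as before.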

Multiple languages

Install the Chinese library

pip install "misaki[zh]"

An example for a Kaggle or Colab notebook:

# 1️⃣ Install kokoro
!pip install -q "kokoro>=0.9.4" soundfile
# 2️⃣ Install espeak, used for English OOD fallback and some non-English languages
!apt-get -qq -y install espeak-ng > /dev/null 2>&1
# 3️⃣ Initialize a pipeline
from kokoro import KPipeline
from IPython.display import display, Audio
import soundfile as sf
import torch
# 🇺🇸 'a' => American English, 🇬🇧 'b' => British English
# 🇪🇸 'e' => Spanish es
# 🇫🇷 'f' => French fr-fr
# 🇮🇳 'h' => Hindi hi
# 🇮🇹 'i' => Italian it
# 🇯🇵 'j' => Japanese: pip install misaki[ja]
# 🇧🇷 'p' => Brazilian Portuguese pt-br
# 🇨🇳 'z' => Mandarin Chinese: pip install misaki[zh]
pipeline = KPipeline(lang_code='a') # <= make sure lang_code matches voice, reference above.
# This text is for demonstration purposes only, unseen during training
text = '''
The sky above the port was the color of television, tuned to a dead channel.
"It's not like I'm using," Case heard someone say, as he shouldered his way through the crowd around the door of the Chat. "It's like my body's developed this massive drug deficiency."
It was a Sprawl voice and a Sprawl joke. The Chatsubo was a bar for professional expatriates; you could drink there for a week and never hear two words in Japanese.
These were to have an enormous impact, not only because they were associated with Constantine, but also because, as in so many other areas, the decisions taken by Constantine (or in his name) were to have great significance for centuries to come. One of the main issues was the shape that Christian churches were to take, since there was not, apparently, a tradition of monumental church buildings when Constantine decided to help the Christian church build a series of truly spectacular structures. The main form that these churches took was that of the basilica, a multipurpose rectangular structure, based ultimately on the earlier Greek stoa, which could be found in most of the great cities of the empire. Christianity, unlike classical polytheism, needed a large interior space for the celebration of its religious services, and the basilica aptly filled that need. We naturally do not know the degree to which the emperor was involved in the design of new churches, but it is tempting to connect this with the secular basilica that Constantine completed in the Roman forum (the so-called Basilica of Maxentius) and the one he probably built in Trier, in connection with his residence in the city at a time when he was still caesar.
[Kokoro](/kˈOkəɹO/) is an open-weight TTS model with 82 million parameters. Despite its lightweight architecture, it delivers comparable quality to larger models while being significantly faster and more cost-efficient. With Apache-licensed weights, [Kokoro](/kˈOkəɹO/) can be deployed anywhere from production environments to personal projects.
'''
# 4️⃣ Generate, display, and save audio files in a loop.
generator = pipeline(
    text, voice='af_heart', # <= change voice here
    speed=1, split_pattern=r'\n+'
)
# Alternatively, load voice tensor directly:
# voice_tensor = torch.load('path/to/voice.pt', weights_only=True)
# generator = pipeline(
#     text, voice=voice_tensor,
#     speed=1, split_pattern=r'\n+'
# )
for i, (gs, ps, audio) in enumerate(generator):
    print(i)  # i => index
    print(gs) # gs => graphemes/text
    print(ps) # ps => phonemes
    display(Audio(data=audio, rate=24000, autoplay=i==0))
    sf.write(f'{i}.wav', audio, 24000) # save each audio file
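
The comment in the example asks you to keep lang_code and voice consistent. A small guard like the one below can catch a mismatch before any audio is generated. It is only a sketch under one assumption: that a voice name starts with the letter of its language code, as 'af_heart' does for 'a' and 'zf_001' does for 'z' in this article. The helper name check_voice is hypothetical and not part of the kokoro package.

```python
# Language codes as listed in the notebook comments above.
LANG_CODES = {
    "a": "American English",
    "b": "British English",
    "e": "Spanish",
    "f": "French",
    "h": "Hindi",
    "i": "Italian",
    "j": "Japanese",
    "p": "Brazilian Portuguese",
    "z": "Mandarin Chinese",
}


def check_voice(lang_code, voice):
    """Return the language name, or raise if the code or voice looks wrong."""
    if lang_code not in LANG_CODES:
        raise ValueError(f"unknown lang_code {lang_code!r}")
    # Assumption: the first letter of the voice name is its language code.
    if not voice.startswith(lang_code):
        raise ValueError(
            f"voice {voice!r} does not look like a {LANG_CODES[lang_code]} voice"
        )
    return LANG_CODES[lang_code]


print(check_voice("a", "af_heart"))  # prints: American English
```

Calling check_voice('a', 'zf_001') raises a ValueError, which is easier to diagnose than odd-sounding output.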

Installing on Windows

On Windows, the main requirement is installing espeak-ng. Download it here:

https://github.com/espeak-ng/espeak-ng/releases

Download the espeak-ng installer and run it.
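
After installing, a quick way to see whether the espeak-ng executable is reachable from Python is to look it up on PATH. This is a minimal sketch using only the standard library. The helper name find_espeak is made up for illustration, and a None result only means the executable is not on PATH, not necessarily that the installation failed.

```python
import shutil


def find_espeak():
    """Return the full path of the espeak-ng executable, or None if it is not on PATH."""
    return shutil.which("espeak-ng")


location = find_espeak()
print(location if location else "espeak-ng was not found on PATH")
```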

ONNX deployment

Install dependencies

!pip install -U kokoro-onnx soundfile 'misaki[zh]'

Download the model files

!wget https://github.com/thewh1teagle/kokoro-onnx/releases/download/model-files-v1.1/kokoro-v1.1-zh.onnx
!wget https://github.com/thewh1teagle/kokoro-onnx/releases/download/model-files-v1.1/voices-v1.1-zh.bin
!wget https://huggingface.co/hexgrad/Kokoro-82M-v1.1-zh/raw/main/config.json

Convert text to speech

import soundfile as sf
from IPython.display import display, Audio
from misaki import zh
from kokoro_onnx import Kokoro

# Misaki G2P with espeak-ng fallback
g2p = zh.ZHG2P(version="1.1")

text = "千里之行,始于足下。欢迎使用Kokoro TTS,高效生成自然语音!"
voice = "zf_001"
kokoro = Kokoro("kokoro-v1.1-zh.onnx", "voices-v1.1-zh.bin", vocab_config="config.json")
phonemes, _ = g2p(text)
samples, sample_rate = kokoro.create(phonemes, voice=voice, speed=1.0, is_phonemes=True)
sf.write("audio.wav", samples, sample_rate)
print("Created audio.wav")
# Play the saved file in the notebook
display(Audio(filename="audio.wav", autoplay=True))

Inference is fairly fast.

However, English words embedded in Chinese text are not read aloud, so the result is still somewhat weaker.
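
One way to work around this is to split mixed text into Latin-letter runs and everything else, then synthesize each part with a voice for its language. The sketch below covers only the splitting step and is an illustration, not something Kokoro or misaki provides: the helper name split_by_script is hypothetical, anything that is not a Latin-letter run (including punctuation and digits on their own) is labeled 'zh', and routing each segment to a pipeline is left to the reader.

```python
import re

# A run that starts with a Latin letter and may contain letters, digits,
# spaces, apostrophes, and hyphens, ending on a letter or digit.
_LATIN_RUN = re.compile(r"[A-Za-z](?:[A-Za-z0-9' -]*[A-Za-z0-9])?")


def split_by_script(text):
    """Split text into ('en', run) and ('zh', run) segments, in original order."""
    segments = []
    pos = 0
    for match in _LATIN_RUN.finditer(text):
        if match.start() > pos:
            segments.append(("zh", text[pos:match.start()]))
        segments.append(("en", match.group()))
        pos = match.end()
    if pos < len(text):
        segments.append(("zh", text[pos:]))
    return segments


for lang, run in split_by_script("欢迎使用Kokoro TTS,高效生成自然语音!"):
    print(lang, run)
# prints:
# zh 欢迎使用
# en Kokoro TTS
# zh ,高效生成自然语音!
```

Joining the segments back together reproduces the input exactly, so no text is lost in the split.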

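Summary

Chinese quality is relatively poor; English quality is passable.

That said, Kokoro has only 82M parameters, so this level of quality is already quite good!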
