当前位置：首页 > news >正文

超简单Translation翻译模型部署

news 2025/7/16 4:00:46

Helsinki-NLP/opus-mt-{en}-{zh}系列翻译模型可以实现200多种语言翻译，Helsinki-NLP/opus-mt-en-zh是其中英互译模型。由于项目需要，在本地进行搭建，并记录下搭建过程，方便后人。

1. 基本硬件环境

CPU：N年前的 Intel(R) Core(TM) i5-3470 CPU @ 3.20GHz， 32G内存
GPU：N年前的 NVIDIA GeForce GTX 1080 Ti，11G显存

2. 基本软件环境

操作系统：Ubuntu20.04 LTS，是为了跟老旧的硬件相匹配，专门降级到20.04的，更高版本存在各种软件兼容性问题，等有钱了全部换新！！！
CUDA：cuda_12.0.0_525.60.13_linux.run，虽然能支持到12.2甚至12.4，保险起见还是选择了12.0
Cudnn：libcudnn8_8.8.0.121-1+cuda12.0_amd64.deb，对应CUDA版本
NCCL：libnccl2_2.19.3-1+cuda12.0_amd64.deb对应CUDA版本，多显卡需要
miniconda：Miniconda3-py312_24.9.2-0-Linux-x86_64.sh

3. 克隆fishspeech代码并安装本地依赖包

git clone https://gitclone.com/github.com/fishaudio/fish-speech.gitsudo apt-get install ffmpeg libsm6 libxext6 portaudio19-dev -y

4. 创建虚拟环境

conda create -n huggingface python==3.10 -y
conda activate huggingface

5. conda安装基础包

conda install -c pytorch -c nvidia -c conda-forge pytorch torchvision pytorch-cuda=11.8

6. 安装huggingface组件，transformers包

pip install transformers -i https://pypi.tuna.tsinghua.edu.cn/simple
pip install -U huggingface_hub -i https://pypi.tuna.tsinghua.edu.cn/simple设置环境变量，用于加速
HF_ENDPOINT=https://hf-mirror.com

7. 以python脚本方式运行

# Load model directly
from transformers import AutoTokenizer, AutoModelForSeq2SeqLMtokenizer = AutoTokenizer.from_pretrained("Helsinki-NLP/opus-mt-en-zh")
model = AutoModelForSeq2SeqLM.from_pretrained("Helsinki-NLP/opus-mt-en-zh")def translate(text):inputs = tokenizer(text, return_tensors="pt", padding=True)translated = model.generate(**inputs)return [tokenizer.decode(t, skip_special_tokens=True) for t in translated]print(tokenizer.supported_language_codes)
text = ">>cmn_Hans<< Due to a bug fix in https://github.com/huggingface/transformers/pull/28687 transcription using a multilingual Whisper will default to language detection followed by transcription instead of translation to English.This might be a breaking change for your use case. If you want to instead always translate your audio to English, make sure to pass `language='en'`. The attention mask is not set and cannot be inferred from input because pad token is same as eos token. As a consequence, you may observe unexpected behavior. Please pass your input's `attention_mask` to obtain reliable results."
translated_text = translate(text)
print(translated_text)

首次运行会报错，因为缺少两个依赖包，安装即可

pip install sentencepiece sacremoses -i https://pypi.tuna.tsinghua.edu.cn/simple

8. 以FastAPI方式运行

# 安装fastapi ubicorn组件
pip install fastapi uvicorn -i https://pypi.tuna.tsinghua.edu.cn/simple

服务脚本如下：

# Load model directly
from fastapi import FastAPI
from pydantic import BaseModel
from transformers import AutoTokenizer, AutoModelForSeq2SeqLMapp = FastAPI()tokenizer = AutoTokenizer.from_pretrained("Helsinki-NLP/opus-mt-en-zh")
model = AutoModelForSeq2SeqLM.from_pretrained("Helsinki-NLP/opus-mt-en-zh")def translate(text):inputs = tokenizer(text, return_tensors="pt", padding=True)translated = model.generate(**inputs)return [tokenizer.decode(t, skip_special_tokens=True) for t in translated]# print(tokenizer.supported_language_codes)
# text = ">>cmn_Hans<< Due to a bug fix in https://github.com/huggingface/transformers/pull/28687 transcription using a multilingual Whisper will default to language detection followed by transcription instead of translation to English.This might be a breaking change for your use case. If you want to instead always translate your audio to English, make sure to pass `language='en'`. The attention mask is not set and cannot be inferred from input because pad token is same as eos token. As a consequence, you may observe unexpected behavior. Please pass your input's `attention_mask` to obtain reliable results."
# translated_text = translate(text)
# print(translated_text)class TextRequest(BaseModel):text: str@app.post("/predict")
async def predict(request: TextRequest):# 预处理和预测translated_text = translate(request.text)# 返回结果return {"text": request.text,"predictions": translated_text}

运行服务

uvicorn fastapi_app:app --host 0.0.0.0 --port 8000

查看全文

http://www.dtcms.com/a/212010.html

信奥赛CSP小学五年级动态规划入门

【docker】--compose介绍

高级特性实战：死信队列、延迟队列与优先级队列（二）

Go 语言基础 2 Func，流程控制

【linux篇】系统世界跳跃的音符：指令

嵌入式STM32学习—— 定时器中断（应用-利用定时器中断实现LED亮一秒灭一秒）

并发编程知识点

MyBatis实战指南（二）如何实现小鸟图标与导入Teacher数据库表实战

位图与布隆过滤器

RabbitMQ核心机制——延迟队列

win11 禁用/恢复内置笔记本键盘（保证管用）

【公式】MathType公式右编号对齐

MySQL连接错误解决方案：Can‘t connect to MySQL server on ‘localhost‘ (10038)

leetcode2081. k 镜像数字的和-hard

华为OD机试真题——仿LISP运算（2025B卷：200分）Java/python/JavaScript/C/C++/GO最佳实现

【短距离通信】【WiFi】WiFi7起源和应用场景介绍

MySQL 定时逻辑备份

CI/CD (持续集成/持续部署) GitHub Actions 自动构建

GitLab-CI将项目Wiki自动部署到文档中心

卷积神经网络(CNN)深度讲解

【HarmonyOS5】DevEco Studio 预览器与模拟工具详解

基于文本挖掘与情感分析的B站《唐探1900》弹幕研究

使用Cursor生成需求文档+UI设计图

【微服务】SpringBoot 对接飞书审批流程使用详解

Python GDAL 库离线安装

NTFS0x90属性和0xa0属性和0xb0属性的一一对应关系是index_entry中的index_node中VCN和runlist和bitmap

Mybatis框架的构建（IDEA）

【C++】21. 红黑树的实现

JWT与布隆过滤器结合使用指南

C++编程单例模式详细解释---模拟一个网络配置管理器，负责管理和分发网络连接参数