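Installing tesseract5 on Amazon Linux 2023 from the Fedora 36 repository
Problem
The EPEL repository cannot be used on Amazon Linux 2023, so the only way to install tesseract through the package manager is to add the Fedora 36 repository manually.
Configuring the Fedora 36 repository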
sudo vim /etc/yum.repos.d/fedora.repo
With the following contents:
# /etc/yum.repos.d/fedora.repo
[fedora]
name=Fedora 36 - $basearch
#baseurl=http://download.example/pub/fedora/linux/releases/36/Everything/$basearch/os/
metalink=https://mirrors.fedoraproject.org/metalink?repo=fedora-36&arch=$basearch
enabled=1
metadata_expire=7d
repo_gpgcheck=0
type=rpm
gpgcheck=1
gpgkey=https://src.fedoraproject.org/rpms/fedora-repos/raw/f36/f/RPM-GPG-KEY-fedora-36-primary
skip_if_unavailable=False
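If you would rather not edit the file interactively, the same contents can be written from a script. The following is only a sketch: `write_fedora_repo` is a hypothetical helper name, it omits the commented-out `baseurl` line, and it writes to whatever path you pass in, so you can inspect the result before copying it into place.

```shell
# write_fedora_repo is a hypothetical helper, not part of dnf.
# It writes the Fedora 36 repo definition to the path given as $1.
write_fedora_repo() {
    # The quoted 'EOF' keeps $basearch literal, so dnf expands it, not the shell.
    cat > "$1" <<'EOF'
[fedora]
name=Fedora 36 - $basearch
metalink=https://mirrors.fedoraproject.org/metalink?repo=fedora-36&arch=$basearch
enabled=1
metadata_expire=7d
repo_gpgcheck=0
type=rpm
gpgcheck=1
gpgkey=https://src.fedoraproject.org/rpms/fedora-repos/raw/f36/f/RPM-GPG-KEY-fedora-36-primary
skip_if_unavailable=False
EOF
}

# Example: write to a temporary file, then install it with root privileges.
# tmp=$(mktemp) && write_fedora_repo "$tmp" \
#   && sudo install -m 644 "$tmp" /etc/yum.repos.d/fedora.repo
```

Writing to a temporary file first keeps the function free of `sudo`, so it can be checked without touching the system configuration.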
Test:
[ssm-user@ip-xxx-xx-xx-xx ~]$ dnf repoquery --nvr tesseract
Amazon Linux 2023 repository 70 MB/s | 49 MB 00:00
Amazon Linux 2023 Kernel Livepatch repository 232 kB/s | 28 kB 00:00
Fedora 36 - x86_64 22 MB/s | 69 MB 00:03
tesseract-5.0.1-5.fc36
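To confirm from a script that the repository really offers version 5, you can parse the name-version-release string that `dnf repoquery --nvr` prints. This is a minimal sketch; `nvr_major` is a hypothetical helper, and it assumes the string has the usual `name-version-release` shape seen in the output above.

```shell
# nvr_major is a hypothetical helper: it extracts the major version from a
# name-version-release string such as "tesseract-5.0.1-5.fc36".
nvr_major() {
    nvr=$1
    rest=${nvr%-*}          # drop the release part:  tesseract-5.0.1
    version=${rest##*-}     # drop the package name:  5.0.1
    printf '%s\n' "${version%%.*}"
}

# Example (requires dnf and the repository configured above):
# nvr_major "$(dnf -q repoquery --nvr tesseract | tail -n 1)"
```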
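Installing tesseract5
# Install the main program
sudo dnf install tesseract
# Install the English language pack
sudo dnf install tesseract-langpack-eng
# Install the Simplified Chinese language packs (horizontal and vertical text)
sudo dnf install tesseract-langpack-chi_sim
sudo dnf install tesseract-langpack-chi_sim_vert
You can list the tesseract-related packages with: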
sudo dnf search tesseract
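The separate install commands above can also be combined into one transaction. The sketch below assumes the `tesseract-langpack-<lang>` naming used by the Fedora packages shown earlier; `langpack_packages` is a hypothetical helper that only builds the package names from language codes.

```shell
# langpack_packages is a hypothetical helper: it prints one
# tesseract-langpack-<lang> package name per language code given.
langpack_packages() {
    for lang in "$@"; do
        printf 'tesseract-langpack-%s\n' "$lang"
    done
}

# Example: install the main program and all language packs in one go.
# The command substitution is left unquoted on purpose so that each
# package name becomes a separate argument.
# sudo dnf install -y tesseract $(langpack_packages eng chi_sim chi_sim_vert)
```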
Test
[ssm-user@ip- ~]$ tesseract 0.png output -l chi_sim+eng --psm 3 --oem 3
Estimating resolution as 475
Detected 17 diacritics
[ssm-user@ip- ~]$ cat output.txt
15:12 all 5G a)
《 详情BR BR#2025xxxxxxx xxXXXXXXXXXXXXXXXXXxddddiNDe1分钟前 & 冰发表评论: KY
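Before running OCR with `-l chi_sim+eng`, it can help to confirm that both language packs are actually visible to tesseract. The sketch below is an assumption-laden helper rather than part of the original steps: `check_langs` is a hypothetical name, and it expects the output of `tesseract --list-langs` on standard input, one language per line after the header.

```shell
# check_langs is a hypothetical helper: it reads the output of
# `tesseract --list-langs` on stdin and checks that every language
# passed as an argument appears on a line of its own.
check_langs() {
    available=$(cat)
    for lang in "$@"; do
        # -x matches whole lines only, so chi_sim does not match chi_sim_vert.
        if ! printf '%s\n' "$available" | grep -qxF -e "$lang"; then
            echo "missing language: $lang" >&2
            return 1
        fi
    done
}

# Example (requires tesseract to be installed):
# tesseract --list-langs | check_langs eng chi_sim && echo "languages OK"
```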
References
- Can we install the EPEL repository on Amazon Linux 2023?
- Install tesseract-ocr on Amazon Linux 2023
- Extra Packages for Enterprise Linux (EPEL)
