installation_en.md 4.3 KB
Newer Older
X
xxxpsyduck 已提交
1
## QUICK INSTALLATION
K
Khanh Tran 已提交
2 3 4 5

After testing, paddleocr can run on glibc 2.23. You can also test other glibc versions or install glic 2.23 for the best compatibility.

PaddleOCR working environment:
6
- PaddlePaddle1.8+, Recommend PaddlePaddle 2.0.0.beta
D
dyning 已提交
7
- python3.7
K
Khanh Tran 已提交
8
- glibc 2.23
G
grasswolfs 已提交
9

littletomatodonkey's avatar
littletomatodonkey 已提交
10
It is recommended to use the docker provided by us to run PaddleOCR, please refer to the use of docker [link](https://www.runoob.com/docker/docker-tutorial.html/).
K
Khanh Tran 已提交
11

G
grasswolfs 已提交
12 13
*If you want to directly run the prediction code on mac or windows, you can start from step 2.*

M
MissPenguin 已提交
14
**1. (Recommended) Prepare a docker environment. The first time you use this image, it will be downloaded automatically. Please be patient.**
K
Khanh Tran 已提交
15 16 17 18 19 20
```
# Switch to the working directory
cd /home/Projects
# You need to create a docker container for the first run, and do not need to run the current command when you run it again
# Create a docker container named ppocr and map the current directory to the /paddle directory of the container

W
weishengyu 已提交
21
# If using CPU, use docker instead of nvidia-docker to create docker
K
Khanh Tran 已提交
22 23
sudo docker run --name ppocr -v $PWD:/paddle --network=host -it hub.baidubce.com/paddlepaddle/paddle:latest-gpu-cuda9.0-cudnn7-dev /bin/bash
```
L
LDOUBLEV 已提交
24
If using CUDA9, please run the following command to create a container:
K
Khanh Tran 已提交
25 26 27
```
sudo nvidia-docker run --name ppocr -v $PWD:/paddle --network=host -it hub.baidubce.com/paddlepaddle/paddle:latest-gpu-cuda9.0-cudnn7-dev /bin/bash
```
L
LDOUBLEV 已提交
28
If using CUDA10, please run the following command to create a container:
K
Khanh Tran 已提交
29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51
```
sudo nvidia-docker run --name ppocr -v $PWD:/paddle --network=host -it hub.baidubce.com/paddlepaddle/paddle:latest-gpu-cuda10.0-cudnn7-dev /bin/bash
```
You can also visit [DockerHub](https://hub.docker.com/r/paddlepaddle/paddle/tags/) to get the image that fits your machine.
```
# ctrl+P+Q to exit docker, to re-enter docker using the following command:
sudo docker container exec -it ppocr /bin/bash
```

Note: If the docker pull is too slow, you can download and load the docker image manually according to the following steps. Take cuda9 docker for example, you only need to change cuda9 to cuda10 to use cuda10 docker:
```
# Download the CUDA9 docker compressed file and unzip it
wget https://paddleocr.bj.bcebos.com/docker/docker_pdocr_cuda9.tar.gz
# To reduce download time, the uploaded docker image is compressed and needs to be decompressed
tar zxf docker_pdocr_cuda9.tar.gz
# Create image
docker load < docker_pdocr_cuda9.tar
# After completing the above steps, check whether the downloaded image is loaded through docker images
docker images
# If you have the following output after executing docker images, you can follow step 1 to create a docker environment.
hub.baidubce.com/paddlepaddle/paddle   latest-gpu-cuda9.0-cudnn7-dev    f56310dcc829
```

W
weishengyu 已提交
52
**2. Install PaddlePaddle v2.0**
K
Khanh Tran 已提交
53
```
W
weishengyu 已提交
54
python3 -m pip install --upgrade pip
K
Khanh Tran 已提交
55

56
# If you have cuda9 or cuda10 installed on your machine, please run the following command to install
T
tink2123 已提交
57
python3 -m pip install paddlepaddle-gpu==2.0.0b0 -i https://mirror.baidu.com/pypi/simple
G
grasswolfs 已提交
58 59

# If you only have cpu on your machine, please run the following command to install
T
tink2123 已提交
60
python3 -m pip install paddlepaddle==2.0.0b0 -i https://mirror.baidu.com/pypi/simple
K
Khanh Tran 已提交
61 62 63 64
```
For more software version requirements, please refer to the instructions in [Installation Document](https://www.paddlepaddle.org.cn/install/quick) for operation.


M
MissPenguin 已提交
65
**3. Clone PaddleOCR repo**
K
Khanh Tran 已提交
66 67 68 69 70 71 72 73 74 75 76
```
# Recommend
git clone https://github.com/PaddlePaddle/PaddleOCR

# If you cannot pull successfully due to network problems, you can also choose to use the code hosting on the cloud:

git clone https://gitee.com/paddlepaddle/PaddleOCR

# Note: The cloud-hosting code may not be able to synchronize the update with this GitHub project in real time. There might be a delay of 3-5 days. Please give priority to the recommended method.
```

M
MissPenguin 已提交
77
**4. Install third-party libraries**
K
Khanh Tran 已提交
78 79
```
cd PaddleOCR
M
MissPenguin 已提交
80
python3 -m pip install -r requirements.txt
K
Khanh Tran 已提交
81
```
L
LDOUBLEV 已提交
82 83 84 85 86 87

If you getting this error `OSError: [WinError 126] The specified module could not be found` when you install shapely on windows.

Please try to download Shapely whl file using [http://www.lfd.uci.edu/~gohlke/pythonlibs/#shapely](http://www.lfd.uci.edu/~gohlke/pythonlibs/#shapely).

Reference: [Solve shapely installation on windows](https://stackoverflow.com/questions/44398265/install-shapely-oserror-winerror-126-the-specified-module-could-not-be-found)