doc update

60193654 · zhangjun · d4dab9af · 60193654
隐藏空白更改
内联并排

Showing with 103 addition and 1 deletion

doc/BAIDU_KUNLUN_XPU_SERVING_CN.md doc/BAIDU_KUNLUN_XPU_SERVING_CN.md +103 -1

未找到文件。
--- a/doc/BAIDU_KUNLUN_XPU_SERVING_CN.md
+++ b/doc/BAIDU_KUNLUN_XPU_SERVING_CN.md
-# Serving using Baidu Kunlun XPU
+# Paddle Serving使用百度昆仑芯片部署
+Paddle Serving支持使用百度昆仑芯片进行预测部署。目前试验性支持在百度昆仑芯片和arm服务器（如飞腾 FT-2000+/64）上进行部署，后续完善对其他异构硬件服务器部署能力。
+
+# 编译、安装
+基本环境配置可参考[该文档](COMPILE_CN.md）进行配置。
+## 编译
+* 编译server部分
+```
+cd Serving
+mkdir -p server-build-arm && cd server-build-arm
+
+cmake -DPYTHON_INCLUDE_DIR=/usr/include/python3.7m/ \
+    -DPYTHON_LIBRARIES=/usr/lib64/libpython3.7m.so \
+    -DPYTHON_EXECUTABLE=/usr/bin/python \
+    -DWITH_PYTHON=ON \
+    -DWITH_LITE=ON \
+    -DWITH_XPU=ON \
+    -DSERVER=ON ..
+make -j10
+```
+可以执行`make install`把目标产出放在`./output`目录下，cmake阶段需添加`-DCMAKE_INSTALL_PREFIX=./output`选项来指定存放路径。
+* 编译client部分
+```
+mkdir -p client-build-arm && cd client-build-arm
+
+cmake -DPYTHON_INCLUDE_DIR=/usr/include/python3.7m/ \
+    -DPYTHON_LIBRARIES=/usr/lib64/libpython3.7m.so \
+    -DPYTHON_EXECUTABLE=/usr/bin/python \
+    -DWITH_PYTHON=ON \
+    -DWITH_LITE=ON \
+    -DWITH_XPU=ON \
+    -DCLIENT=ON ..
+
+make -j10
+```
+* 编译app部分
+```
+cd Serving 
+mkdir -p app-build-arm && cd app-build-arm
+
+cmake -DPYTHON_INCLUDE_DIR=/usr/include/python3.7m/ \
+    -DPYTHON_LIBRARIES=/usr/lib64/libpython3.7m.so \
+    -DPYTHON_EXECUTABLE=/usr/bin/python \
+    -DWITH_PYTHON=ON \
+    -DWITH_LITE=ON \
+    -DWITH_XPU=ON \
+    -DAPP=ON ..
+
+make -j10
+```
+## 安装wheel包
+以上编译步骤完成后，会在各自编译目录$build_dir/python/dist生成whl包，分别安装即可。例如server步骤，会在server-build-arm/python/dist目录下生成whl包, 使用命令```pip install -u xxx.whl```进行安装。
+
+# 请求参数说明
+为了支持arm+xpu服务部署，使用Paddle-Lite加速能力，请求时需使用以下参数。
+|参数|参数说明|备注|
+|:--|:--|:--|
+|use_lite|使用Paddle-Lite Engine|使用Paddle-Lite cpu预测能力|
+|use_xpu|使用Baidu Kunlun进行预测|该选项需要与use_lite配合使用|
+|ir_optim|开启Paddle-Lite计算图优化|详细见[Paddle-Lite](https://github.com/PaddlePaddle/Paddle-Lite)|
+# 部署使用示例
+## 下载模型
+```
+wget --no-check-certificate https://paddle-serving.bj.bcebos.com/uci_housing.tar.gz
+tar -xzf uci_housing.tar.gz
+```
+## 启动rpc服务
+主要有三种启动配置：
+* 使用arm cpu+xpu部署，使用Paddle-Lite xpu优化加速能力；
+* 单独使用arm cpu部署，使用Paddle-Lite优化加速能力；
+* 使用arm cpu部署，不使用Paddle-Lite加速。
+    
+推荐使用前两种部署方式。
+
+启动rpc服务，使用arm cpu+xpu部署，使用Paddle-Lite xpu优化加速能力
+```
+python3 -m paddle_serving_server_gpu.serve --model uci_housing_model --thread 6 --port 9292 --use_lite --use_xpu --ir_optim
+```
+启动rpc服务，使用arm cpu部署, 使用Paddle-Lite加速能力
+```
+python3 -m paddle_serving_server_gpu.serve --model uci_housing_model --thread 6 --port 9292 --use_lite --ir_optim
+```
+启动rpc服务，使用arm cpu部署, 不使用Paddle-Lite加速能力
+```
+python3 -m paddle_serving_server_gpu.serve --model uci_housing_model --thread 6 --port 9292
+```
+## client调用
+```
+from paddle_serving_client import Client
+import numpy as np
+client = Client()
+client.load_client_config("uci_housing_client/serving_client_conf.prototxt")
+client.connect(["127.0.0.1:9292"])
+data = [0.0137, -0.1136, 0.2553, -0.0692, 0.0582, -0.0727,
+        -0.1583, -0.0584, 0.6283, 0.4919, 0.1856, 0.0795, -0.0332]
+fetch_map = client.predict(feed={"x": np.array(data).reshape(1,13,1)}, fetch=["price"])
+print(fetch_map)
+```
+以下提供部分样例，其他模型可参照进行修改。
+|示例名称|示例链接|
+|:-----|:--|
+|fit_a_line|http://github.com|
+|resnet|http://github.com|