v0.6.0 · 标签 · PaddlePaddle / Serving

v0.6.0

Paddle Serving v0.6.0 Release note:

新特性：
- 集成Paddle 2.1 inference, #1221
- 支持fp16和int8的低精度推理, #1130, #1236
- 通过Kubernetes部署Serving服务, #1139, #1184, #1193
- 新增安全网关与Serving协同部署, #1235
- 支持X86 + XPU环境部署Serving, #1080
功能增强：
- Python合并paddle_serving_server和paddle_serving_server_gpu成统一服务, #1082
- Pipeline增加mini-batch推理, #1186
- Pipeline支持日志切割, #1238
- Pipeline优化数据传入eval处理，增加channel的跟踪日志, #1209
- C++ Serving重构预测库调用方法，#1080
- C++ Serving支持多模型线性组合，#1124
- C++ Serving资源管理与优化, #1143
- C++ Serving接口增加String类型输入, #1124
- C++ Serving优化数据组装方法，使用memcpy替换循环拷贝, #1124
- C++ Serving编译选型增加GDB开关, #1124
- 增加Benchmark脚本，更新GPU benchmark数据, #1197, #1175
文档升级：
- 新增 doc/PADDLE_SERVING_ON_KUBERNETES.md
- 新增 doc/LOD.md
- 新增 doc/LOD_CN.md
- 新增 doc/PROCESS_DATA.md
- 修改 doc/PIPELINE_SERVING.md
- 修改 doc/PIPELINE_SERVING_CN.md
- 修改 doc/CREATING.md
- 修改 doc/SAVE.md
- 修改 doc/SAVE_CN.md
- 修改 doc/TENSOR_RT.md
- 修改 doc/TENSOR_RT_CN.md
- 修改 doc/MULTI_SERVICE_ON_ONE_GPU_CN.md
- 修改 doc/ENCRYPTION.md
- 修改 doc/ENCRYPTION_CN.md
- 修改 doc/DESIGN_DOC.md
- 修改 doc/DESIGN_DOC_CN.md
- 修改 doc/DOCKER_IMAGES.md
- 修改 doc/DOCKER_IMAGES_CN.md
- 修改 doc/LATEST_PACKAGES.md
- 修改 doc/COMPILE.md
- 修改 doc/COMPILE_CN.md
- 修改 doc/BERT_10_MINS.md
- 修改 doc/BERT_10_MINS_CN.md
- 修改 doc/BAIDU_KUNLUN_XPU_SERVING.md
- 修改 doc/BAIDU_KUNLUN_XPU_SERVING_CN.md
- 修改 README.md
- 修改 README_CN.md
Demo升级：
- 新增 python/python/examples/low_precision/resnet50
- 新增 python/examples/xpu/bert
- 新增python/examples/xpu/ernie
- 新增 python/examples/xpu/vgg19
- 新增 python/examples/pipeline/PaddleDetection/faster_rcnn
- 新增 python/examples/pipeline/PaddleDetection/ppyolo_mbv3
- 新增 python/examples/pipeline/PaddleDetection/yolov3
- 新增 python/examples/pipeline/PaddleClas/DarkNet53
- 新增 python/examples/pipeline/PaddleClas/HRNet_W18_C
- 新增 python/examples/pipeline/PaddleClas/MobileNetV1
- 新增 python/examples/pipeline/PaddleClas/MobileNetV2
- 新增 python/examples/pipeline/PaddleClas/MobileNetV3_large_x1_0
- 新增 python/examples/pipeline/PaddleClas/ResNeXt101_vd_64x4d
- 新增 python/examples/pipeline/PaddleClas/ResNet50_vd
- 新增 python/examples/pipeline/PaddleClas/ResNet50_vd_FPGM
- 新增 python/examples/pipeline/PaddleClas/ResNet50_vd_KL
- 新增 python/examples/pipeline/PaddleClas/ResNet50_vd_PACT
- 新增 python/examples/pipeline/PaddleClas/ResNet_V2_50
- 新增 python/examples/pipeline/PaddleClas/ShuffleNetV2_x1_0
- 新增 python/examples/pipeline/bert
- 新增 python/examples/ocr/ocr_cpp_client.py
- 修改 python/examples/bert [benchmark]
- 修改 python/examples/pipeline/ocr[benchmark]
docker升级：
- 新增docker运行镜像(CPU, cuda10.1, cuda10.2, cuda11) (Py36, Py37, Py38)
- 新增Cuda 11环境的开发docker镜像
- 新增Kubernetes Demo镜像
Bug修复：
- 修复不规范代码命名，统一infer. h文件和paddle_engine. h中模型参数的命名规范. #1136
- 修复C++部分框架被绕过的错误. #1124
- 修复py35下Json.load函数异常的错误.#1124
- 修复ssd_vgg16_300_240e_voc示例中feed_var缺少参数'im_shape'导致的预测结果异常的错误.#1180
- 修复多个GRPC因模型路径变更导致的错误.#1147
- 修复C++log日志打印异常的错误. #1154
- 修复WebService漏传Thread参数的错误. #1136
- 修复golang引入的编译错误. #1101
- 修复Java gRPC模型下的错误. #1215

For English

New Features:
- Integrated Paddle 2.1 Inference, #1221
- Support low-precision inference of fp16 and int8, #1130, #1236
- Deploy Serving service through Kubernetes, #1139, #1184, #1193
- New Security gateway, #1235
- Serving deployment in X86 + XPU environment, #1080
Feature Improvements:
- Merge paddle_serving_server and paddle_serving_server_gpu into a unified paddle_serving_server, #1082
- Pipeline supports Mini-batch inference, #1186
- Pipeline supports log file rotating, #1238
- Pipeline optimizes data transfer to eval for processing, and increases channel tracking logs, #1209
- C++ Serving reconstruction prediction engine call method, #1080
- C++ Serving supports linear combination of multiple models, #1124
- C++ Serving interface adds direct input of String type, #1124
- C++ Serving resource management and optimization, #1143
- C++ Serving performance optimization, changing for loop copy to function memcpy, #1124
- C++ Serving add GDB compilation options, #1124
- Add Benchmark script and update GPU benchmark data, #1197, #1175
Document Updates:
- Add doc/PADDLE_SERVING_ON_KUBERNETES.md
- Add doc/LOD.md
- Add doc/LOD_CN.md
- Add doc/PROCESS_DATA.md
- Modify doc/PIPELINE_SERVING.md
- Modify doc/PIPELINE_SERVING_CN.md
- Modify doc/CREATING.md
- Modify doc/SAVE.md
- Modify doc/SAVE_CN.md
- Modify doc/TENSOR_RT.md
- Modify doc/TENSOR_RT_CN.md
- Modify doc/MULTI_SERVICE_ON_ONE_GPU_CN.md
- Modify doc/ENCRYPTION.md
- Modify doc/ENCRYPTION_CN.md
- Modify doc/DESIGN_DOC.md
- Modify doc/DESIGN_DOC_CN.md
- Modify doc/DOCKER_IMAGES.md
- Modify doc/DOCKER_IMAGES_CN.md
- Modify doc/LATEST_PACKAGES.md
- Modify doc/COMPILE.md
- Modify doc/COMPILE_CN.md
- Modify doc/BERT_10_MINS.md
- Modify doc/BERT_10_MINS_CN.md
- Modify doc/BAIDU_KUNLUN_XPU_SERVING.md
- Modify doc/BAIDU_KUNLUN_XPU_SERVING_CN.md
- Modify README.md
- Modify README_CN.md
Demo Updates:
- Add python/python/examples/low_precision/resnet50
- Add python/examples/xpu/bert
- Add python/examples/xpu/ernie
- Add python/examples/xpu/vgg19
- Add python/examples/pipeline/PaddleDetection/faster_rcnn
- Add python/examples/pipeline/PaddleDetection/ppyolo_mbv3
- Add python/examples/pipeline/PaddleDetection/yolov3
- Add python/examples/pipeline/PaddleClas/DarkNet53
- Add python/examples/pipeline/PaddleClas/HRNet_W18_C
- Add python/examples/pipeline/PaddleClas/MobileNetV1
- Add python/examples/pipeline/PaddleClas/MobileNetV2
- Add python/examples/pipeline/PaddleClas/MobileNetV3_large_x1_0
- Add python/examples/pipeline/PaddleClas/ResNeXt101_vd_64x4d
- Add python/examples/pipeline/PaddleClas/ResNet50_vd
- Add python/examples/pipeline/PaddleClas/ResNet50_vd_FPGM
- Add python/examples/pipeline/PaddleClas/ResNet50_vd_KL
- Add python/examples/pipeline/PaddleClas/ResNet50_vd_PACT
- Add python/examples/pipeline/PaddleClas/ResNet_V2_50
- Add python/examples/pipeline/PaddleClas/ShuffleNetV2_x1_0
- Add python/examples/pipeline/bert
- Add python/examples/ocr/ocr_cpp_client.py
- Modify python/examples/bert [benchmark]
- Modify python/examples/pipeline/ocr[benchmark]
Docker Updates:
- Add runtime dockers (CPU, CUDA10.1, CUDA10.2, CUDA11) (Py36, Py37, Py38)
- Add CUDA 11 develop level docker images
- Add kubernetes demo images
Bug Fixes:
- Fixed the problem of irregular naming, #1136
- Fixed the problem that part of C + + multithreading and framework were bypassed due to the adaptation of paddle-inference2.0. #1124
- Fixed the problem of JSON. Load in py35.#1124
- Fixed missing a feed_var: 'im_shape' in the test-client request, resulting in no prediction result.#1180
- Fixed multiple bugs in gRPC.#1147
- Fixed the read OP print log logic bug in C + +. #1154
- Fixed the WebService missed a thread parameter, unified the template name in infer. h and paddle_engine. h. #1136
- Fixed compile errors of golang import. #1101
- Fixed Java gRPC bugs, #1215