Paddle Serving v0.6.0 Release Notes:

  • 新特性:
    • 集成Paddle 2.1 inference, #1221
    • 支持fp16和int8的低精度推理, #1130, #1236
    • 通过Kubernetes部署Serving服务, #1139, #1184, #1193
    • 新增安全网关与Serving协同部署, #1235
    • 支持X86 + XPU环境部署Serving, #1080
  • 功能增强:
    • Python合并paddle_serving_server和paddle_serving_server_gpu成统一服务, #1082
    • Pipeline增加mini-batch推理, #1186
    • Pipeline支持日志切割, #1238
    • Pipeline优化数据传入eval处理,增加channel的跟踪日志, #1209
    • C++ Serving重构预测库调用方法,#1080
    • C++ Serving支持多模型线性组合,#1124
    • C++ Serving资源管理与优化, #1143
    • C++ Serving接口增加String类型输入, #1124
    • C++ Serving优化数据组装方法,使用memcpy替换循环拷贝, #1124
    • C++ Serving编译选项增加GDB开关, #1124
    • 增加Benchmark脚本,更新GPU benchmark数据, #1197, #1175
  • 文档升级:
    • 新增 doc/PADDLE_SERVING_ON_KUBERNETES.md
    • 新增 doc/LOD.md
    • 新增 doc/LOD_CN.md
    • 新增 doc/PROCESS_DATA.md
    • 修改 doc/PIPELINE_SERVING.md
    • 修改 doc/PIPELINE_SERVING_CN.md
    • 修改 doc/CREATING.md
    • 修改 doc/SAVE.md
    • 修改 doc/SAVE_CN.md
    • 修改 doc/TENSOR_RT.md
    • 修改 doc/TENSOR_RT_CN.md
    • 修改 doc/MULTI_SERVICE_ON_ONE_GPU_CN.md
    • 修改 doc/ENCRYPTION.md
    • 修改 doc/ENCRYPTION_CN.md
    • 修改 doc/DESIGN_DOC.md
    • 修改 doc/DESIGN_DOC_CN.md
    • 修改 doc/DOCKER_IMAGES.md
    • 修改 doc/DOCKER_IMAGES_CN.md
    • 修改 doc/LATEST_PACKAGES.md
    • 修改 doc/COMPILE.md
    • 修改 doc/COMPILE_CN.md
    • 修改 doc/BERT_10_MINS.md
    • 修改 doc/BERT_10_MINS_CN.md
    • 修改 doc/BAIDU_KUNLUN_XPU_SERVING.md
    • 修改 doc/BAIDU_KUNLUN_XPU_SERVING_CN.md
    • 修改 README.md
    • 修改 README_CN.md
  • Demo升级:
    • 新增 python/examples/low_precision/resnet50
    • 新增 python/examples/xpu/bert
    • 新增 python/examples/xpu/ernie
    • 新增 python/examples/xpu/vgg19
    • 新增 python/examples/pipeline/PaddleDetection/faster_rcnn
    • 新增 python/examples/pipeline/PaddleDetection/ppyolo_mbv3
    • 新增 python/examples/pipeline/PaddleDetection/yolov3
    • 新增 python/examples/pipeline/PaddleClas/DarkNet53
    • 新增 python/examples/pipeline/PaddleClas/HRNet_W18_C
    • 新增 python/examples/pipeline/PaddleClas/MobileNetV1
    • 新增 python/examples/pipeline/PaddleClas/MobileNetV2
    • 新增 python/examples/pipeline/PaddleClas/MobileNetV3_large_x1_0
    • 新增 python/examples/pipeline/PaddleClas/ResNeXt101_vd_64x4d
    • 新增 python/examples/pipeline/PaddleClas/ResNet50_vd
    • 新增 python/examples/pipeline/PaddleClas/ResNet50_vd_FPGM
    • 新增 python/examples/pipeline/PaddleClas/ResNet50_vd_KL
    • 新增 python/examples/pipeline/PaddleClas/ResNet50_vd_PACT
    • 新增 python/examples/pipeline/PaddleClas/ResNet_V2_50
    • 新增 python/examples/pipeline/PaddleClas/ShuffleNetV2_x1_0
    • 新增 python/examples/pipeline/bert
    • 新增 python/examples/ocr/ocr_cpp_client.py
    • 修改 python/examples/bert [benchmark]
    • 修改 python/examples/pipeline/ocr [benchmark]
  • docker升级:
    • 新增docker运行镜像(CPU, cuda10.1, cuda10.2, cuda11) (Py36, Py37, Py38)
    • 新增Cuda 11环境的开发docker镜像
    • 新增Kubernetes Demo镜像
  • Bug修复:
    • 修复不规范代码命名,统一infer.h文件和paddle_engine.h中模型参数的命名规范, #1136
    • 修复C++部分框架被绕过的错误, #1124
    • 修复py35下json.load函数异常的错误, #1124
    • 修复ssd_vgg16_300_240e_voc示例中feed_var缺少参数'im_shape'导致的预测结果异常的错误, #1180
    • 修复多个gRPC因模型路径变更导致的错误, #1147
    • 修复C++ log日志打印异常的错误, #1154
    • 修复WebService漏传Thread参数的错误, #1136
    • 修复golang引入的编译错误, #1101
    • 修复Java gRPC模型下的错误, #1215

English version:

  • New Features:
    • Integrate Paddle 2.1 Inference, #1221
    • Support low-precision inference with fp16 and int8, #1130, #1236
    • Support deploying Serving services on Kubernetes, #1139, #1184, #1193
    • Add a security gateway that is deployed together with Serving, #1235
    • Support deploying Serving in X86 + XPU environments, #1080
  • Feature Improvements:
    • Merge the Python packages paddle_serving_server and paddle_serving_server_gpu into a unified paddle_serving_server, #1082
    • Pipeline supports mini-batch inference, #1186
    • Pipeline supports log file rotation, #1238
    • Pipeline optimizes how data is passed to eval for processing and adds channel tracing logs, #1209
    • C++ Serving refactors how the prediction library is called, #1080
    • C++ Serving supports linear combination of multiple models, #1124
    • C++ Serving interface adds String-type input, #1124
    • C++ Serving resource management and optimization, #1143
    • C++ Serving optimizes data assembly by replacing for-loop copies with memcpy, #1124
    • C++ Serving adds a GDB switch to the compilation options, #1124
    • Add Benchmark scripts and update GPU benchmark data, #1197, #1175
  • Document Updates:
    • Add doc/PADDLE_SERVING_ON_KUBERNETES.md
    • Add doc/LOD.md
    • Add doc/LOD_CN.md
    • Add doc/PROCESS_DATA.md
    • Modify doc/PIPELINE_SERVING.md
    • Modify doc/PIPELINE_SERVING_CN.md
    • Modify doc/CREATING.md
    • Modify doc/SAVE.md
    • Modify doc/SAVE_CN.md
    • Modify doc/TENSOR_RT.md
    • Modify doc/TENSOR_RT_CN.md
    • Modify doc/MULTI_SERVICE_ON_ONE_GPU_CN.md
    • Modify doc/ENCRYPTION.md
    • Modify doc/ENCRYPTION_CN.md
    • Modify doc/DESIGN_DOC.md
    • Modify doc/DESIGN_DOC_CN.md
    • Modify doc/DOCKER_IMAGES.md
    • Modify doc/DOCKER_IMAGES_CN.md
    • Modify doc/LATEST_PACKAGES.md
    • Modify doc/COMPILE.md
    • Modify doc/COMPILE_CN.md
    • Modify doc/BERT_10_MINS.md
    • Modify doc/BERT_10_MINS_CN.md
    • Modify doc/BAIDU_KUNLUN_XPU_SERVING.md
    • Modify doc/BAIDU_KUNLUN_XPU_SERVING_CN.md
    • Modify README.md
    • Modify README_CN.md
  • Demo Updates:
    • Add python/examples/low_precision/resnet50
    • Add python/examples/xpu/bert
    • Add python/examples/xpu/ernie
    • Add python/examples/xpu/vgg19
    • Add python/examples/pipeline/PaddleDetection/faster_rcnn
    • Add python/examples/pipeline/PaddleDetection/ppyolo_mbv3
    • Add python/examples/pipeline/PaddleDetection/yolov3
    • Add python/examples/pipeline/PaddleClas/DarkNet53
    • Add python/examples/pipeline/PaddleClas/HRNet_W18_C
    • Add python/examples/pipeline/PaddleClas/MobileNetV1
    • Add python/examples/pipeline/PaddleClas/MobileNetV2
    • Add python/examples/pipeline/PaddleClas/MobileNetV3_large_x1_0
    • Add python/examples/pipeline/PaddleClas/ResNeXt101_vd_64x4d
    • Add python/examples/pipeline/PaddleClas/ResNet50_vd
    • Add python/examples/pipeline/PaddleClas/ResNet50_vd_FPGM
    • Add python/examples/pipeline/PaddleClas/ResNet50_vd_KL
    • Add python/examples/pipeline/PaddleClas/ResNet50_vd_PACT
    • Add python/examples/pipeline/PaddleClas/ResNet_V2_50
    • Add python/examples/pipeline/PaddleClas/ShuffleNetV2_x1_0
    • Add python/examples/pipeline/bert
    • Add python/examples/ocr/ocr_cpp_client.py
    • Modify python/examples/bert [benchmark]
    • Modify python/examples/pipeline/ocr [benchmark]
  • Docker Updates:
    • Add runtime Docker images (CPU, CUDA 10.1, CUDA 10.2, CUDA 11) (Py36, Py37, Py38)
    • Add development Docker images for CUDA 11
    • Add Kubernetes demo images
  • Bug Fixes:
    • Fixed irregular naming in the code, #1136
    • Fixed the problem that parts of the C++ multithreading and framework code were bypassed due to the adaptation to Paddle Inference 2.0, #1124
    • Fixed the json.load problem under py35, #1124
    • Fixed a missing feed_var 'im_shape' in the test-client request of the ssd_vgg16_300_240e_voc example, which resulted in no prediction result, #1180
    • Fixed multiple gRPC bugs caused by model path changes, #1147
    • Fixed the log-printing logic bug of the read OP in C++, #1154
    • Fixed the WebService missing a thread parameter and unified the template names in infer.h and paddle_engine.h, #1136
    • Fixed compile errors caused by the golang import, #1101
    • Fixed Java gRPC bugs, #1215
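
The "Pipeline supports mini-batch inference" item above can be illustrated with a small, generic sketch. It only shows the general idea of splitting one large request batch into smaller chunks before calling a predictor; it is not Paddle Serving's implementation or API. The names split_into_mini_batches, infer_in_mini_batches, predict_fn and mini_batch_size are hypothetical and were chosen for this example.

```python
from typing import Callable, List, Sequence, TypeVar

T = TypeVar("T")
R = TypeVar("R")


def split_into_mini_batches(samples: Sequence[T], mini_batch_size: int) -> List[Sequence[T]]:
    """Split samples into consecutive chunks of at most mini_batch_size items."""
    if mini_batch_size <= 0:
        raise ValueError("mini_batch_size must be positive")
    return [
        samples[start:start + mini_batch_size]
        for start in range(0, len(samples), mini_batch_size)
    ]


def infer_in_mini_batches(
    samples: Sequence[T],
    predict_fn: Callable[[Sequence[T]], List[R]],
    mini_batch_size: int,
) -> List[R]:
    """Run predict_fn chunk by chunk and return results in the input order.

    predict_fn is a hypothetical stand-in for a real model call; it must
    return exactly one output per input sample.
    """
    results: List[R] = []
    for chunk in split_into_mini_batches(samples, mini_batch_size):
        outputs = predict_fn(chunk)
        if len(outputs) != len(chunk):
            raise RuntimeError("predict_fn must return one output per input")
        results.extend(outputs)
    return results


def fake_predict(batch: Sequence[int]) -> List[int]:
    """Toy predictor used only for demonstration: doubles every input."""
    return [x * 2 for x in batch]


if __name__ == "__main__":
    # Five samples with mini_batch_size=2 are processed as chunks of 2, 2 and 1.
    print(infer_in_mini_batches([1, 2, 3, 4, 5], fake_predict, 2))
```

Processing in chunks bounds the amount of data sent to the predictor per call, which is the usual motivation for mini-batching when a single request carries more samples than fit comfortably in device memory.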

Project overview

A flexible, high-performance carrier for machine learning models (the PaddlePaddle serving deployment framework)

Source repository

https://github.com/PaddlePaddle/Serving
