New features

  • Add compilation environment checks for C++ Serving and Pipeline Serving #1584
  • C++ Serving supports changing the log output path #1592
  • Add dynamic shape configuration and examples for use with TensorRT (TRT) #1590
  • Add Prometheus monitoring for Python Pipeline Serving #1586
  • Add Prometheus monitoring for C++ Serving #1568, #1576, #1577
  • Support heterogeneous hardware, including x86 + DCU, ARM + Ascend 310, and ARM + Ascend 910 #1544
  • Support Python 3.9

Performance optimization

  • C++ Serving adds request result caching: identical requests are answered directly from the cache #1585, #1588
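
The idea behind request result caching can be illustrated with a minimal sketch. This is not the C++ Serving implementation: the class `RequestCache`, the helper `fake_infer`, the SHA-256 request key, and the LRU eviction policy are all assumptions chosen for illustration only.

```python
import hashlib
import json
from collections import OrderedDict


class RequestCache:
    """Hypothetical LRU cache keyed by a hash of the request payload."""

    def __init__(self, capacity=128):
        self.capacity = capacity
        self._store = OrderedDict()
        self.hits = 0
        self.misses = 0

    @staticmethod
    def _key(request):
        # Serialize with sorted keys so equal dicts produce the same hash.
        payload = json.dumps(request, sort_keys=True).encode("utf-8")
        return hashlib.sha256(payload).hexdigest()

    def get_or_compute(self, request, infer):
        key = self._key(request)
        if key in self._store:
            self.hits += 1
            self._store.move_to_end(key)  # mark as most recently used
            return self._store[key]
        self.misses += 1
        result = infer(request)  # only run inference on a cache miss
        self._store[key] = result
        if len(self._store) > self.capacity:
            self._store.popitem(last=False)  # evict the least recently used entry
        return result


def fake_infer(request):
    # Stand-in for a real model call (assumption for this sketch).
    return {"score": sum(request["x"])}


cache = RequestCache(capacity=2)
first = cache.get_or_compute({"x": [1, 2, 3]}, fake_infer)
second = cache.get_or_compute({"x": [1, 2, 3]}, fake_infer)  # served from the cache
```

In this sketch the second call never reaches `fake_infer`; the stored result is returned as-is, which is the behavior the release note describes for identical requests.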

Feature enhancements

  • A more convenient way to chain multiple models in C++ #1546
  • Upgrade the Dockerfiles and add a CentOS Dockerfile #1618, #1594
  • Add bf16 low-precision support to Pipeline Serving #1594, #1554

Documentation and example changes

  • Add a PP-ShiTu example #1572
  • Add a PaddleNLP example #1609
  • Add environment check documentation #1643
  • Add dynamic TRT usage documentation #1643
  • Add heterogeneous hardware usage documentation #1641, #1654
  • Add request cache usage documentation #1641, #1588

Bug fixes

  • Fix a memory leak in the asynchronous framework #1589
  • Fix handling of list[str] input in Pipeline Serving #1598

Project description

A flexible, high-performance carrier for machine learning models (the PaddlePaddle serving deployment framework)

Source project

https://github.com/PaddlePaddle/Serving

Release v0.9.0