新特性
- 增加C++ Serving 和 Pipeline Serving编译环境检查 #1584
- C++ Serving 支持修改log日志生成路径 #1592
- 使用TRT时,新增动态shape配置功能和示例 #1590
- 新增Python Pipeline Serving 普罗米修斯监控 #1586
- 新增C++ Serving 普罗米修斯监控 #1568 #1576 #1577
- 支持异构硬件,包括:x86+DCU、ARM+ascend310、ARM+ascend910 #1544
- 支持Python39
性能优化
- C++ Serving增加请求结果缓存功能,相同的请求直接返回 #1585, #1588
功能增强
- 更便捷的C++串联多模型方式 #1546
- dockerfile升级,新增centos dockerfile #1618 #1594
- 新增Pipeline Serving bf16低精度支持 #1594 #1554
文档和示例变更
- 新增pp-shitu示例 #1572
- 新增PaddleNLP示例 #1609
- 新增环境检查文档 #1643
- 新增动态TRT使用文档 #1643
- 新增异构硬件使用文档 #1641,#1654
- 新增请求缓存Cache使用说明文档 #1641, #1588
Bug修复
- 修复异步框架下内存泄露问题 #1589
- 修复Pipeline Serving中输入为list[str]的情况 #1598
For English:
New features
- Add C++ serving and pipeline serving compilation environment check #1584
- C++ serving supports modifying the log generation path #1592
- When using TRT, new dynamic shape configuration functions and examples are added #1590
- Add Python pipeline serving Prometheus monitoring #1586
- Add C++ serving Prometheus monitoring #1568 #1576 #1577
- Support heterogeneous hardware, including x86 + DCU, arm + ascend310 and arm + ascend910 #1544
- Support Python 39
Performance optimization
- C++ serving adds the request result caching function, and the same request is directly returned #1585, #1588
Function Enhance
- More convenient C++ series multi model mode #1546
- Dockerfile upgrade, new Centos dockerfile #1618 #1594
- New pipeline serving bf16 low precision support #1594 #1554
Documentation and sample changes
- New PP-Shitu example #1572
- New paddlenlp example #1609
- New environmental inspection document #1643
- New dynamic TRT usage document #1643
- New heterogeneous hardware usage documents #1641, #1654
- New request cache instructions #1641, #1588
Bug repair
- Fix memory leak in asynchronous framework #1589
- Fix the input of list [STR] in pipeline serving #1598