README.md 17.1 KB
Newer Older
1
[**中文**](./README.md)
Y
YixinKristy 已提交
2

D
daminglu 已提交
3
<p align="center">
Y
YixinKristy 已提交
4
  <img src="https://raw.githubusercontent.com/PaddlePaddle/VisualDL/develop/frontend/packages/core/public/images/logo-visualdl.svg?sanitize=true" width="70%"/>
D
daminglu 已提交
5 6
</p>

L
LaraStuStu 已提交
7

8

P
Peter Pan 已提交
9
<p align="center">
P
Peter Pan 已提交
10
<a href="https://actions-badge.atrox.dev/PaddlePaddle/VisualDL/goto?ref=develop"><img alt="Build Status" src="https://img.shields.io/endpoint.svg?url=https%3A%2F%2Factions-badge.atrox.dev%2FPaddlePaddle%2FVisualDL%2Fbadge%3Fref%3Ddevelop&style=flat-square" alt="Build Status" /></a>
P
Peter Pan 已提交
11 12 13 14 15
<a href="https://pypi.org/project/visualdl/"><img src="https://img.shields.io/pypi/v/visualdl?style=flat-square" alt="PyPI" /></a>
<a href="https://pypi.org/project/visualdl/#files"><img src="https://img.shields.io/pypi/dm/visualdl?style=flat-square" alt="Downloads" /></a>
<a href="https://github.com/PaddlePaddle/VisualDL/blob/develop/LICENSE"><img src="https://img.shields.io/github/license/paddlepaddle/visualdl?style=flat-square" alt="License" /></a>
</p>

16

P
Peter Pan 已提交
17
<p align="center">
18
<a href="javascript:void(0)"><img src="https://img.shields.io/badge/QQ_Group-1045783368-52B6EF?style=social&logo=tencent-qq&logoColor=000&logoWidth=20" alt="QQ Group" /></a>
P
Peter Pan 已提交
19
</p>
Y
YixinKristy 已提交
20

D
daminglu 已提交
21

22 23 24
## Introduction

VisualDL, a visualization analysis tool of PaddlePaddle, provides a variety of charts to show the trends of parameters, and visualizes model structures, data samples, histograms of tensors, pr curves and high-dimensional data distributions. It enables users to understand the training process and the model structure more clearly and intuitively so as to optimize models efficiently.
Q
Qiao Longfei 已提交
25

26 27 28
VisualDL provides various visualization functions, including tracking metrics in real-time, visualizing the model structure, displaying the data sample, presenting the changes of distributions of tensors, showing the pr curves, projecting high-dimensional data to a lower dimensional space and more. Additionally, VisualDL provides VDL.service, which enables developers easily to save, track and share visualization results of experiments. For specific guidelines of each function, please refer to  [**VisualDL User Guide**](./docs/components/UserGuide-en.md). Currently, VisualDL iterates rapidly and new functions will be continuously added.

Browsers supported by VisualDL are:
Y
YixinKristy 已提交
29

P
Peter Pan 已提交
30 31 32 33 34
- Google Chrome ≥ 79
- Firefox ≥ 67
- Microsoft Edge ≥ 79
- Safari ≥ 11.1

35
VisualDL natively supports the use of Python. Developers can retrieve plentiful visualization results by simply adding a few lines of Python code into the model before training.
D
daminglu 已提交
36

37
## Contents
D
daminglu 已提交
38

39 40 41 42 43 44 45
* [Key Highlights](#Key-Highlights)
* [Installation](#Installation)
* [Usage Guideline](#Usage-Guideline)
* [Function Preview](#Function-Preview)
* [Contribution](#Contribution)
* [More Details](#More-Details)
* [Technical Communication](#Technical-Communication)
D
daminglu 已提交
46 47


48

49
## Key Highlights
D
daminglu 已提交
50

51
### Easy to Use
52

53
The high-level design of API makes it easy to use. Only one click can initiate the visualization of model structures.
D
daminglu 已提交
54

55

56
### Various Functions
Q
Qiao Longfei 已提交
57

58
The function contains the visualization of training parameters, data samples, graph structures, histograms of tensors, PR curves and high-dimensional data.
Y
YixinKristy 已提交
59

60
### High Compatibility
Y
YixinKristy 已提交
61

62
VisualDL provides the visualization of the mainstream model structures such as Paddle, ONNX, Caffe, widely supporting visual analysis for diverse users.
Y
YixinKristy 已提交
63

64
### Fully Support
Y
YixinKristy 已提交
65

66
By Integrating into PaddlePaddle and related modules, VisualDL allows developers to use different components unobstructed, and thus have the best experience in the PaddlePaddle ecosystem.
Q
Qiao Longfei 已提交
67

68
## Installation
69 70


71
### Install by PiP
72

73
```shell
Y
YixinKristy 已提交
74
python -m pip install visualdl -i https://mirror.baidu.com/pypi/simple
75
```
76 77

### Install by Code
Y
YixinKristy 已提交
78 79 80 81 82 83 84 85 86

```
git clone https://github.com/PaddlePaddle/VisualDL.git
cd VisualDL

python setup.py bdist_wheel
pip install --upgrade dist/visualdl-*.whl
```

87 88 89 90
Please note that Python 2 is no longer maintained officially since January 1, 2020. VisualDL now only supports Python 3 in order to ensure the usability of codes.


## Usage Guideline
Y
YixinKristy 已提交
91

92
VisualDL stores the data, parameters and other information of the training process in a log file. Users can launch the panel to observe the visualization results.
Y
YixinKristy 已提交
93

94
### 1. Log
Y
YixinKristy 已提交
95

96
The Python SDK is provided at the back end of VisualDL, and a logger can be customized through LogWriter. The interface description is shown as follows:
97

Y
YixinKristy 已提交
98
```python
99
class LogWriter(logdir=None,
Y
YixinKristy 已提交
100 101 102 103 104
                comment='',
                max_queue=10,
                flush_secs=120,
                filename_suffix='',
                write_to_disk=True,
105
                **kwargs)
106 107
```

108
#### Interface Parameters
Y
YixinKristy 已提交
109

110 111 112 113 114 115 116 117 118
| parameters      | type    | meaning                                                      |
| --------------- | ------- | ------------------------------------------------------------ |
| logdir          | string  | The path location of log file. VisualDL will create a log file under this path to record information generated by the training process. If not specified, the path will be  `runs/${CURRENT_TIME}`as default. |
| comment         | string  | Add a suffix to the log folder name, which is invalid if logdir is already specified. |
| max_queue       | int     | The maximum capacity of the data generated before recording in a log file. If the capacity is reached, the data is immediately written into the log file. |
| flush_secs      | int     | The maximum cache time of the data generated before recording in a log file, when this time is reached, the data is immediately written to the log file. |
| filename_suffix | string  | Add a suffix to the default log file name.                   |
| write_to_disk   | boolean | Write into disk or not.                                      |
| display_name    | string  | Set the name of different runs when `logdir` is too long or needed to be hidden. If not set, the default name is `logdir`. |
Y
YixinKristy 已提交
119

120
#### Example
Y
YixinKristy 已提交
121

122
Create a log file and record scalar values:
D
daminglu 已提交
123 124 125 126

```python
from visualdl import LogWriter

127
# create a log file under `./log/scalar_test/train`
Y
YixinKristy 已提交
128
with LogWriter(logdir="./log/scalar_test/train") as writer:
129
    # use `add_scalar` to record scalar values
Y
YixinKristy 已提交
130 131 132
    writer.add_scalar(tag="acc", step=1, value=0.5678)
    writer.add_scalar(tag="acc", step=2, value=0.6878)
    writer.add_scalar(tag="acc", step=3, value=0.9878)
D
daminglu 已提交
133 134
```

135
### 2. Launch Panel
D
daminglu 已提交
136

137
In the above example, the log has recorded three sets of scalar values. Developers can view the visualization results of the log file through launching the visualDL panel. There are two ways to launch a log file:
Z
Zeyu Chen 已提交
138

139
#### Launch by Command Line
Z
Zeyu Chen 已提交
140

141
Use the command line to launch the VisualDL panel:
D
daminglu 已提交
142

Y
YixinKristy 已提交
143
```python
P
Peter Pan 已提交
144
visualdl --logdir <dir_1, dir_2, ... , dir_n> --model <model_file> --host <host> --port <port> --cache-timeout <cache_timeout> --language <language> --public-path <public_path> --api-only
D
daminglu 已提交
145
```
Y
YixinKristy 已提交
146

147
Parameter details:
Y
YixinKristy 已提交
148

149 150 151 152 153 154 155 156 157 158
| parameters      | meaning                                                      |
| --------------- | ------------------------------------------------------------ |
| --logdir        | Set one or more directories of the log. VisualDL will search the log file recursively under this path to display the all experimental results. |
| --model         | Set path to the model file (not a directory). VisualDL will visualize the model file in Graph page. PaddlePaddle、ONNX、Keras、Core ML、Caffe and Other model formats are supported. Please refer to [Graph - Functional Instructions](./docs/components/UserGuide-en.md#functional-instructions-2). |
| --host          | Specify IP address. The default value is `127.0.0.1`.        |
| --port          | Set the port. The default value is `8040`.                   |
| --cache-timeout | Cache time of the backend. During the cache time, the front end requests the same URL multiple times, and then the returned data is obtained from the cache. The default cache time is 20 seconds. |
| --language      | The language of the VisualDL panel. Language can be specified as 'en' or 'zh', and the default is the language used by the browser. |
| --public-path   | The URL path of the VisualDL panel. The default path is '/app', meaning that the access address is 'http://&lt;host&gt;:&lt;port&gt;/app'. |
| --api-only      | Decide whether or not to provide only API. If this parameter is set, VisualDL will only provides API service without displaying the web page, and the API address is 'http://&lt;host&gt;:&lt;port&gt;/&lt;public_path&gt;/api'. Additionally, If the public_path parameter is not specified, the default address is 'http://&lt;host&gt;:&lt;port&gt;/api'. |
Y
YixinKristy 已提交
159

160
To visualize the log file generated in the previous step, developers can launch the panel through the command:
Y
YixinKristy 已提交
161 162 163

```
visualdl --logdir ./log
D
daminglu 已提交
164 165
```

166
#### Launch in Python Script
Y
YixinKristy 已提交
167

168 169

Developers can start the VisualDL panel in Python script as follows:
D
daminglu 已提交
170

171
```python
Y
YixinKristy 已提交
172
visualdl.server.app.run(logdir,
P
Peter Pan 已提交
173
                        model="path/to/model",
Y
YixinKristy 已提交
174 175 176 177
                        host="127.0.0.1",
                        port=8080,
                        cache_timeout=20,
                        language=None,
178 179
                        public_path=None,
                        api_only=False,
Y
YixinKristy 已提交
180 181 182
                        open_browser=False)
```

183
Please note: since all parameters are indefinite except `logdir`, developers should specify parameter names when using them.
Y
YixinKristy 已提交
184

185
The interface parameters are as follows:
Y
YixinKristy 已提交
186

187 188 189 190 191 192 193 194 195 196 197
| parameters    | type                                               | meaning                                                      |
| ------------- | -------------------------------------------------- | ------------------------------------------------------------ |
| logdir        | string or list[string_1, string_2, ... , string_n] | Set one or more directories of the log. VisualDL will search the log file recursively under this path to display the all experimental results. |
| model         | string                                             | Set path to the model file (not a directory). VisualDL will visualize the model file in Graph page. |
| host          | string                                             | Specify IP address. The default value is `127.0.0.1`.        |
| port          | int                                                | Set the port. The default value is `8040`.                   |
| cache_timeout | int                                                | Cache time of the backend. During the cache time, the front end requests the same URL multiple times, and then the returned data is obtained from the cache. The default cache time is 20 seconds. |
| language      | string                                             | The language of the VisualDL panel. Language can be specified as 'en' or 'zh', and the default is the language used by the browser. |
| public_path   | string                                             | The URL path of the VisualDL panel. The default path is '/app', meaning that the access address is 'http://&lt;host&gt;:&lt;port&gt;/app'. |
| api_only      | boolean                                            | Decide whether or not to provide only API. If this parameter is set, VisualDL will only provides API service without displaying the web page, and the API address is 'http://&lt;host&gt;:&lt;port&gt;/&lt;public_path&gt;/api'. Additionally, If the parameter public_path is not specified, the default address is 'http://&lt;host&gt;:&lt;port&gt;/api'. |
| open_browser  | boolean                                            | Whether or not to open the browser. If this parameter is set as True, the browser will be opened automatically and VisualDL panel will be launched at the same time. If parameter api_only is specified as True,  parameter open_browser can be ignored. |
198

199
To visualize the log file generated in the previous step, developers can launch the panel through the command:
Y
YixinKristy 已提交
200 201 202 203 204

```python
from visualdl.server import app

app.run(logdir="./log")
205
```
D
daminglu 已提交
206

207
After launching the panel by one of the above methods, developers can see the visualization results on the browser shown as blow:
Y
YixinKristy 已提交
208 209

<p align="center">
210
  <img src="https://user-images.githubusercontent.com/48054808/90868674-ba321f00-e3c9-11ea-83c1-f03c6dd19187.png" width="70%"/>
Y
YixinKristy 已提交
211 212
</p>

213

214 215 216
### 3. Read data in log files using LogReader

VisualDL also provide `LogReader` interface to read raw data from log files.
P
Peter Pan 已提交
217

218 219 220 221 222
```python
class LogReader(logdir=None,
                file_name='')
```

223 224 225 226 227 228
#### interface parameters

| parameters | type   | meaning                              |
| ---------- | ------ | ------------------------------------ |
| logdir     | string | Path to the log directory. Required. |
| file_name  | string | File name of the log file. Required. |
229

230
#### Example
231

232
Suppose there is a log file named `vdlrecords.1605533348.log` in directory `./log`. We can get scalar data in `loss` tag by
233 234 235 236 237 238 239 240

```python
from visualdl import LogReader

reader = LogReader(logdir='./log', file_name='vdlrecords.1605533348.log')
data = reader.get_data('scalar', 'loss')
print(data)
```
P
Peter Pan 已提交
241

242
The result is a list of
P
Peter Pan 已提交
243

244 245 246 247 248 249 250 251 252
```python
...
id: 5
tag: "Metrics/Training(Step): loss"
timestamp: 1605533356039
value: 3.1297709941864014
...
```

253
For more information of `LogReader`, please refer to [LogReader](./docs/io/LogReader.md).
Y
YixinKristy 已提交
254

255
## Function Preview
Y
YixinKristy 已提交
256 257 258

### Scalar

259
**Scalar** makes use of various charts to display how the parameters, such as accuracy, loss and learning rate, change during the training process. In this case, developers can observe not only the single but also the multiple groups of parameters in order to understand the training process and thus speed up the process of model tuning.
Y
YixinKristy 已提交
260

261 262 263
#### Dynamic Display

After the launch of VisualDL Board, the LogReader will continuously record the data to display in the front-end. Hence, the changes of parameters can be visualized in real-time, as shown below:
Y
YixinKristy 已提交
264 265

<p align="center">
266
  <img src="https://visualdl.bj.bcebos.com/images/dynamic_display.gif" width="60%"/>
Y
YixinKristy 已提交
267 268 269 270
</p>



271 272 273
#### Comparison of Multiple Experiments

Developers can compare with multiple experiments by specifying and uploading the path of each experiment at the same time so as to visualize the same parameters in the same chart.
Y
YixinKristy 已提交
274 275

<p align="center">
276
  <img src="https://user-images.githubusercontent.com/48054808/90869567-fdd95880-e3ca-11ea-9855-6c97ad5c8ae7.gif" width="100%"/>
Y
YixinKristy 已提交
277 278 279
</p>


280

Y
YixinKristy 已提交
281
### Image
282 283

**Image** provides real-time visualizations of the image data during the training process, allowing developers to observe the changes of images in different training stages and  to deeply understand the effects of the training process.
Y
YixinKristy 已提交
284 285

<p align="center">
286
<img src="https://user-images.githubusercontent.com/48054808/90869677-22353500-e3cb-11ea-9830-2334bdd8e52e.gif" width="55%"/>
Y
YixinKristy 已提交
287 288
</p>

289

Y
YixinKristy 已提交
290
### Audio
291 292

**Audio** aims to allow developers to listen to the audio data in real-time during the training process, helping developers to monitor the process of speech recognition and text-to-speech.
Y
YixinKristy 已提交
293 294

<p align="center">
295
<img src="https://user-images.githubusercontent.com/48054808/90869771-47c23e80-e3cb-11ea-8b2a-a38b6c33d64b.png" width="85%"/>
Y
YixinKristy 已提交
296 297
</p>

298

Y
YixinKristy 已提交
299 300
### Graph

301 302
**Graph** enables developers to visualize model structures by only one click. Moreover, **Graph** allows Developers to explore model attributes, node information, node input and output. aiding them analyze model structure quickly and understand the direction of data flow easily.

Y
YixinKristy 已提交
303
<p align="center">
304
<img src="https://user-images.githubusercontent.com/48054808/90869866-6aecee00-e3cb-11ea-8211-b8af070239e6.png" width="85%"/>
Y
YixinKristy 已提交
305 306
</p>

307

Y
YixinKristy 已提交
308 309
### Histogram

310
Histogram displays how the trend of tensors (weight, bias, gradient, etc.) changes during the training process in the form of histogram. Developers can adjust the model structures accurately by having an in-depth understanding of the effect of each layer.
Y
YixinKristy 已提交
311

312
- Offset Mode
Y
YixinKristy 已提交
313 314

<p align="center">
315
<img src="https://user-images.githubusercontent.com/48054808/90870121-bd2e0f00-e3cb-11ea-89cf-6622cb607b89.png" width="85%"/>
Y
YixinKristy 已提交
316 317
</p>

Y
YixinKristy 已提交
318

Y
YixinKristy 已提交
319

320
- Overlay Mode
Y
YixinKristy 已提交
321 322

<p align="center">
323
<img src="https://user-images.githubusercontent.com/48054808/90870194-cfa84880-e3cb-11ea-8a66-bebcad267a10.png" width="85%"/>
Y
YixinKristy 已提交
324 325
</p>

326

Y
YixinKristy 已提交
327 328
### High Dimensional

329
**High Dimensional** provides two approaches--T-SNE and PCA--to do the dimensionality reduction, allowing developers to have an in-depth analysis of the relationship between high-dimensional data and to optimize algorithms based on the analysis.
Y
YixinKristy 已提交
330 331

<p align="center">
332
<img src="https://user-images.githubusercontent.com/48054808/90870677-85739700-e3cc-11ea-8653-18fa5c4106a3.GIF" width="85%"/>
Y
YixinKristy 已提交
333 334
</p>

335

336 337
### VDL.service

338
**VDL.service** enables developers to easily save, track and share visualization results with anyone for free.
339 340

<p align="center">
341
<img src=https://user-images.githubusercontent.com/48054808/93731055-fbeafb00-fbfd-11ea-80f4-bbfd08a0fc35.png width="85%"/>
342 343
</p>

344

345
## Contribution
Y
YixinKristy 已提交
346

347
VisualDL, in which Graph is powered by [Netron](https://github.com/lutzroeder/netron), is an open source project supported by [PaddlePaddle](https://www.paddlepaddle.org/) and [ECharts](https://echarts.apache.org/).
W
wuzewu 已提交
348

349
Developers are warmly welcomed to use, comment and contribute.
W
wuzewu 已提交
350

351
## More Details
W
wuzewu 已提交
352

353
For more details related to the use of VisualDL, please refer to [**VisualDL User Guide**](./docs/components/UserGuide-en.md)
Y
YixinKristy 已提交
354

355
## Technical Communication
Y
YixinKristy 已提交
356

357
Welcome to join the official QQ group 104578336 to communicate with PaddlePaddle team and other developers.
Y
YixinKristy 已提交
358 359 360 361

<p align="center">
<img src="https://user-images.githubusercontent.com/48054808/82522691-c2758680-9b5c-11ea-9aee-fca994aba175.png" width="20%"/>
</p>
362