diff --git a/README.md b/README.md index 6d8a5978db81eabc8a90023641d922b1cbc0aa03..8dd6c0e5a943056e7d5c9cb1a616c775c50125cc 100644 --- a/README.md +++ b/README.md @@ -231,7 +231,7 @@ app.run(logdir="./log") 实时展示训练过程中的图像数据,用于观察不同训练阶段的图像变化,进而深入了解训练过程及效果。

- +

### Audio diff --git a/docs/README.md b/docs/README.md index b69800f7d8d23414c2dfa7dcffdcbd3d8c70ca14..2cd3c800e17800af119e90c1d0d253ac03e28275 100644 --- a/docs/README.md +++ b/docs/README.md @@ -231,7 +231,7 @@ app.run(logdir="./log") 实时展示训练过程中的图像数据,用于观察不同训练阶段的图像变化,进而深入了解训练过程及效果。

- +

### Audio diff --git a/docs/components/README.md b/docs/components/README.md index f51ad5592ebed3d38c200a7f4d146e770d6452bb..3cfa6f499ad74e73faf2d7758d98b30876327c01 100644 --- a/docs/components/README.md +++ b/docs/components/README.md @@ -289,38 +289,19 @@ add_audio(tag, audio_array, step, sample_rate) ### Demo - +下面展示了使用 Audio 组件记录数据的示例,代码文件请见[Audio组件](https://github.com/PaddlePaddle/VisualDL/blob/develop/demo/components/audio_test.py) ```python from visualdl import LogWriter -import numpy as np -import wave - - -def read_audio_data(audio_path): - """ - Get audio data. - """ - CHUNK = 4096 - f = wave.open(audio_path, "rb") - wavdata = [] - chunk = f.readframes(CHUNK) - while chunk: - data = np.frombuffer(chunk, dtype='uint8') - wavdata.extend(data) - chunk = f.readframes(CHUNK) - # 8k sample rate, 16bit frame, 1 channel - shape = [8000, 2, 1] - return shape, wavdata +from scipy.io import wavfile if __name__ == '__main__': - with LogWriter(logdir="./log") as writer: - audio_shape, audio_data = read_audio_data("./testing.wav") - audio_data = np.array(audio_data) + with LogWriter(logdir="./log/audio_test/train") as writer: + sample_rate, audio_data = wavfile.read('./test.wav') writer.add_audio(tag="audio_tag", audio_array=audio_data, step=0, - sample_rate=8000) + sample_rate=sample_rate) ``` 运行上述程序后,在命令行执行 diff --git a/docs/components/UserGuide-en.md b/docs/components/UserGuide-en.md index a5b99b1a14f0f582ded515849cec9971f2e472c0..40e612b0a73474b6508b65faf662ad2859e1e80b 100644 --- a/docs/components/UserGuide-en.md +++ b/docs/components/UserGuide-en.md @@ -294,35 +294,16 @@ The following shows an example of using Audio to record data, and the script can ```python from visualdl import LogWriter -import numpy as np -import wave - - -def read_audio_data(audio_path): - """ - Get audio data. - """ - CHUNK = 4096 - f = wave.open(audio_path, "rb") - wavdata = [] - chunk = f.readframes(CHUNK) - while chunk: - data = np.frombuffer(chunk, dtype='uint8') - wavdata.extend(data) - chunk = f.readframes(CHUNK) - # 8k sample rate, 16bit frame, 1 channel - shape = [8000, 2, 1] - return shape, wavdata +from scipy.io import wavfile if __name__ == '__main__': - with LogWriter(logdir="./log") as writer: - audio_shape, audio_data = read_audio_data("./testing.wav") - audio_data = np.array(audio_data) + with LogWriter(logdir="./log/audio_test/train") as writer: + sample_rate, audio_data = wavfile.read('./test.wav') writer.add_audio(tag="audio_tag", audio_array=audio_data, step=0, - sample_rate=8000) + sample_rate=sample_rate) ``` After running the above program, developers can launch the panel by: ```shell