PaddlePaddle / hapi
Commit 0b93f490
Authored on March 31, 2020 by guosheng

Add random input for seq2seq to test.
Parent: 21f50136

Showing 2 changed files with 33 additions and 10 deletions

seq2seq/seq2seq.py  +2 -1
seq2seq/train.py  +31 -9

seq2seq/seq2seq.py @ 0b93f490
...
@@ -223,7 +223,8 @@ class Seq2Seq(Model):
         # encoder
         encoder_output, encoder_final_state = self.encoder(src, src_length)
 
-        # decoder initial states
+        # decoder initial states: use input_feed and the structure is
+        # [[h,c] * num_layers, input_feed]
         decoder_initial_states = [
             encoder_final_state,
             self.decoder.lstm_attention.cell.get_initial_states(
...
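
As a reading aid for the new comment, here is a minimal NumPy sketch of the nested layout `[[h,c] * num_layers, input_feed]`. It is not the code in seq2seq.py: the helper name `build_decoder_initial_states`, the zero-filled arrays, and the sizes are all hypothetical, and it assumes `input_feed` has shape `(batch_size, hidden_size)`. In the commit, the first element comes from the encoder's final state and the second from `get_initial_states`.

```python
import numpy as np


def build_decoder_initial_states(batch_size, hidden_size, num_layers):
    """Return [[h, c] * num_layers, input_feed], zero-filled (illustrative only)."""
    # One [h, c] pair per LSTM layer; stands in for encoder_final_state.
    lstm_states = [
        [
            np.zeros((batch_size, hidden_size), dtype="float32"),  # h
            np.zeros((batch_size, hidden_size), dtype="float32"),  # c
        ]
        for _ in range(num_layers)
    ]
    # Stand-in for the initial input_feed; its shape is an assumption.
    input_feed = np.zeros((batch_size, hidden_size), dtype="float32")
    return [lstm_states, input_feed]


states = build_decoder_initial_states(batch_size=4, hidden_size=8, num_layers=2)
```

The outer list always has two elements, so code that unpacks it can separate the per-layer LSTM states from the input feed regardless of `num_layers`.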

seq2seq/train.py @ 0b93f490
...
@@ -80,6 +80,37 @@ def do_train(args):
         Input([None, None, 1], "int64", name="label"),
     ]
 
+    model = Seq2Seq(args.src_vocab_size, args.trg_vocab_size, args.embed_dim,
+                    args.hidden_size, args.num_layers, args.dropout)
+
+    model.prepare(
+        fluid.optimizer.Adam(
+            learning_rate=args.learning_rate,
+            parameter_list=model.parameters()),
+        CrossEntropyCriterion(),
+        inputs=inputs, labels=labels)
+    batch_size = 32
+    src_seq_len = 10
+    trg_seq_len = 12
+    iter_num = 10
+
+    def random_generator():
+        for i in range(iter_num):
+            src = np.random.randint(2, args.src_vocab_size,
+                                    (batch_size, src_seq_len)).astype("int64")
+            src_length = np.random.randint(1, src_seq_len,
+                                           (batch_size, )).astype("int64")
+            trg = np.random.randint(2, args.trg_vocab_size,
+                                    (batch_size, trg_seq_len)).astype("int64")
+            trg_length = np.random.randint(1, trg_seq_len,
+                                           (batch_size, )).astype("int64")
+            label = np.random.randint(1, trg_seq_len,
+                                      (batch_size, trg_seq_len, 1)).astype("int64")
+            yield src, src_length, trg, trg_length, label
+
+    model.fit(train_data=random_generator, log_freq=1)
+    exit(0)
+
     dataset = Seq2SeqDataset(fpattern=args.training_file,
                              src_vocab_fpath=args.src_vocab_fpath,
                              trg_vocab_fpath=args.trg_vocab_fpath,
...
@@ -107,15 +138,6 @@ def do_train(args):
         num_workers=0,
         return_list=True)
 
-    model = Seq2Seq(args.src_vocab_size, args.trg_vocab_size, args.embed_dim,
-                    args.hidden_size, args.num_layers, args.dropout)
-
-    model.prepare(
-        fluid.optimizer.Adam(
-            learning_rate=args.learning_rate,
-            parameter_list=model.parameters()),
-        CrossEntropyCriterion(),
-        inputs=inputs, labels=labels)
     model.fit(train_data=train_loader,
               eval_data=None,
               epochs=1,
...
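
To try the random-input idea from this commit outside the training script, the following is a standalone sketch of such a generator using only NumPy. The function name `random_batches`, the `seed` argument, and the vocabulary sizes in the usage line are hypothetical. It also differs from the commit in two labeled ways: lengths are drawn up to and including the sequence length (the commit's `randint(1, seq_len)` excludes it), and labels are drawn from the target vocabulary range rather than from `[1, trg_seq_len)`, on the assumption that labels are meant to be token ids.

```python
import numpy as np


def random_batches(src_vocab_size, trg_vocab_size, batch_size=32,
                   src_seq_len=10, trg_seq_len=12, iter_num=10, seed=0):
    """Yield (src, src_length, trg, trg_length, label) tuples of random int64 data."""
    rng = np.random.default_rng(seed)
    for _ in range(iter_num):
        # Token ids start at 2, mirroring the commit's lower bound.
        src = rng.integers(2, src_vocab_size,
                           (batch_size, src_seq_len), dtype=np.int64)
        # Assumption: a length may equal the full sequence length.
        src_length = rng.integers(1, src_seq_len + 1,
                                  (batch_size,), dtype=np.int64)
        trg = rng.integers(2, trg_vocab_size,
                           (batch_size, trg_seq_len), dtype=np.int64)
        trg_length = rng.integers(1, trg_seq_len + 1,
                                  (batch_size,), dtype=np.int64)
        # Assumption: labels are target token ids, not positions.
        label = rng.integers(2, trg_vocab_size,
                             (batch_size, trg_seq_len, 1), dtype=np.int64)
        yield src, src_length, trg, trg_length, label


batches = list(random_batches(src_vocab_size=100, trg_vocab_size=120))
```

Fixing the seed makes a failing smoke test reproducible, which the unseeded `np.random.randint` calls in the commit do not guarantee.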