kimxiaogen 25c88d0b75 111122 2 years ago
attn.py 11 2 years ago
dataset.py [merge] Sync code changes from the GPU version and add Ascend-specific code 2 years ago
embedding.py [merge] Sync code changes from the GPU version and add Ascend-specific code 2 years ago
layer.py 111122 2 years ago
mem_transformer.py 111122 2 years ago
positionwiseFF.py 111122 2 years ago
vocabulary.py [merge] Sync code changes from the GPU version and add Ascend-specific code 2 years ago

Transformer-XL is an improvement on the Transformer that mainly addresses the problem of modeling long sequences.
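One way Transformer-XL handles long sequences is segment-level recurrence: the input is processed in fixed-length segments, and hidden states cached from earlier segments are reused as extra context for the current one. The sketch below illustrates only that caching pattern in plain Python. The names `split_segments`, `update_memory`, `seg_len`, and `mem_len` are hypothetical and are not taken from `mem_transformer.py`, and token ids stand in for hidden states.

```python
from typing import List


def split_segments(tokens: List[int], seg_len: int) -> List[List[int]]:
    """Cut a long sequence into consecutive segments of at most seg_len items."""
    return [tokens[i:i + seg_len] for i in range(0, len(tokens), seg_len)]


def update_memory(prev_mem: List[int], hidden: List[int], mem_len: int) -> List[int]:
    """Keep the last mem_len positions of [prev_mem; hidden] as the new memory."""
    if mem_len <= 0:
        return []
    combined = list(prev_mem) + list(hidden)
    return combined[-mem_len:]


# Illustrative run: token ids stand in for hidden states (an assumption).
tokens = list(range(10))
memory: List[int] = []
context_sizes: List[int] = []

for segment in split_segments(tokens, seg_len=4):
    # The current segment attends over the cached memory plus itself.
    context = memory + segment
    context_sizes.append(len(context))
    # Cache the most recent positions for the next segment.
    memory = update_memory(memory, segment, mem_len=4)

print(context_sizes)  # [4, 8, 6]
print(memory)         # [6, 7, 8, 9]
```

In a real model the memory would hold per-layer hidden-state tensors with gradients stopped, rather than token ids. The bookkeeping, concatenate then keep the last `mem_len` positions, follows the same pattern.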


Contributors (2)