zhangy03 2387a43272 DeepSpeed distributed training 2 years ago
bpe-3w-new DeepSpeed distributed training 2 years ago
bpe-4w DeepSpeed distributed training 2 years ago
examples DeepSpeed distributed training 2 years ago
images DeepSpeed distributed training 2 years ago
log_dir DeepSpeed distributed training 2 years ago
megatron DeepSpeed distributed training 2 years ago
tasks DeepSpeed distributed training 2 years ago
tensorboard_data DeepSpeed distributed training 2 years ago
tools DeepSpeed distributed training 2 years ago
1 DeepSpeed distributed training 2 years ago
LICENSE DeepSpeed distributed training 2 years ago
MANIFEST.in DeepSpeed distributed training 2 years ago
README.md first commit 2 years ago
changes.md DeepSpeed distributed training 2 years ago
ds_master_ssh_setting.sh DeepSpeed distributed training 2 years ago
ds_node_ssh_setting.sh DeepSpeed distributed training 2 years ago
hutong_ssh_test.sh DeepSpeed distributed training 2 years ago
ninja-linux.zip DeepSpeed distributed training 2 years ago
ninja-linux.zip.1 DeepSpeed distributed training 2 years ago
pretrain_bert.py DeepSpeed distributed training 2 years ago
pretrain_gpt2.py DeepSpeed distributed training 2 years ago
pretrain_ict.py DeepSpeed distributed training 2 years ago
requirements.txt DeepSpeed distributed training 2 years ago
setup.py DeepSpeed distributed training 2 years ago

Text Python C++ Cuda other

Contributors (1)