zhangy03 2387a43272 DeepSpeed distributed training 2 years ago
..
.ipynb_checkpoints DeepSpeed distributed training 2 years ago
ds DeepSpeed distributed training 2 years ago
ds_config.json DeepSpeed distributed training 2 years ago
ds_pretrain_gpt2_master0.sh DeepSpeed distributed training 2 years ago
ds_pretrain_gpt2_node1.sh DeepSpeed distributed training 2 years ago
ds_pretrain_gpt2_pipe.sh DeepSpeed distributed training 2 years ago
ds_running_pretrain_13B_gpt2_model_parallel.sh DeepSpeed distributed training 2 years ago
ds_zero_stage_2_config.json DeepSpeed distributed training 2 years ago
evaluate_zeroshot_gpt2.sh DeepSpeed distributed training 2 years ago
finetune_mnli_distributed.sh DeepSpeed distributed training 2 years ago
finetune_race_distributed.sh DeepSpeed distributed training 2 years ago
generate_text.sh DeepSpeed distributed training 2 years ago
merge_mp_bert.sh DeepSpeed distributed training 2 years ago
pretrain_bert.sh DeepSpeed distributed training 2 years ago
pretrain_bert_distributed.sh DeepSpeed distributed training 2 years ago
pretrain_gpt2.sh DeepSpeed distributed training 2 years ago
pretrain_gpt2_distributed.sh DeepSpeed distributed training 2 years ago
pretrain_gpt_distributed_with_mp_cn.sh DeepSpeed distributed training 2 years ago

Text Python C++ Cuda other

Contributors (1)