You can not select more than 25 topics Topics must start with a chinese character,a letter or number, can include dashes ('-') and can be up to 35 characters long.
 
 
 
 
 
 
i-robot 0288c29301
!376 明确peft 0.4.0在torch1.11.0上面的依赖问题
5 months ago
ascendspeed olve the dependency issue of peft==0.4.0 and update the readme. 5 months ago
ci 1.补充样例代码与ST用例 6 months ago
examples olve the dependency issue of peft==0.4.0 and update the readme. 5 months ago
sources/images !373 Llama-65B训练优化并新增推理/评估 5 months ago
speed_infer 新增llama pa模型,bugfix 5 months ago
tasks olve the dependency issue of peft==0.4.0 and update the readme. 5 months ago
tests add continue training for distributed opt 5 months ago
tools !350 V3 模型测试适配 5 months ago
.gitignore update .gitignore. 7 months ago
LICENSE Initial commit 11 months ago
OWNERS update Baichuan README and add downstream tasks 5 months ago
README.md olve the dependency issue of peft==0.4.0 and update the readme. 5 months ago
README_en.md olve the dependency issue of peft==0.4.0 and update the readme. 5 months ago
SECURITY.md fork megatron-deepspeed code. 11 months ago
pretrain_baichuan.py 删除llama_model.py,llama模型合并到transformer.py 5 months ago
pretrain_bloom.py Bloom模型适配FA及打通基于megatron框架预训练 5 months ago
pretrain_gpt.py transformer框架升级 6 months ago
pretrain_intern.py 合并脚本后,一些问题修改 5 months ago
pretrain_llama.py baichuan2 update 5 months ago
requirements.txt olve the dependency issue of peft==0.4.0 and update the readme. 5 months ago
setup.py add adapter for te fa 5 months ago