i-robot b49e04359d
!3067 [dev] glm2 YAML cleanup
5 days ago
bert update doc and yaml for customized output_dir 4 months ago
bloom update doc and yaml for customized output_dir 4 months ago
clip update doc and yaml for customized output_dir 4 months ago
codegeex2 add glm2 semi-auto parallel eval support 3 months ago
codellama [check] codellama config changes 6 days ago
convert_config 1. convert MindSpore weights to torch weights 2 months ago
general add loss_repeated_mean and warmup_epochs arguments to TrainingArguments 1 month ago
glm notes on weight-loading feature optimizations 4 months ago
glm2 glm2 YAML cleanup 6 days ago
glm3 glm3 YAML cleanup 6 days ago
gpt2 [dev] remove calls to ifa in gpt2 and multiheadattention 3 weeks ago
llama [dev] remove deprecated parameters to avoid warnings 1 month ago
llama2 !3003 [check] rename llama2 configs 5 days ago
mae update doc and yaml for customized output_dir 4 months ago
pangualpha update doc and yaml for customized output_dir 4 months ago
qa update doc and yaml for customized output_dir 4 months ago
sam switch AutoModel and PretrianedModel base classes 2 months ago
swin update doc and yaml for customized output_dir 4 months ago
t5 update doc and yaml for customized output_dir 4 months ago
tokcls update doc and yaml for customized output_dir 4 months ago
txtcls update doc and yaml for customized output_dir 4 months ago
vit fix vit config issue 2 months ago
README.md fixed 3df0904 from https://gitee.com/Lin-Bert/transformer/pulls/2952 1 week ago

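The MindSpore Transformers suite aims to be a full-lifecycle development kit for large models, covering training, fine-tuning, evaluation, inference, and deployment. It provides the industry's mainstream Transformer-based pretrained models and SOTA downstream task applications, along with a rich set of parallelism features. The goal is to help users carry out large-model training and innovative research and development with ease.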

Jupyter Notebook Python Markdown Shell