Fine-tune ChatGLM-6B with full parameters across four GPUs using tensor parallelism:

```shell
export MODEL='THUDM/chatglm-6b'
export DATA='data'
python -u -m paddle.distributed.launch --gpus "0,1,2,3" finetune_generation.py \
    --model_name_or_path $MODEL \
    --dataset_name_or_path $DATA \
    --output_dir ./checkpoints \
    --per_device_train_batch_size 4 \
    --gradient_accumulation_steps 2 \
    --per_device_eval_batch_size 8 \
    --num_train_epochs 3 \
    --learning_rate 3e-5 \
    --warmup_steps 30 \
    --logging_steps 1 \
    --evaluation_strategy epoch \
    --save_strategy epoch \
    --src_length 1024 \
    --tgt_length 1024 \
    --fp16 \
    --fp16_opt_level O2 \
    --do_train \
    --do_eval \
    --disable_tqdm True \
    --load_best_model_at_end True \
    --metric_for_best_model accuracy \
    --eval_with_do_generation False \
    --recompute \
    --save_total_limit 1 \
    --tensor_parallel_degree 4
```
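The `--dataset_name_or_path` directory is expected to hold train/dev splits as JSON-lines files. Below is a minimal sketch of preparing such a directory; the file names `train.json`/`dev.json`, the `src`/`tgt` field names, and the sample records are assumptions for illustration, not confirmed by this README:

```python
import json
import os

# Hypothetical example records; the "src"/"tgt" field names are an assumption.
samples = [
    {"src": "Describe the product: wireless earbuds", "tgt": "Compact earbuds with long battery life."},
    {"src": "Translate to French: good morning", "tgt": "Bonjour"},
]

os.makedirs("data", exist_ok=True)
for split in ("train.json", "dev.json"):
    with open(os.path.join("data", split), "w", encoding="utf-8") as f:
        for rec in samples:
            # One JSON object per line (JSON-lines format).
            f.write(json.dumps(rec, ensure_ascii=False) + "\n")
```

Check your actual data against the loader in `data.py` before training, since the expected schema may differ.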
Fine-tune with LoRA on a single GPU:

```shell
export MODEL='THUDM/chatglm-6b'
export DATA='data'
python finetune_generation.py \
    --model_name_or_path $MODEL \
    --dataset_name_or_path $DATA \
    --output_dir ./checkpoints \
    --per_device_train_batch_size 4 \
    --gradient_accumulation_steps 2 \
    --per_device_eval_batch_size 8 \
    --num_train_epochs 1 \
    --learning_rate 3e-4 \
    --warmup_steps 30 \
    --logging_steps 1 \
    --evaluation_strategy epoch \
    --save_strategy epoch \
    --src_length 1024 \
    --tgt_length 1024 \
    --fp16 \
    --fp16_opt_level O2 \
    --do_train \
    --do_eval \
    --disable_tqdm True \
    --load_best_model_at_end True \
    --metric_for_best_model accuracy \
    --eval_with_do_generation False \
    --recompute \
    --save_total_limit 1 \
    --lora True
```
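After LoRA training, the low-rank adapter weights can be folded back into the frozen base weights (the repository's `merge_lora_params.py` presumably performs this step; its flags are not documented here). The sketch below only illustrates the merge arithmetic, W' = W + (alpha/r)·B·A, with toy shapes:

```python
import numpy as np

rng = np.random.default_rng(0)
d, r, alpha = 8, 2, 4          # hidden size, LoRA rank, scaling factor (toy values)
W = rng.normal(size=(d, d))    # frozen base weight
A = rng.normal(size=(r, d))    # trainable down-projection
B = np.zeros((d, r))           # trainable up-projection (zero-initialized)

# Merged weight: base plus the scaled low-rank update.
W_merged = W + (alpha / r) * (B @ A)
```

With `B` zero-initialized, the update is zero and the merge is a no-op, which mirrors LoRA's standard initialization: training starts from the unmodified base model.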
Fine-tune with prefix tuning on a single GPU:

```shell
export MODEL='THUDM/chatglm-6b'
export DATA='data'
python finetune_generation.py \
    --model_name_or_path $MODEL \
    --dataset_name_or_path $DATA \
    --output_dir ./checkpoints \
    --per_device_train_batch_size 4 \
    --gradient_accumulation_steps 2 \
    --per_device_eval_batch_size 8 \
    --num_train_epochs 1 \
    --learning_rate 3e-2 \
    --warmup_steps 30 \
    --logging_steps 1 \
    --evaluation_strategy epoch \
    --save_strategy epoch \
    --src_length 1024 \
    --tgt_length 1024 \
    --fp16 \
    --fp16_opt_level O2 \
    --do_train \
    --do_eval \
    --disable_tqdm True \
    --load_best_model_at_end True \
    --metric_for_best_model accuracy \
    --eval_with_do_generation False \
    --recompute \
    --save_total_limit 1 \
    --prefix_tuning True
```
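Prefix tuning trains only a small set of virtual-token key/value vectors per transformer layer, which is why it (like LoRA) tolerates the much higher learning rate above. A back-of-the-envelope sketch of the trainable parameter count, using ChatGLM-6B's published layer count and hidden size; the prefix length here is a made-up example, not a value taken from this repository:

```python
num_layers, hidden = 28, 4096   # ChatGLM-6B configuration
prefix_len = 64                 # hypothetical number of virtual prefix tokens

# Each layer learns prefix_len key vectors and prefix_len value vectors.
prefix_params = num_layers * 2 * prefix_len * hidden
base_params = 6_000_000_000     # ~6B parameters in the frozen base model

print(f"prefix tuning trains {prefix_params:,} params "
      f"(~{prefix_params / base_params:.3%} of the base model)")
```

Even with a generous prefix length, the trainable fraction stays well under one percent of the base model, so checkpoints are small and single-GPU training is feasible.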