Are you sure you want to delete this task? Once this task is deleted, it cannot be recovered.
ZJUTER0126 7c50b335a7 | 5 months ago | |
---|---|---|
.github | 5 months ago | |
configs | 5 months ago | |
docs | 5 months ago | |
examples | 5 months ago | |
images | 5 months ago | |
logs | 5 months ago | |
mindcv | 5 months ago | |
requirements | 5 months ago | |
scripts | 5 months ago | |
tests | 5 months ago | |
.DS_Store | 5 months ago | |
.flake8 | 5 months ago | |
.gitattributes | 5 months ago | |
.gitignore | 5 months ago | |
.pre-commit-config.yaml | 5 months ago | |
CONTRIBUTING.md | 5 months ago | |
LICENSE.md | 5 months ago | |
README.md | 5 months ago | |
RELEASE.md | 5 months ago | |
benchmark_results.md | 5 months ago | |
config.py | 5 months ago | |
infer.py | 5 months ago | |
mkdocs.yml | 5 months ago | |
mox_adapter.py | 5 months ago | |
package.sh | 5 months ago | |
profile.py | 5 months ago | |
pyproject.toml | 5 months ago | |
requirements.txt | 5 months ago | |
setup.py | 5 months ago | |
train.py | 5 months ago | |
train_with_func.py | 5 months ago | |
validate.py | 5 months ago | |
validate_with_func.py | 5 months ago |
MaxViT引入了一种新的注意力模块——多轴自注意力(multi-axis self-attention, MaxSA),将传统的自注意机制分解为窗口注意力(Block attention)与网格注意力(Grid attention)两种稀疏形式,
在不损失非局部性的情况下,将普通注意的二次复杂度降低到线性。由于Max-SA的灵活性和可伸缩性,我们可以通过简单地将Max-SA与MBConv在分层体系结构中叠加,从而构建一个称为MaxViT的视觉 Backbone
MaxViT: Multi-Axis Vision Transformer
提供你所使用的数据信息,检查数据版权,通常情况下你需要提供下载数据的链接,数据集的目录结构,数据集大小等信息
使用的数据集:imagenet-1K, 下载地址
数据集大小:共1000个类、224*224彩色图像
训练集:共1,281,167张图像
测试集:共50,000张图像
数据格式:JPEG
└─imagenet
├─train # 训练数据集
└─val # 评估数据集
采用混合精度的训练方法使用支持单精度和半精度数据来提高深度学习神经网络的训练速度,同时保持单精度训练所能达到的网络精度。混合精度训练提高计算速度、减少内存使用的同时,支持在特定硬件上训练更大的模型或实现更大批次的训练。
以FP16算子为例,如果输入数据类型为FP32,MindSpore后台会自动降低精度来处理数据。用户可打开INFO日志,搜索“reduce precision”查看精度降低的算子。
通过官方网站安装MindSpore后,您可以按照如下步骤进行训练和评估:
Ascend处理器环境运行
# 运行分布式训练示例
mpirun -n 8 python train.py --config ./configs/maxvit/maxvit_t_224_ascend.yaml --dataset_path /path/imagenet --ckpt_save_dir /path/ckpt_save_dir --batch_size 64 --distribute True
# 运行评估示例
python3 eval.py --config ./configs/maxvit/maxvit_t_224_ascend.yaml --dataset_path /path/imagenet --ckpt_path /path/ckpt
├── README.md // 自述文件
├── config.py // 配置文件
├── configs // 配置目录
│ ├── README.md // 配置自述文件
│ └── maxvit
│ └── maxvit_t_224_ascend.yaml // maxvit_t_224模型的配置文件
├── infer.py // 推断代码
├── mindcv
│ ├── data
│ │ ├── auto_augment.py // 数据增强模块
│ │ ├── constants.py // 常量定义
│ │ ├── dataset_download.py // 数据集下载模块
│ │ ├── dataset_factory.py // 数据集工厂模块
│ │ ├── distributed_sampler.py// 分布式采样器模块
│ │ ├── loader.py // 数据加载模块
│ │ ├── mixup.py // Mixup模块
│ │ └── transforms_factory.py // 数据预处理模块
│ ├── loss
│ │ ├── asymmetric.py // 不对称损失模块
│ │ ├── binary_cross_entropy_smooth.py // 平滑二值交叉熵损失模块
│ │ ├── cross_entropy_smooth.py // 平滑交叉熵损失模块
│ │ ├── jsd.py // Jensen-Shannon距离损失模块
│ │ └── loss_factory.py // 损失函数工厂模块
│ ├── models
│ │ ├── features.py // 网络特征模块
│ │ ├── maxvit.py // MaxViT模型定义
│ │ ├── helpers.py // 构建网络模块
│ │ ├── model_factory.py // 构建网络模块
│ │ ├── registry.py // 网络注册模块
│ │ └── layers
│ │ ├── activation.py // 激活函数模块
│ │ ├── compatibility.py // 兼容性模块
│ │ ├── conv_norm_act.py // 卷积、归一化和激活模块
│ │ ├── drop_path.py // DropPath模块
│ │ ├── helpers.py // 模型助手函数模块
│ │ ├── identity.py // Identity模块
│ │ ├── mlp.py // MLP模块
│ │ ├── patch_embed.py // Patch Embedding模块
│ │ ├── pooling.py // 池化模块
│ │ ├── selective_kernel.py // 选择性卷积核模块
│ │ └── squeeze_excite.py // Squeeze-and-Excitation模块
│ ├── optim
│ │ ├── adamw.py // AdamW优化器模块
│ │ ├── adan.py // Adaptive Alpha Network优化器模块
│ │ ├── lion.py // Lion优化器模块
│ │ ├── nadam.py // NAdam优化器模块
│ │ └── optim_factory.py // 优化器工厂模块
│ ├── scheduler
│ │ ├── dynamic_lr.py // 动态学习率调度器模块
│ │ └── scheduler_factory.py // 调度器工厂模块
│ ├── utils
│ │ ├── amp.py // Automatic Mixed Precision模块
│ │ ├── callbacks.py // 回调函数模块
│ │ ├── checkpoint_manager.py // 检查点管理器模块
│ │ ├── download.py // 下载工具模块
│ │ ├── logger.py // 日志记录器模块
│ │ ├── path.py // 路径工具模块
│ │ ├── random.py // 随机工具模块
│ │ ├── reduce_manager.py // 分布式训练过程中的梯度平均工具模块
│ │ ├── train_step.py // 训练步骤模块
│ │ └── trainer_factory.py // 训练器工厂模块
│ └── version.py // 版本信息模块
├── network_test.py // 网络测试代码
├── openi.py // Open平台数据模块
├── images
│ └── maxvit.png // maxvit结构
├── requirements
│ ├── dev.txt // 开发环境依赖包列表
│ └── docs.txt // 文档生成依赖包列表
├── requirements.txt // 依赖包列表
├── train.py // 训练代码
├── train_with_func.py // 带有函数的训练代码
├── validate.py // 验证代码
└── validate_with_func.py // 带有函数的验证代码
注解模型中的每个参数,特别是
config.py
中的参数,如有多个配置文件,请注解每一份配置文件的参数
在config.py中可以同时配置训练参数和评估参数。
# system
# python profile.py --config ./configs/maxvit/maxvit_t_224_ascend.yaml --data_dir ../imagenet --epoch_size 4 --batch_size 8
mode: 0
distribute: True
num_parallel_workers: 16
val_while_train: True
val_interval: 1
# dataset
dataset: "imagenet"
data_dir: "/path/to/imagenet"
shuffle: True
dataset_download: False
batch_size: 64
drop_remainder: True
# augmentation
image_resize: 224
scale: [ 0.08, 1.0 ]
ratio: [ 0.75, 1.333 ]
hflip: 0.5
interpolation: "bicubic"
re_prob: 0.1
mixup: 0.2
cutmix: 1.0
cutmix_prob: 1.0
crop_pct: 0.875
color_jitter: [ 0.4, 0.4, 0.4 ]
auto_augment: "randaug-m15-mstd0.5"
# model
model: "maxvit_tiny_tf_224"
num_classes: 1000
pretrained: False
ckpt_path: ""
keep_checkpoint_max: 10
ckpt_save_policy: "top_k"
ckpt_save_dir: "./ckpt"
epoch_size: 300
dataset_sink_mode: True
amp_level: "O2"
# loss
loss: "CE"
loss_scale: 16777216.0
label_smoothing: 0.1
# lr scheduler
scheduler: "cosine_decay"
lr: 0.0005
min_lr: 1e-6
warmup_epochs: 20
decay_epochs: 280
lr_epoch_stair: False
# optimizer
opt: "adamw"
weight_decay: 0.05
filter_bias_and_bn: True
use_nesterov: False
loss_scale_type: dynamic
drop_overflow_update: True
# train
clip_grad: True
clip_value: 2.
drop_path_rate: 0.2
更多配置细节请参考脚本./configs/maxvit/maxvit_t_224_ascend.yaml
。
提供训练信息,区别于quick start,此部分需要提供除用法外的日志等详细信息
参数设置
参数名字 | 参数 |
---|---|
镜像|mindspore_1.8.1_train | |
启动文件 | train.py |
数据集 | imagenet-1K |
运行参数 | 👇 |
ckpt_save_dir | /cache/output/ckpt/ |
distribute | True |
config | configs/maxvit/maxvit_t_224_ascend.yaml |
batch_size | 512 |
资源规格 | NPU: 8*Ascend 910 |
训练checkpoint将被保存在智算平台的下载页面中,你可以从智算平台的日志窗口获取训练结果
[2023-08-28 00:27:56] mindcv.utils.callbacks INFO - Total time since last epoch: 1907.650831(train: 1856.059859, val: 50.042762)s, ETA: 3815.301662s
[2023-08-28 00:27:56] mindcv.utils.callbacks INFO - --------------------------------------------------------------------------------
time="2023-08-28T00:41:45+08:00" level=info msg="auth file has been updated" file="authentication.go:105" Command=obs/upload_by_channels Component=ma-training-toolkit Platform=ModelArts-Service Task=srt_log_collection
time="2023-08-28T00:41:45+08:00" level=info msg="auth file has been updated" file="authentication.go:105" Command=obs/upload_by_channels Component=ma-training-toolkit Platform=ModelArts-Service Task=log_url
time="2023-08-28T00:41:46+08:00" level=info msg="auth info has been updated" file="authentication.go:113" Command=obs/upload_by_channels Component=ma-training-toolkit Platform=ModelArts-Service Task=srt_log_collection
time="2023-08-28T00:41:47+08:00" level=info msg="auth info has been updated" file="authentication.go:113" Command=obs/upload_by_channels Component=ma-training-toolkit Platform=ModelArts-Service Task=log_url
[2023-08-28 00:58:53] mindcv.utils.callbacks INFO - Epoch: [299/300], batch: [2502/2502], loss: 2.047940, lr: 0.000001, time: 1856.441611s
[2023-08-28 00:59:43] mindcv.utils.callbacks INFO - Validation Top_1_Accuracy: 82.7520%, Top_5_Accuracy: 96.2560%, time: 50.930224s
[2023-08-28 00:59:45] mindcv.utils.callbacks INFO - Saving model to ./ckpt/maxvit_tiny_tf_224-299_2502.ckpt
[2023-08-28 00:59:47] mindcv.utils.checkpoint_manager INFO - Top-k accuracy checkpoints:
./ckpt/maxvit_tiny_tf_224-297_2502.ckpt 0.8279600143432617
./ckpt/maxvit_tiny_tf_224-292_2502.ckpt 0.8278399705886841
./ckpt/maxvit_tiny_tf_224-293_2502.ckpt 0.8276200294494629
./ckpt/maxvit_tiny_tf_224-299_2502.ckpt 0.8275200128555298
./ckpt/maxvit_tiny_tf_224-289_2502.ckpt 0.8274799585342407
./ckpt/maxvit_tiny_tf_224-296_2502.ckpt 0.8274399638175964
./ckpt/maxvit_tiny_tf_224-287_2502.ckpt 0.8273600339889526
./ckpt/maxvit_tiny_tf_224-288_2502.ckpt 0.8272200226783752
./ckpt/maxvit_tiny_tf_224-290_2502.ckpt 0.8271600604057312
./ckpt/maxvit_tiny_tf_224-295_2502.ckpt 0.8270799517631531
[2023-08-28 00:59:47] mindcv.utils.callbacks INFO - Total time since last epoch: 1910.868750(train: 1856.489848, val: 50.930224)s, ETA: 1910.868750s
[2023-08-28 00:59:47] mindcv.utils.callbacks INFO - --------------------------------------------------------------------------------
[2023-08-28 01:30:44] mindcv.utils.callbacks INFO - Epoch: [300/300], batch: [2502/2502], loss: 2.281007, lr: 0.000001, time: 1857.371526s
[2023-08-28 01:31:39] mindcv.utils.callbacks INFO - Validation Top_1_Accuracy: 82.7760%, Top_5_Accuracy: 96.2400%, time: 54.921446s
time="2023-08-28T01:31:40+08:00" level=info msg="clean up child process succeed, pid=9138, wstatus=0, exit_status=0" file="cleaner_unix.go:75" Command=bootstrap/run Component=ma-training-toolkit Platform=ModelArts-Service
time="2023-08-28T01:31:40+08:00" level=info msg="clean up child process succeed, pid=9140, wstatus=0, exit_status=0" file="cleaner_unix.go:75" Command=bootstrap/run Component=ma-training-toolkit Platform=ModelArts-Service
time="2023-08-28T01:31:41+08:00" level=info msg="clean up child process succeed, pid=7916, wstatus=0, exit_status=0" file="cleaner_unix.go:75" Command=bootstrap/run Component=ma-training-toolkit Platform=ModelArts-Service
time="2023-08-28T01:31:41+08:00" level=info msg="clean up child process succeed, pid=7940, wstatus=0, exit_status=0" file="cleaner_unix.go:75" Command=bootstrap/run Component=ma-training-toolkit Platform=ModelArts-Service
time="2023-08-28T01:31:41+08:00" level=info msg="clean up child process succeed, pid=9157, wstatus=0, exit_status=0" file="cleaner_unix.go:75" Command=bootstrap/run Component=ma-training-toolkit Platform=ModelArts-Service
time="2023-08-28T01:31:41+08:00" level=info msg="clean up child process succeed, pid=9171, wstatus=0, exit_status=0" file="cleaner_unix.go:75" Command=bootstrap/run Component=ma-training-toolkit Platform=ModelArts-Service
time="2023-08-28T01:31:41+08:00" level=info msg="clean up child process succeed, pid=9571, wstatus=0, exit_status=0" file="cleaner_unix.go:75" Command=bootstrap/run Component=ma-training-toolkit Platform=ModelArts-Service
time="2023-08-28T01:31:41+08:00" level=info msg="clean up child process succeed, pid=9573, wstatus=0, exit_status=0" file="cleaner_unix.go:75" Command=bootstrap/run Component=ma-training-toolkit Platform=ModelArts-Service
time="2023-08-28T01:31:41+08:00" level=info msg="clean up child process succeed, pid=8041, wstatus=0, exit_status=0" file="cleaner_unix.go:75" Command=bootstrap/run Component=ma-training-toolkit Platform=ModelArts-Service
time="2023-08-28T01:31:41+08:00" level=info msg="clean up child process succeed, pid=8057, wstatus=0, exit_status=0" file="cleaner_unix.go:75" Command=bootstrap/run Component=ma-training-toolkit Platform=ModelArts-Service
time="2023-08-28T01:31:41+08:00" level=info msg="clean up child process succeed, pid=8315, wstatus=0, exit_status=0" file="cleaner_unix.go:75" Command=bootstrap/run Component=ma-training-toolkit Platform=ModelArts-Service
time="2023-08-28T01:31:41+08:00" level=info msg="clean up child process succeed, pid=8319, wstatus=0, exit_status=0" file="cleaner_unix.go:75" Command=bootstrap/run Component=ma-training-toolkit Platform=ModelArts-Service
[2023-08-28 01:31:41] mindcv.utils.callbacks INFO - Saving model to ./ckpt/maxvit_tiny_tf_224-300_2502.ckpt
time="2023-08-28T01:31:41+08:00" level=info msg="clean up child process succeed, pid=8913, wstatus=0, exit_status=0" file="cleaner_unix.go:75" Command=bootstrap/run Component=ma-training-toolkit Platform=ModelArts-Service
time="2023-08-28T01:31:41+08:00" level=info msg="clean up child process succeed, pid=8914, wstatus=0, exit_status=0" file="cleaner_unix.go:75" Command=bootstrap/run Component=ma-training-toolkit Platform=ModelArts-Service
[2023-08-28 01:31:43] mindcv.utils.checkpoint_manager INFO - Top-k accuracy checkpoints:
./ckpt/maxvit_tiny_tf_224-297_2502.ckpt 0.8279600143432617
./ckpt/maxvit_tiny_tf_224-292_2502.ckpt 0.8278399705886841
./ckpt/maxvit_tiny_tf_224-300_2502.ckpt 0.8277599811553955
./ckpt/maxvit_tiny_tf_224-293_2502.ckpt 0.8276200294494629
./ckpt/maxvit_tiny_tf_224-299_2502.ckpt 0.8275200128555298
./ckpt/maxvit_tiny_tf_224-289_2502.ckpt 0.8274799585342407
./ckpt/maxvit_tiny_tf_224-296_2502.ckpt 0.8274399638175964
./ckpt/maxvit_tiny_tf_224-287_2502.ckpt 0.8273600339889526
./ckpt/maxvit_tiny_tf_224-288_2502.ckpt 0.8272200226783752
./ckpt/maxvit_tiny_tf_224-290_2502.ckpt 0.8271600604057312
[2023-08-28 01:31:43] mindcv.utils.callbacks INFO - Total time since last epoch: 1916.365701(train: 1857.397630, val: 54.921446)s, ETA: 0.000000s
[2023-08-28 01:31:43] mindcv.utils.callbacks INFO - --------------------------------------------------------------------------------
[2023-08-28 01:31:44] mindcv.utils.callbacks INFO - Finish training!
[2023-08-28 01:31:44] mindcv.utils.callbacks INFO - The best validation Top_1_Accuracy is: 82.7960% at epoch 297.
[2023-08-28 01:31:44] mindcv.utils.callbacks INFO - ================================================================================
参数设置
参数名字 | 参数 |
---|---|
镜像 | mindspore_1.8.1_train |
启动文件 | train.py |
数据集 | imagenet-1K |
运行参数 | 👇 |
ckpt_save_dir | /cache/output/ckpt/ |
distribute | True |
config | configs/maxvit/maxvit_t_224_ascend.yaml |
batch_size | 512 |
资源规格 | NPU: 8*Ascend 910 |
训练checkpoint将被保存在智算平台的下载页面中,你可以从智算平台的日志窗口获取训练结果
[2023-08-28 00:27:56] mindcv.utils.callbacks INFO - Total time since last epoch: 1907.650831(train: 1856.059859, val: 50.042762)s, ETA: 3815.301662s
[2023-08-28 00:27:56] mindcv.utils.callbacks INFO - --------------------------------------------------------------------------------
time="2023-08-28T00:41:45+08:00" level=info msg="auth file has been updated" file="authentication.go:105" Command=obs/upload_by_channels Component=ma-training-toolkit Platform=ModelArts-Service Task=srt_log_collection
time="2023-08-28T00:41:45+08:00" level=info msg="auth file has been updated" file="authentication.go:105" Command=obs/upload_by_channels Component=ma-training-toolkit Platform=ModelArts-Service Task=log_url
time="2023-08-28T00:41:46+08:00" level=info msg="auth info has been updated" file="authentication.go:113" Command=obs/upload_by_channels Component=ma-training-toolkit Platform=ModelArts-Service Task=srt_log_collection
time="2023-08-28T00:41:47+08:00" level=info msg="auth info has been updated" file="authentication.go:113" Command=obs/upload_by_channels Component=ma-training-toolkit Platform=ModelArts-Service Task=log_url
[2023-08-28 00:58:53] mindcv.utils.callbacks INFO - Epoch: [299/300], batch: [2502/2502], loss: 2.047940, lr: 0.000001, time: 1856.441611s
[2023-08-28 00:59:43] mindcv.utils.callbacks INFO - Validation Top_1_Accuracy: 82.7520%, Top_5_Accuracy: 96.2560%, time: 50.930224s
[2023-08-28 00:59:45] mindcv.utils.callbacks INFO - Saving model to ./ckpt/maxvit_tiny_tf_224-299_2502.ckpt
[2023-08-28 00:59:47] mindcv.utils.checkpoint_manager INFO - Top-k accuracy checkpoints:
./ckpt/maxvit_tiny_tf_224-297_2502.ckpt 0.8279600143432617
./ckpt/maxvit_tiny_tf_224-292_2502.ckpt 0.8278399705886841
./ckpt/maxvit_tiny_tf_224-293_2502.ckpt 0.8276200294494629
./ckpt/maxvit_tiny_tf_224-299_2502.ckpt 0.8275200128555298
./ckpt/maxvit_tiny_tf_224-289_2502.ckpt 0.8274799585342407
./ckpt/maxvit_tiny_tf_224-296_2502.ckpt 0.8274399638175964
./ckpt/maxvit_tiny_tf_224-287_2502.ckpt 0.8273600339889526
./ckpt/maxvit_tiny_tf_224-288_2502.ckpt 0.8272200226783752
./ckpt/maxvit_tiny_tf_224-290_2502.ckpt 0.8271600604057312
./ckpt/maxvit_tiny_tf_224-295_2502.ckpt 0.8270799517631531
[2023-08-28 00:59:47] mindcv.utils.callbacks INFO - Total time since last epoch: 1910.868750(train: 1856.489848, val: 50.930224)s, ETA: 1910.868750s
[2023-08-28 00:59:47] mindcv.utils.callbacks INFO - --------------------------------------------------------------------------------
[2023-08-28 01:30:44] mindcv.utils.callbacks INFO - Epoch: [300/300], batch: [2502/2502], loss: 2.281007, lr: 0.000001, time: 1857.371526s
[2023-08-28 01:31:39] mindcv.utils.callbacks INFO - Validation Top_1_Accuracy: 82.7760%, Top_5_Accuracy: 96.2400%, time: 54.921446s
time="2023-08-28T01:31:40+08:00" level=info msg="clean up child process succeed, pid=9138, wstatus=0, exit_status=0" file="cleaner_unix.go:75" Command=bootstrap/run Component=ma-training-toolkit Platform=ModelArts-Service
time="2023-08-28T01:31:40+08:00" level=info msg="clean up child process succeed, pid=9140, wstatus=0, exit_status=0" file="cleaner_unix.go:75" Command=bootstrap/run Component=ma-training-toolkit Platform=ModelArts-Service
time="2023-08-28T01:31:41+08:00" level=info msg="clean up child process succeed, pid=7916, wstatus=0, exit_status=0" file="cleaner_unix.go:75" Command=bootstrap/run Component=ma-training-toolkit Platform=ModelArts-Service
time="2023-08-28T01:31:41+08:00" level=info msg="clean up child process succeed, pid=7940, wstatus=0, exit_status=0" file="cleaner_unix.go:75" Command=bootstrap/run Component=ma-training-toolkit Platform=ModelArts-Service
time="2023-08-28T01:31:41+08:00" level=info msg="clean up child process succeed, pid=9157, wstatus=0, exit_status=0" file="cleaner_unix.go:75" Command=bootstrap/run Component=ma-training-toolkit Platform=ModelArts-Service
time="2023-08-28T01:31:41+08:00" level=info msg="clean up child process succeed, pid=9171, wstatus=0, exit_status=0" file="cleaner_unix.go:75" Command=bootstrap/run Component=ma-training-toolkit Platform=ModelArts-Service
time="2023-08-28T01:31:41+08:00" level=info msg="clean up child process succeed, pid=9571, wstatus=0, exit_status=0" file="cleaner_unix.go:75" Command=bootstrap/run Component=ma-training-toolkit Platform=ModelArts-Service
time="2023-08-28T01:31:41+08:00" level=info msg="clean up child process succeed, pid=9573, wstatus=0, exit_status=0" file="cleaner_unix.go:75" Command=bootstrap/run Component=ma-training-toolkit Platform=ModelArts-Service
time="2023-08-28T01:31:41+08:00" level=info msg="clean up child process succeed, pid=8041, wstatus=0, exit_status=0" file="cleaner_unix.go:75" Command=bootstrap/run Component=ma-training-toolkit Platform=ModelArts-Service
time="2023-08-28T01:31:41+08:00" level=info msg="clean up child process succeed, pid=8057, wstatus=0, exit_status=0" file="cleaner_unix.go:75" Command=bootstrap/run Component=ma-training-toolkit Platform=ModelArts-Service
time="2023-08-28T01:31:41+08:00" level=info msg="clean up child process succeed, pid=8315, wstatus=0, exit_status=0" file="cleaner_unix.go:75" Command=bootstrap/run Component=ma-training-toolkit Platform=ModelArts-Service
time="2023-08-28T01:31:41+08:00" level=info msg="clean up child process succeed, pid=8319, wstatus=0, exit_status=0" file="cleaner_unix.go:75" Command=bootstrap/run Component=ma-training-toolkit Platform=ModelArts-Service
[2023-08-28 01:31:41] mindcv.utils.callbacks INFO - Saving model to ./ckpt/maxvit_tiny_tf_224-300_2502.ckpt
time="2023-08-28T01:31:41+08:00" level=info msg="clean up child process succeed, pid=8913, wstatus=0, exit_status=0" file="cleaner_unix.go:75" Command=bootstrap/run Component=ma-training-toolkit Platform=ModelArts-Service
time="2023-08-28T01:31:41+08:00" level=info msg="clean up child process succeed, pid=8914, wstatus=0, exit_status=0" file="cleaner_unix.go:75" Command=bootstrap/run Component=ma-training-toolkit Platform=ModelArts-Service
[2023-08-28 01:31:43] mindcv.utils.checkpoint_manager INFO - Top-k accuracy checkpoints:
./ckpt/maxvit_tiny_tf_224-297_2502.ckpt 0.8279600143432617
./ckpt/maxvit_tiny_tf_224-292_2502.ckpt 0.8278399705886841
./ckpt/maxvit_tiny_tf_224-300_2502.ckpt 0.8277599811553955
./ckpt/maxvit_tiny_tf_224-293_2502.ckpt 0.8276200294494629
./ckpt/maxvit_tiny_tf_224-299_2502.ckpt 0.8275200128555298
./ckpt/maxvit_tiny_tf_224-289_2502.ckpt 0.8274799585342407
./ckpt/maxvit_tiny_tf_224-296_2502.ckpt 0.8274399638175964
./ckpt/maxvit_tiny_tf_224-287_2502.ckpt 0.8273600339889526
./ckpt/maxvit_tiny_tf_224-288_2502.ckpt 0.8272200226783752
./ckpt/maxvit_tiny_tf_224-290_2502.ckpt 0.8271600604057312
[2023-08-28 01:31:43] mindcv.utils.callbacks INFO - Total time since last epoch: 1916.365701(train: 1857.397630, val: 54.921446)s, ETA: 0.000000s
[2023-08-28 01:31:43] mindcv.utils.callbacks INFO - --------------------------------------------------------------------------------
[2023-08-28 01:31:44] mindcv.utils.callbacks INFO - Finish training!
[2023-08-28 01:31:44] mindcv.utils.callbacks INFO - The best validation Top_1_Accuracy is: 82.7960% at epoch 297.
[2023-08-28 01:31:44] mindcv.utils.callbacks INFO - ================================================================================
参数设置
参数名字 | 参数 |
---|---|
AI引擎 | MindSpore_1.8.1-aarch64 |
数据集 | imagenet-1K |
启动文件 | validate.py |
运行参数 | 👇 |
config | configs/maxvit/maxvit_t_224_ascend.yaml |
资源规格 | NPU: 1*Ascend 910 |
提供您训练性能的详细描述,例如finishing loss, throughput, checkpoint size等
你可以参考如下模板
Parameters | Ascend 910 |
---|---|
Model Version | maxvit_tiny_224 |
Resource | Ascend 910; CPU 2.60GHz, 192cores; Memory 755G; OS Euler2.8 |
uploaded Date | 11/29/2023 (month/day/year) |
MindSpore Version | 1.8.1 |
Dataset | imagenet-1K |
Training Parameters | epoch=300, batch_size=512 |
Optimizer | Adamw |
Loss Function | Cross Entropy |
outputs | probability |
Loss | 2.04 |
Speed | 1856 s/epoch(8pcs) |
Total time | 6 days 16 hours |
Parameters (M) | 30.9 |
提供推理性能的详细描述,包括耗时,精度等
你可以参照如下模板
Parameters | Ascend |
---|---|
Model Version | maxvit_tiny_224 |
Resource | Ascend 910; OS Euler2.8 |
Uploaded Date | 11/29/2023 (month/day/year) |
MindSpore Version | 1.8.1 |
Dataset | imagenet-1K |
batch_size | 64 |
outputs | probability |
Accuracy | 82.79% |
启智平台升级可能导致有些超参数传递需要调整!
此部分根据自己的情况进行更改,填写自己的院校和邮箱
请浏览官方主页。
Dear OpenI User
Thank you for your continuous support to the Openl Qizhi Community AI Collaboration Platform. In order to protect your usage rights and ensure network security, we updated the Openl Qizhi Community AI Collaboration Platform Usage Agreement in January 2024. The updated agreement specifies that users are prohibited from using intranet penetration tools. After you click "Agree and continue", you can continue to use our services. Thank you for your cooperation and understanding.
For more agreement content, please refer to the《Openl Qizhi Community AI Collaboration Platform Usage Agreement》