Marvin_CY 43d9bcc761 update		2 years ago
ascend310_infer	update	2 years ago

scripts	update	2 years ago

src	update	2 years ago

README.md	update	2 years ago

eval.py	update	2 years ago

export.py	update	2 years ago

postprocess.py	update	2 years ago

preprocess.py	update	2 years ago

requirements.txt	update	2 years ago

train.py	update StarGAN	3 years ago

README.md

Contents
StarGAN-description
Dataset
Environment Requirements
Script Description
Model Description
- Performance
- - Evaluation Performance
  - Inference Performance
ModelZoo Homepage

StarGAN-description

StarGAN: Unified Generative Adversarial Networks for Multi-Domain Image-to-Image Translation

Yunjey Choi^1,2, Minje Choi^1,2, Munyoung Kim^2,3, Jung-Woo Ha², Sung Kim^2,4, Jaegul Choo^1,2

¹Korea University, ²Clova AI Research, NAVER Corp.

³The College of New Jersey, ⁴Hong Kong University of Science and Technology

https://arxiv.org/abs/1711.09020

Abstract: Recent studies have shown remarkable success in image-to-image translation for two domains. However, existing approaches have limited scalability and robustness in handling more than two domains, since different models should be built independently for every pair of image domains. To address this limitation, we propose StarGAN, a novel and scalable approach that can perform image-to-image translations for multiple domains using only a single model. Such a unified model architecture of StarGAN allows simultaneous training of multiple datasets with different domains within a single network. This leads to StarGAN's superior quality of translated images compared to existing models as well as the novel capability of flexibly translating an input image to any desired target domain. We empirically demonstrate the effectiveness of our approach on a facial attribute transfer and a facial expression synthesis tasks.

Dataset

Note that you can run the scripts based on the dataset mentioned in original paper or widely used in relevant domain/network architecture. In the following sections, we will introduce how to run the scripts using the related dataset below.

Dataset used: CelebA

CelebFaces Attributes Dataset (CelebA) is a large-scale face attributes dataset with more than 200K celebrity images, each with 40 attribute annotations. The images in this dataset cover large pose variations and background clutter. CelebA has large diversities, large quantities, and rich annotations, including

10,177 number of identities,
202,599 number of face images, and 5 landmark locations, 40 binary attributes annotations per image.

The dataset can be employed as the training and test sets for the following computer vision tasks: face attribute recognition, face detection, landmark (or facial part) localization, and face editing & synthesis.

Environment Requirements

Hardware Ascend
- Prepare hardware environment with Ascend processor.
Framework
- MindSpore
For more information, please check the resources below：
- MindSpore Tutorials
- MindSpore Python API

Script Description

Script and Sample Code

.
└─ cv
  └─ StarGAN
    ├── ascend310_infer                    # 310 infer directory
    ├─ src
      ├─ __init__.py                       # init file
      ├─ cell.py                           # StarGAN model define
      ├─ model.py                          # define subnetwork about generator and discriminator
      ├─ utils.py                          # utils for StarGAN
      ├─ config.py                         # parse args
      ├─ dataset.py                        # prepare celebA dataset to cyclegan format
      ├─ reporter.py                       # Reporter class
      └─ loss.py                           # losses for StarGAN
    ├─ scripts
      ├─ run_distribute_train.sh           # launch distributed training(8p) in ascend
      ├─ run_standalone_train_ascend.sh    # launch standalone training(1p) in ascend
      ├─ eval_ascend.sh                    # launch evaluating in ascend
      └─ run_infer_310.sh                  # shell script for 310 inference
    ├─ eval.py                             # translate attritubes from original images
    ├─ train.py                            # train script
    ├─ export.py                           # export mindir script
    ├─ preprocess.py                       # convert images and labels to bin
    ├─ postprocess.py                      # convert bin to images
    └─ README.md                           # descriptions about StarGAN

Training Process

When training the network, you should selected the attributes in config, then you should change the c_dim in config which is same as the number of selected attributes.

python train.py

Prediction Process

python eval.py

Note: the result will saved at ./results/.

Ascen 310 infer

[Export MindIR]

python export.py

Note: The file_name parameter is the prefix, the final file will as StarGAN_G.[FILE_FORMAT].

Infer on Ascend 310

bash run_infer_310.sh [MINDIR_PATH] [DATA_PATH] [DEVICE_ID]

MINDIR_PATH Directionary of MINDIR
DATA_PATH Directionary of dataset
DEVICE_ID Optional, default 0

Model Description

Performance

Evaluation Performance

Parameters	Ascend 910
Model Version	StarGAN
Resource	Ascend
uploaded Date	03/30/2021 (month/day/year)
MindSpore Version	1.1.1
Dataset	CelebA
Training Parameters	steps=200000, batch_size=1, lr=0.0001
Optimizer	Adam
outputs	image
Speed	1pc: 100 ms/step;
Total time	1pc: 10h;
Parameters (M)	8.423 M
Checkpoint for Fine tuning	32.15M (.ckpt file)
Scripts	StarGAN script

Inference Performance

Parameters	Ascend 910
Model Version	StarGAN
Resource	Ascend
Uploaded Date	03/30/2021 (month/day/year)
MindSpore Version	1.1.1
Dataset	CelebA
batch_size	4
outputs	image

Parameters	Ascend 310
Model Version	StarGAN
Resource	Ascend
Uploaded Date	09/17/2021 (month/day/year)
MindSpore Version	1.2
Dataset	CelebA
batch_size	1
outputs	image

ModelZoo Homepage

Please check the official homepage.

No Description

mindspore

Python C++ Shell Text

marvin_tec@126.com

How to access data resources in code

README.md

Contents

[Export MindIR]

Infer on Ascend 310

Evaluation Performance

Inference Performance

Contributors (2) All

Contributors (2)
All