
HSPLAN

A Transformer-based image captioning model (Image Caption Model)

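Project Overview

1. Functionality

The model generates text descriptions from image features: given the features extracted from a raw image, it produces a sentence that describes that image.

2. Performance

The code is built on the PyTorch framework. At generation time, the model outputs grammatically correct, coherent sentences.

3. Evaluation Metrics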

CIDEr-D(Consensus-based Image Description Evaluation), BLEU(Bilingual Evaluation Understudy), METEOR(Metric for Evaluation of Translation with Explicit ORdering), ROUGE-L(Recall-Oriented Understudy for Gisting Evaluation), SPICE(Semantic Propositional Image Caption Evaluation)
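To give a feel for what these n-gram based metrics measure, here is a minimal sketch of clipped (modified) n-gram precision, the building block of BLEU. It is not the evaluation code used by this repository; the function names `ngrams` and `clipped_precision` and the example sentences are hypothetical, and full BLEU additionally combines several n-gram orders and applies a brevity penalty.

```python
from collections import Counter


def ngrams(tokens, n):
    """Return the list of n-grams (as tuples) in a token list."""
    return [tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1)]


def clipped_precision(candidate, references, n):
    """Modified n-gram precision: each candidate n-gram count is clipped
    to the maximum count of that n-gram in any single reference."""
    cand_counts = Counter(ngrams(candidate, n))
    if not cand_counts:
        return 0.0
    max_ref_counts = Counter()
    for ref in references:
        for gram, count in Counter(ngrams(ref, n)).items():
            max_ref_counts[gram] = max(max_ref_counts[gram], count)
    clipped = sum(min(count, max_ref_counts[gram])
                  for gram, count in cand_counts.items())
    return clipped / sum(cand_counts.values())


# Hypothetical generated caption and two hypothetical reference captions.
candidate = "a dog runs on the grass".split()
references = ["a dog is running on the grass".split(),
              "the dog runs across the lawn".split()]
p1 = clipped_precision(candidate, references, 1)
p2 = clipped_precision(candidate, references, 2)
print(p1, p2)  # 1.0 0.8
```

Every unigram of the candidate appears in some reference, so the unigram precision is 1.0, while only four of the five bigrams are matched, which gives 0.8. Clipping is what stops a degenerate caption such as "the the the" from scoring well against a reference that contains "the" only once.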

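4. Dataset

The model uses pretrained bottom-up attention features extracted from the MSCOCO dataset (https://github.com/peteanderson80/bottom-up-attention).

Running and Environment Dependencies

The environment and dependencies required to run the code: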

Category	Name	Version
Operating system	Ubuntu	16.04
Programming language	Python	3.6
Low-level driver	CUDA	9.0
Deep learning framework	PyTorch	1.0

Input and Output

The inputs and outputs of the code are as follows:

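Name	Description
Input	fc_feats [global features extracted from the image], att_feats [local (region-level) features extracted from the image], att_masks [mask over the image features]
Output	The caption corresponding to the image features, together with the running time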
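The relationship between the three inputs can be illustrated with a small, framework-free sketch. It rests on assumptions that the table does not confirm: that `fc_feats` is the mean of an image's region features, that `att_feats` is zero-padded to the largest region count in the batch, and that `att_masks` uses 1 for real regions and 0 for padding. The helper `build_batch` is hypothetical, and the real code would hold these values in PyTorch tensors with a much larger feature dimension.

```python
def build_batch(region_feats_per_image):
    """Pad per-image region features to a common length and build masks.

    region_feats_per_image: list of images, each a list of region
    feature vectors (lists of floats) of identical dimension.
    Returns (fc_feats, att_feats, att_masks).
    """
    dim = len(region_feats_per_image[0][0])
    max_regions = max(len(regions) for regions in region_feats_per_image)
    fc_feats, att_feats, att_masks = [], [], []
    for regions in region_feats_per_image:
        n = len(regions)
        # Global feature: mean over the region features (an assumption).
        fc_feats.append([sum(r[d] for r in regions) / n for d in range(dim)])
        # Zero-pad so that every image has max_regions rows.
        padding = [[0.0] * dim for _ in range(max_regions - n)]
        att_feats.append([list(r) for r in regions] + padding)
        # 1 marks a real region, 0 marks a padded row (an assumption).
        att_masks.append([1] * n + [0] * (max_regions - n))
    return fc_feats, att_feats, att_masks


# Toy batch with 2-dimensional features, for illustration only.
batch = [
    [[1.0, 2.0], [3.0, 4.0], [5.0, 6.0]],  # image with 3 regions
    [[2.0, 2.0]],                          # image with 1 region
]
fc_feats, att_feats, att_masks = build_batch(batch)
print(att_masks)  # [[1, 1, 1], [1, 0, 0]]
print(fc_feats)   # [[3.0, 4.0], [2.0, 2.0]]
```

The mask lets an attention layer ignore the padded rows, so images with different numbers of detected regions can share one batch.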

How to Run

Run the following commands in a terminal.

cd project_dir
python HSPLAN.py
