#10 计算BLEU指标用于评估攻击效果

Closed
created 1 year ago by 1030514181 · 2 comments
1030514181 self-assigned this 1 year ago
1030514181 commented 1 year ago
Owner
Bleu是IBM在2002提出的,用于机器翻译任务的评价,发表在ACL,引用次数10000+,原文题目是“BLEU: a Method for Automatic Evaluation of Machine Translation”。 它的总体思想就是准确率,假如给定标准译文reference,神经网络生成的句子是candidate,句子长度为n,candidate中有m个单词出现在reference,m/n就是bleu的1-gram的计算公式。 BLEU还有许多变种。根据n-gram可以划分成多种评价指标,常见的指标有BLEU-1、BLEU-2、BLEU-3、BLEU-4四种,其中n-gram指的是连续的单词个数为n。 BLEU-1衡量的是单词级别的准确性,更高阶的bleu可以衡量句子的流畅性。
1030514181 commented 1 year ago
Owner
计算公式如下: ![image](/attachments/249d50eb-2ea0-4043-843a-39f2813ecd58)
1030514181 started working 1 year ago
1030514181 closed this issue 1 year ago
1030514181 stopped working 1 year ago
4s
Sign in to join this conversation.
No Label
No Milestone
No Assignees
1 Participants
Notifications
Total Time Spent: 4s
Due Date

No due date set.

Dependencies

This issue currently doesn't have any dependencies.

Loading…
There is no content yet.