CIDEr: Consensus-based image description evaluation

[J]. computer vision and pattern recognition, 2015: 4566-4575.