Automatic Description Generation from Images: A Survey of Models, Datasets, and Evaluation Measures (Extended  Abstract)

Raffaella Bernardi; Ruket Cakici; Desmond Elliott; Aykut Erdem; Erkut Erdem; Nazli Ikizler-Cinbis; Frank Keller; Adrian Muscat; Barbara Plank

Automatic Description Generation from Images: A Survey of Models, Datasets, and Evaluation Measures (Extended Abstract)

Raffaella Bernardi, Ruket Cakici, Desmond Elliott, Aykut Erdem, Erkut Erdem, Nazli Ikizler-Cinbis, Frank Keller, Adrian Muscat, Barbara Plank

Proceedings of the Twenty-Sixth International Joint Conference on Artificial Intelligence

Journal track. Pages 4970-4974. https://doi.org/10.24963/ijcai.2017/704

PDF BibTeX

Automatic image description generation is a challenging problem that has recently received a large amount of interest from the computer vision and natural language processing communities. In this survey, we classify the known approaches based on how they conceptualise this problem and provide a review of existing models, highlighting their advantages and disadvantages. Moreover, we give an overview of the benchmark image-text datasets and the evaluation measures that have been developed to assess the quality of machine-generated descriptions. Finally we explore future directions in the area of automatic image description.

Keywords:

Natural Language Processing: Natural Language Generation

Robotics and Vision: Vision and Perception