A Weighted Multi-Criteria Decision Making Approach for Image Captioning

Galandouz, Hassan Maleki; Moghaddam, Mohsen Ebrahimi; Shamsfard, Mehrnoush

Computer Science > Computer Vision and Pattern Recognition

arXiv:1904.00766 (cs)

[Submitted on 17 Mar 2019]

Title:A Weighted Multi-Criteria Decision Making Approach for Image Captioning

Authors:Hassan Maleki Galandouz, Mohsen Ebrahimi Moghaddam, Mehrnoush Shamsfard

View PDF

Abstract:Image captioning aims at automatically generating descriptions of an image in natural language. This is a challenging problem in the field of artificial intelligence that has recently received significant attention in the computer vision and natural language processing. Among the existing approaches, visual retrieval based methods have been proven to be highly effective. These approaches search for similar images, then build a caption for the query image based on the captions of the retrieved images. In this study, we present a method for visual retrieval based image captioning, in which we use a multi criteria decision making algorithm to effectively combine several criteria with proportional impact weights to retrieve the most relevant caption for the query image. The main idea of the proposed approach is to design a mechanism to retrieve more semantically relevant captions with the query image and then selecting the most appropriate caption by imitation of the human act based on a weighted multi-criteria decision making algorithm. Experiments conducted on MS COCO benchmark dataset have shown that proposed method provides much more effective results in compare to the state-of-the-art models by using criteria with proportional impact weights .

Comments:	12 pages
Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:1904.00766 [cs.CV]
	(or arXiv:1904.00766v1 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.1904.00766

Submission history

From: Hassan Maleki Galandouz [view email]
[v1] Sun, 17 Mar 2019 13:20:01 UTC (1,072 KB)

Full-text links:

Access Paper:

view license

Current browse context:

cs.CV

< prev | next >

new | recent | 2019-04

Change to browse by:

References & Citations

DBLP - CS Bibliography

listing | bibtex

Hassan Maleki Galandouz
Mohsen Ebrahimi Moghaddam
Mehrnoush Shamsfard

export BibTeX citation

✅2024-10-01: arxiv.org is back to normal.✅

Computer Science > Computer Vision and Pattern Recognition

Title:A Weighted Multi-Criteria Decision Making Approach for Image Captioning

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

✅2024-10-01: arxiv.org is back to normal.✅

Computer Science > Computer Vision and Pattern Recognition

Title:A Weighted Multi-Criteria Decision Making Approach for Image Captioning

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators