


International Journal of Multimedia Information Retrieval, Volume 11
Volume 11, Number 1, March 2022
- Silvan Heller, Viktor Gsteiger, Werner Bailer, Cathal Gurrin, Björn Þór Jónsson, Jakub Lokoc, Andreas Leibetseder, Frantisek Mejzlík, Ladislav Peska, Luca Rossetto, Konstantin Schall, Klaus Schoeffmann, Heiko Schuldt, Florian Spiess, Ly-Duyen Tran, Lucia Vadicamo, Patrik Veselý, Stefanos Vrochidis, Jiaxin Wu: Interactive video retrieval evaluation at a distance: comparing sixteen interactive video search systems in a remote setting at the 10th Video Browser Showdown. 1-18
- S. Suganyadevi, V. Seethalakshmi, K. Balasamy: A review on deep learning in medical image analysis. 19-38
- Sinda Elghoul, Faouzi Ghorbel: A fast and robust affine-invariant method for shape registration under partial occlusion. 39-59
- Mohammad Farhad Bulbul, Saiful Islam, Zannatul Azme, Preksha Pareek, Md. Humaun Kabir, Hazrat Ali: Enhancing the performance of 3D auto-correlation gradient features in depth action classification. 61-76
- Carlos de la Fuente, Jose J. Valero-Mas, Francisco J. Castellanos, Jorge Calvo-Zaragoza: Multimodal image and audio music transcription. 77-84
Volume 11, Number 2, June 2022
- Devashree R. Patrikar, Mayur Rajaram Parate: Anomaly detection using edge computing in video surveillance system: review. 85-110
- Jie Yan, Yuxiang Xie, Xidao Luan, Yanming Guo, Quanzhi Gong, Suru Feng: Caption TLSTMs: combining transformer with LSTMs for image captioning. 111-121
- Md. Meraz, Md Afzal Ansari, Mohammed Javed, Pavan Chakraborty: DC-GNN: drop channel graph neural network for object classification and part segmentation in the point cloud. 123-133
- Ohoud Nafea, Wadood Abdul, Ghulam Muhammad: Multi-sensor human activity recognition using CNN and GRU. 135-147
- Xiaoyi Wang, Jun Huang: A local representation-enhanced recurrent convolutional network for image captioning. 149-157
- Marco Fisichella: Siamese coding network and pair similarity prediction for near-duplicate image detection. 159-170
- Masum Shah Junayed, Md Baharul Islam, Hassan Imani, Tarkan Aydin: PDS-Net: A novel point and depth-wise separable convolution for real-time object detection. 171-188
- Jian Li, Yanming Guo, Songyang Lao, Xiang Zhao, Liang Bai, Haoran Wang: Few2Decide: towards a robust model via using few neuron connections to decide. 189-198
Volume 11, Number 3, September 2022
- Xiaoping Zhou, Xiangyu Han, Haoran Li, Jia Wang, Xun Liang: Cross-domain image retrieval: methods and applications. 199-218
- Deepak Dagar, Dinesh Kumar Vishwakarma: A literature review and perspectives in deepfakes: generation, detection, and applications. 219-289
- Veronica Naosekpam, Nilkanta Sahu: Text detection, recognition, and script identification in natural scene images: a Review. 291-314
- Ademola Enitan Ilesanmi, Taiwo Ilesanmi, Oluwagbenga Paul Idowu, Drew A. Torigian, Jayaram K. Udupa: Organ segmentation from computed tomography images using the 3D convolutional neural network: a systematic review. 315-331
- Ahmed Iqbal, Muhammad Sharif, Mussarat Yasmin, Mudassar Raza, Shabib Aftab: Generative adversarial networks and its applications in the biomedical image segmentation: a comprehensive survey. 333-368
- Hao Pan, Jun Huang: Semantic-enhanced discriminative embedding learning for cross-modal retrieval. 369-382
- Na He, Sam Ferguson: Music emotion recognition based on segment-level two-stage learning. 383-394
- Ihssane Houhou, Athmane Zitouni, Yassine Ruichek, Salah Eddine Bekhouche, Mohamed Kas, Abdelmalik Taleb-Ahmed: RGBD deep multi-scale network for background subtraction. 395-407
- Sweta Panigrahi, U. S. N. Raju: InceptionDepth-wiseYOLOv2: improved implementation of YOLO framework for pedestrian detection. 409-430
- Mehdi Ellouze: How can users' comments posted on social media videos be a source of effective tags? 431-443
- Deepika Varshney, Dinesh Kumar Vishwakarma: A unified approach of detecting misleading images via tracing its instances on web and analyzing its past context for the verification of multimedia content. 445-459
Volume 11, Number 4, December 2022
- Pranjal Kumar, Piyush Rawat, Siddhartha Chauhan: Contrastive self-supervised learning: review, progress, challenges and future research directions. 461-488
- Pranjal Kumar, Siddhartha Chauhan, Lalit Kumar Awasthi: Human pose estimation using deep learning: review, methodologies, progress and future research directions. 489-521
- Jianlong Wu, Richang Hong, Qi Tian: Special issue on cross-modal retrieval and analysis. 523-524
- Lingtao Meng, Feifei Zhang, Xi Zhang, Changsheng Xu: Prototype local-global alignment network for image-text retrieval. 525-538
- Zhengjie Huang, Zhenguang Liu, Jianhai Chen, Qinming He, Shuang Wu, Lei Zhu, Meng Wang: Who is gambling? Finding cryptocurrency gamblers using multi-modal retrieval methods. 539-551
- Ren Zhang, Ning He, Shengjie Liu, Ying Wu, Kang Yan, Yuzhe He, Ke Lu: Your heart rate betrays you: multimodal learning with spatio-temporal fusion networks for micro-expression recognition. 553-566
- Zefan Zhang, Tianling Jiang, Chunping Liu, Yi Ji: Multi-aware coreference relation network for visual dialog. 567-576
- Keyang Cheng, Xuesen Zhu, Yongzhao Zhan, Yunshen Pei: Video deblurring and flow-guided feature aggregation for obstacle detection in agricultural videos. 577-588
- Xiaowei Zhang, Quan Fang, Jun Hu, Shengsheng Qian, Changsheng Xu: TCKGE: Transformers with contrastive learning for knowledge graph embedding. 589-597
- Silin Cai, Changping Wang, Jiajun Ding, Jun Yu, Jianping Fan: FDAM: full-dimension attention module for deep convolutional neural networks. 599-610
- Yuxiang Xie, Jie Yan, Lai Kang, Yanming Guo, Jiahui Zhang, Xidao Luan: FCT: fusing CNN and transformer for scene classification. 611-618
- Mohammad Javad Parseh, Mohammad Rahmanimanesh, Parviz Keshavarzi, Zohreh Azimifar: Semantic-aware visual scene representation. 619-638
- Mohamed Kas, Youssef El Merabet, Yassine Ruichek, Rochdi Messoussi: Generative adversarial networks for 2D-based CNN pose-invariant face recognition. 639-651
- Benoughidene Abdel Halim, Titouna Faiza: A novel method for video shot boundary detection using CNN-LSTM approach. 653-667
- Zhiguang Liu, Liangwei Wang, Jian Qiao: Visual and semantic ensemble for scene text recognition with gated dual mutual attention. 669-680
- Junyan Yang, Jie Jiang, Yanming Guo: MHA-WoML: Multi-head attention and Wasserstein-OT for few-shot learning. 681-694
- Mohammadreza Sheikh Fathollahi, Rezvan Heidari: Gender classification from face images using central difference convolutional networks. 695-703
- You Yang, Yongzhi An, Juntao Hu, Longyue Pan: Tri-RAT: optimizing the attention scores for image captioning. 705-715
- Stefanos-Iordanis Papadopoulos, Christos Koutlis, Symeon Papadopoulos, Ioannis Kompatsiaris: Multimodal Quasi-AutoRegression: forecasting the visual popularity of new fashion products. 717-729
- Ren Togo, Yuki Honma, Maiku Abe, Takahiro Ogawa, Miki Haseyama: Similar interior coordination image retrieval with multi-view features. 731-740
