Image and Vision Computing, Volume 148
Volume 148, 2024
- Praneeth Nemani, Venkata Surya Sundar Vadali, Prathistith Raj Medi, Ashish Marisetty, Satyanarayana Vollala, Santosh Kumar:
  Cross-modal hybrid architectures for gastrointestinal tract image analysis: A systematic review and futuristic applications. 105068
- Shuang Gong, Zhu Teng, Rui Li, Jack Fan, Baopeng Zhang, Jianping Fan:
  MINet: Modality interaction network for unified multi-modal tracking. 105071
- Eduardo de O. Andrade, Joris Guérin, José Viterbo, Igor Garcia Ballhausen Sampaio:
  Adversarial attacks and defenses in person search: A systematic mapping study and taxonomy. 105096
- Chenglin Zhou, Wei Zhang, Zhichao Lian:
  Enhancing consistency in virtual try-on: A novel diffusion-based approach. 105097
- Muhammad Imran, Muhammad Usman Akram, Mohsin Islam Tiwana, Anum Abdul Salam, Danilo Greco:
  Two-dimensional hybrid incremental learning (2DHIL) framework for semantic segmentation of skin tissues. 105098
- V. V. Sajith Variyar, V. Sowmya, Ramesh Sivanpillai, Gregory K. Brown:
  A multi-branch dual attention segmentation network for epiphyte drone images. 105099
- Xiujin Zhu, Chee-Onn Chow, Joon Huang Chuah:
  From darkness to clarity: A comprehensive review of contemporary image shadow removal research (2017-2023). 105100
- Siqi Lu, Fengxu Guan, Haitao Lai:
  Underwater image enhancement based on global features and prior distribution guided. 105101
- Qintong Li, Yong Ma, Jun Huang, Can Zhang, Zhao Cai:
  LELD: Learn enhancement by learning degradation. 105102
- Mengmei Sang, Shengwei Tian, Long Yu, Guoqi Wang, Yue Peng:
  Environmentally adaptive fast object detection in UAV images. 105103
- Ah-Hyung Shin, Jae-Ho Lee, Jiwon Hwang, Yoonhyung Kim, Gyeong-Moon Park:
  Wav2NeRF: Audio-driven realistic talking head generation via wavelet-based NeRF. 105104
- Hengyou Wang, Kani Song, Xiang Jiang, Zhiquan He:
  ragBERT: Relationship-aligned and grammar-wise BERT model for image captioning. 105105
- Bahareh Ghari, Ali Tourani, Asadollah Shahbahrami, Georgi Gaydadjiev:
  Pedestrian detection in low-light conditions: A comprehensive survey. 105106
- Ruixu Wu, Yanli Liu, Xiaogang Wang, Peilin Yang:
  Visual tracking based on spatiotemporal transformer and fusion sequences. 105107
- Zhikang Zhao, Yongcheng Wang, Ning Zhang, Yuxi Zhang, Zheng Li, Chi Chen:
  A method of degradation mechanism-based unsupervised remote sensing image super-resolution. 105108
- Mengfei He, Zhiyou Yang, Guangben Zhang, Yan Long, Huaibo Song:
  IIMT-net: Poly-1 weights balanced multi-task network for semantic segmentation and depth estimation using interactive information. 105109
- Sangwon Choi, Daejune Choi, Duksu Kim:
  TIE-KD: Teacher-independent and explainable knowledge distillation for monocular depth estimation. 105110
- Gengsheng Xie, Hanbing Su, Yong Luo, Wenle Wang, Yugen Yi, Shan Zhong:
  Person re-identification by utilizing hierarchical spatial relation reasoning. 105111
- Hailong Jin, Huiying Li:
  An enhanced approach for few-shot segmentation via smooth downsampling mask and label smoothing loss. 105113
- Mingyu Yuan, Songwei Pei:
  RAD-BNN: Regulating activation distribution for accurate binary neural network. 105114
- Lingna Gao, Rencan Nie, Jinde Cao, Gucheng Zhang:
  DFG-HCEN: A distinctive-feature guided and hierarchical channel enhanced network-based infrared and visible image fusion. 105115
- Laigan Luo, Benshun Yi, Zhongyuan Wang, Zheng He, Chao Zhu:
  Bidirectional scale-aware upsampling network for arbitrary-scale video super-resolution. 105116
- Xiangyang Wang, Yuhui Tian, Fudi Geng, Rui Wang:
  DFSTrack: Dual-stream fusion Siamese network for human pose tracking in videos. 105117
- Anping Cai, Leiting Chen, Yongqi Chen, Ziyu He, Shuqing Tao, Chuan Zhou:
  Adaptive attribute distribution similarity for few-shot learning. 105118
- Bahy Helmi Hartoyo Putra, Cheol Jeong:
  Video captioning based on dual learning via multiple reconstruction blocks. 105119
- Jianhua Qiu, Weihua Liu, Chaochao Lin, Jiaojiao Li, Haoping Yu, Said Boumaraf:
  Occlusion-aware deep convolutional neural network via homogeneous Tanh-transforms for face parsing. 105120
- Zilin Zou, Ying Chen:
  Modality interactive attention for cross-modality person re-identification. 105128
- Mei Yu, Shouyi Xu, Hang Sun, Yuelin Zheng, Wen Yang:
  Hierarchical slice interaction and multi-layer cooperative decoding networks for remote sensing image dehazing. 105129
- Yufei Zha, Xiao Guo, Fan Li, Hangfei Li:
  Enhancing small object tracking with reversible rescaling networks. 105131
- Siqi Zhang, Lu Zhang, Zhiyong Liu:
  Active domain adaptation for semantic segmentation via dynamically balancing domainness and uncertainty. 105132
- Jingxin Wang, Yunfeng Zhang, Fangxun Bao, Yuetong Liu, Qiuyue Zhang, Caiming Zhang:
  Video object segmentation by multi-scale attention using bidirectional strategy. 105136
- Oumaima Moutik, Hiba Sekkat, Taha Ait Tchakoucht, Badr El Kari, Ahmed El Hilali Alaoui:
  A puzzle questions form training for self-supervised skeleton-based action recognition. 105137
- Seung-Lee Lee, Minjae Kang, Jong-Uk Hou:
  Localization of diffusion model-based inpainting through the inter-intra similarity of frequency features. 105138
- Lifang Zhou, Songlin Rao, Weisheng Li, Bo Hu, Bo Sun:
  Multi-branch progressive embedding network for crowd counting. 105140
- Haiying Xia, Chunhai Su, Shuxiang Song, Yumei Tan:
  Dual-consistency constraints network for noisy facial expression recognition. 105141
- Cédric Hémon, Blanche Texier, Hilda Chourak, Antoine Simon, Igor Bessières, Renaud de Crevoisier, Joël Castelli, Caroline Lafond, Anaïs Barateau, Jean-Claude Nunes:
  Indirect deformable image registration using synthetic image generated by unsupervised deep learning. 105143
- Liangliang Wang, Lei Zhou, Peidong Liang, Ke Wang, Lianzheng Ge:
  SMTCNN - A global spatio-temporal texture convolutional neural network for 3D dynamic texture recognition. 105145
- Xiaohui Yu, Jingjun Tian, Zhipeng Chen, Yizhen Meng, Jun Zhang:
  Predictive breast cancer diagnosis using ensemble fuzzy model. 105146
- Zhichao Sun, Huachao Zhu, Xin Xiao, Yuliang Gu, Yongchao Xu:
  Nighttime image semantic segmentation with retinex theory. 105149
- Yaxin Dong, Fei Li, Kai Yan, Shen Deng, Tao Wen, Yang Yang:
  OFACD: An end-to-end change detection network for small UAVs remote sensing with viewpoint differences. 105150
- Nafiseh Jabbari Tofighi, Mohamed Hedi Elfkir, Nevrez Imamoglu, Cagri Ozcinar, Aykut Erdem, Erkut Erdem:
  Omnidirectional image quality assessment with local-global vision transformers. 105151
- Weihang Kong, Zepeng Yu, He Li, Liangang Tong, Fengda Zhao, Yang Li:
  CrowdAlign: Shared-weight dual-level alignment fusion for RGB-T crowd counting. 105152
- Thamer Alanazi, Khalid Babutain, Ghulam Muhammad:
  Mitigating human fall injuries: A novel system utilizing 3D 4-stream convolutional neural networks and image fusion. 105153
- Bahareh Asheghi, Pedram Salehpour, Abdolhamid Moallemi Khiavi, Mahdi Hashemzadeh, Amirhassan Monajemi:
  DASOD: Detail-aware salient object detection. 105154
- Muhammad Imran, Muhammad Usman Akram, Mohsin Islam Tiwana, Anum Abdul Salam, Taimur Hassan, Danilo Greco:
  Two-dimensional hybrid incremental learning (2DHIL) framework for semantic segmentation of skin tissues. 105147
- Muhammad Imran, Muhammad Usman Akram, Mohsin Islam Tiwana, Anum Abdul Salam, Danilo Greco:
  Erratum to "Two-dimensional hybrid incremental learning (2DHIL) framework for semantic segmentation of skin tissues" [Image and Vision Computing. Vol148 (2024) 105098]. 105148