Full text

Turn on search term navigation

© 2025 Zhang, Zhang. This is an open access article distributed under the terms of the Creative Commons Attribution License: http://creativecommons.org/licenses/by/4.0/ (the “License”), which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited. Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.

Abstract

Recommendation systems play a significant role in information presentation and research. In particular, goods recommendations for consumers should match consumer psychology, speed up product search, and improve the efficiency of product transactions. Online platforms provide product information and interactive information between customers and products. However, the interactive modeling effect of the existing multimedia algorithms on this information must be improved, for instance, by deeply integrating product and interactive information. Accordingly, we propose a cross-fusion-activated multi-modal (CFMM) integration method for recommender systems to achieve deep fusion of product and user information. This method adds a cross-fusion module to fuse the features of different modalities through deep-feature fusion. A fusion loss function is further proposed to improve the recommendation performance of the network. Extensive experiments were conducted on three real-world datasets along with multiple ablation studies to illustrate the effects of the different modules. The experimental results show that the proposed method exhibits better recommendation performance, providing a maximum improvement of 3.8% in the recommendation performance metrics Recall@20, NDCG@20, and Precision@20 in comparisons with existing algorithms. This method realizes a deeper integration of multimodal information; however, the performance can be further improved by extending the multimodal information interaction algorithm to include product and user information.

Details

Title
Cross-fusion activates deep modal integration for multimedia recommendation
Author
Zhang, Chong  VIAFID ORCID Logo  ; Zhang, ZhiCai
First page
e0327663
Section
Research Article
Publication year
2025
Publication date
Jul 2025
Publisher
Public Library of Science
e-ISSN
19326203
Source type
Scholarly Journal
Language of publication
English
ProQuest document ID
3226650357
Copyright
© 2025 Zhang, Zhang. This is an open access article distributed under the terms of the Creative Commons Attribution License: http://creativecommons.org/licenses/by/4.0/ (the “License”), which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited. Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.