CIT-Rec: Enhancing Sequential Recommendation System with Large Language Models
Ziyu Li1, Zhen Chen2, Xuejing Fu2, Tong Mo1,*, Weiping Li1
1 School of Software and Microelectronics, Peking University, Beijing, 100871, China
2 Information Application Research Center of Shanghai Municipal Administration for Market Regulation, Shanghai, 200032, China
* Corresponding Author: Tong Mo. Email:
Computers, Materials & Continua https://doi.org/10.32604/cmc.2025.071994
Received 17 August 2025; Accepted 02 December 2025; Published online 29 December 2025
Abstract
Recommendation systems are key to boosting user engagement, satisfaction, and retention, particularly on media platforms where personalized content is vital. Sequential recommendation systems learn from user-item interactions to predict the items a user will be interested in next. However, many current methods rely on unique user and item IDs, limiting their ability to represent users and items effectively, especially in zero-shot scenarios where training data is scarce. With the rapid development of Large Language Models (LLMs), researchers are exploring their potential to enhance recommendation systems. Yet a semantic gap remains between the linguistic semantics of LLMs and the collaborative semantics of recommendation systems, where items are typically indexed by IDs. Moreover, most research focuses on item representations while neglecting personalized user modeling. To address these issues, we propose CIT-Rec, an LLM-based sequential recommendation framework that integrates Collaborative semantics for user representation with Image and Text information for item representation to enhance Recommendations. Specifically, by aligning intuitive image information with semantically rich text, we represent items more accurately and thus improve item representation quality. Beyond item representations, we also focus on user representations: to capture users' personalized preferences more precisely, we train traditional sequential recommendation models on users' historical interaction data, effectively capturing behavioral patterns. Finally, by combining LLMs with traditional sequential recommendation models, the LLM can understand linguistic semantics while also capturing collaborative semantics. Extensive evaluations on real-world datasets show that our model outperforms baseline methods, effectively combining user interaction history with item visual and textual modalities to provide personalized recommendations.
Keywords
Large language models; vision language models; sequential recommendation; instruction tuning