Towards Validating Long-Term User Feedbacks in Interactive Recommendation Systems

Cited 2 times in Web of Science, cited 0 times in Scopus
  • Hits: 89
  • Downloads: 0
DC Field | Value | Language
dc.contributor.author | Lee, Hojoon | ko
dc.contributor.author | Hwang, Dongyoon | ko
dc.contributor.author | Min, Kyushik | ko
dc.contributor.author | Choo, Jaegul | ko
dc.date.accessioned | 2023-09-19T11:01:04Z | -
dc.date.available | 2023-09-19T11:01:04Z | -
dc.date.created | 2023-09-19 | -
dc.date.issued | 2022-07 | -
dc.identifier.citation | 45th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, SIGIR 2022, pp.2607 - 2611 | -
dc.identifier.uri | http://hdl.handle.net/10203/312773 | -
dc.description.abstract | Interactive Recommender Systems (IRSs) have attracted considerable attention due to their ability to model the interactive process between users and recommender systems. Numerous approaches have adopted Reinforcement Learning (RL) algorithms, as these can directly maximize users' cumulative rewards. In IRS research, publicly available review datasets are commonly used to compare and evaluate algorithms. However, the user feedback provided in public datasets includes only instant responses (e.g., a rating), with no delayed responses (e.g., dwell time and lifetime value). Thus, the question remains whether these review datasets are an appropriate choice for evaluating long-term effects in IRSs. In this work, we revisit experiments on IRSs with review datasets and compare RL-based models with a simple reward model that greedily recommends the item with the highest one-step reward. Following extensive analysis, we reveal three main findings: First, a simple greedy reward model consistently outperforms RL-based models in maximizing cumulative rewards. Second, applying higher weight to long-term rewards degrades recommendation performance. Third, user feedback has only marginal long-term effects in the benchmark datasets. Based on our findings, we conclude that datasets must be carefully verified and that a simple greedy baseline should be included for a proper evaluation of RL-based IRS approaches. Our code and dataset are available at https://github.com/dojeon-ai/irs_validation. | -
dc.language | English | -
dc.publisher | Association for Computing Machinery, Inc | -
dc.title | Towards Validating Long-Term User Feedbacks in Interactive Recommendation Systems | -
dc.type | Conference | -
dc.identifier.wosid | 000852715902117 | -
dc.identifier.scopusid | 2-s2.0-85135025185 | -
dc.type.rims | CONF | -
dc.citation.beginningpage | 2607 | -
dc.citation.endingpage | 2611 | -
dc.citation.publicationname | 45th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, SIGIR 2022 | -
dc.identifier.conferencecountry | SP | -
dc.identifier.conferencelocation | Madrid | -
dc.identifier.doi | 10.1145/3477495.3531869 | -
dc.contributor.localauthor | Choo, Jaegul | -
dc.contributor.nonIdAuthor | Min, Kyushik | -
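
The abstract above contrasts RL-based recommenders with a simple baseline that greedily recommends the item with the highest predicted one-step reward. The following is a minimal, hypothetical Python sketch of such a greedy baseline; all names here (RewardModel, recommend_greedy) are illustrative and are not taken from the authors' repository, whose actual implementation is linked in the record.

    import numpy as np

    class RewardModel:
        """Toy stand-in for a learned one-step (instant) reward model."""

        def __init__(self, n_items: int, seed: int = 0):
            rng = np.random.default_rng(seed)
            # A real model would be trained on user feedback; here we
            # substitute fixed random per-item scores for illustration.
            self.item_scores = rng.random(n_items)

        def predict(self, user_state, candidate_items):
            # A real model would condition on user_state; this toy one ignores it.
            return self.item_scores[candidate_items]

    def recommend_greedy(model, user_state, candidate_items):
        """Recommend the candidate with the highest predicted one-step reward."""
        rewards = model.predict(user_state, candidate_items)
        return candidate_items[int(np.argmax(rewards))]

    # Usage: pick one item out of 100 candidates for a (dummy) user state.
    model = RewardModel(n_items=100)
    best_item = recommend_greedy(model, user_state=None, candidate_items=np.arange(100))
    print(best_item)

Unlike an RL policy, this baseline ignores future rewards entirely; the paper's finding is that, on common review datasets, such one-step greedy recommendation already maximizes cumulative reward competitively.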
Appears in Collection
AI-Conference Papers (학술대회논문: Conference Papers)
Files in This Item
There are no files associated with this item.