TY - GEN
T1 - REV2
T2 - 11th ACM International Conference on Web Search and Data Mining, WSDM 2018
AU - Kumar, Srijan
AU - Kumar, Mohit
AU - Hooi, Bryan
AU - Faloutsos, Christos
AU - Makhija, Disha
AU - Subrahmanian, V. S.
N1 - Publisher Copyright:
© 2018 Association for Computing Machinery.
PY - 2018/2/2
Y1 - 2018/2/2
N2 - Rating platforms enable large-scale collection of user opinion about items (e.g., products or other users). However, fraudulent users give fake ratings for excessive monetary gains. In this paper, we present Rev2, a system to identify such fraudulent users. We propose three interdependent intrinsic quality metrics-fairness of a user, reliability of a rating and goodness of a product. The fairness and reliability quantify the trustworthiness of a user and rating, respectively, and goodness quantifies the quality of a product. Intuitively, a user is fair if it provides reliable scores that are close to the goodness of products. We propose six axioms to establish the interdependency between the scores, and then, formulate a mutually recursive definition that satisfies these axioms. We extend the formulation to address cold start problem and incorporate behavior properties. We develop the Rev2 algorithm to calculate these intrinsic scores for all users, ratings, and products by combining network and behavior properties. We prove that this algorithm is guaranteed to converge and has linear time complexity. By conducting extensive experiments on five rating datasets, we show that Rev2 outperforms nine existing algorithms in detecting fraudulent users. We reported the 150 most unfair users in the Flipkart network to their review fraud investigators, and 127 users were identified as being fraudulent (84.6% accuracy). The Rev2 algorithm is being deployed at Flipkart.
AB - Rating platforms enable large-scale collection of user opinion about items (e.g., products or other users). However, fraudulent users give fake ratings for excessive monetary gains. In this paper, we present Rev2, a system to identify such fraudulent users. We propose three interdependent intrinsic quality metrics-fairness of a user, reliability of a rating and goodness of a product. The fairness and reliability quantify the trustworthiness of a user and rating, respectively, and goodness quantifies the quality of a product. Intuitively, a user is fair if it provides reliable scores that are close to the goodness of products. We propose six axioms to establish the interdependency between the scores, and then, formulate a mutually recursive definition that satisfies these axioms. We extend the formulation to address cold start problem and incorporate behavior properties. We develop the Rev2 algorithm to calculate these intrinsic scores for all users, ratings, and products by combining network and behavior properties. We prove that this algorithm is guaranteed to converge and has linear time complexity. By conducting extensive experiments on five rating datasets, we show that Rev2 outperforms nine existing algorithms in detecting fraudulent users. We reported the 150 most unfair users in the Flipkart network to their review fraud investigators, and 127 users were identified as being fraudulent (84.6% accuracy). The Rev2 algorithm is being deployed at Flipkart.
UR - http://www.scopus.com/inward/record.url?scp=85046890670&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85046890670&partnerID=8YFLogxK
U2 - 10.1145/3159652.3159729
DO - 10.1145/3159652.3159729
M3 - Conference contribution
AN - SCOPUS:85046890670
T3 - WSDM 2018 - Proceedings of the 11th ACM International Conference on Web Search and Data Mining
SP - 333
EP - 341
BT - WSDM 2018 - Proceedings of the 11th ACM International Conference on Web Search and Data Mining
PB - Association for Computing Machinery, Inc
Y2 - 5 February 2018 through 9 February 2018
ER -