Model-Based Learning from Preference Data

Qinghua Liu; Marta Crispino; Ida Scheel; Valeria Vitelli; Arnoldo Frigessi

doi:10.1146/annurev-statistics-031017-100213

Annual Review of Statistics and Its Application

Volume 6, 2019

Review Article

Qinghua Liu¹, Marta Crispino², Ida Scheel¹, Valeria Vitelli³, and Arnoldo Frigessi³

Affiliations: ¹Department of Mathematics, University of Oslo, 0316 Oslo, Norway; email: [email protected], [email protected]; ²Institut National de Recherche en Informatique et Automatique (INRIA), CNRS, 38330 Montbonnot-Saint-Martin, France; email: [email protected]; ³Oslo Centre for Biostatistics and Epidemiology, University of Oslo, 0317 Oslo, Norway; email: [email protected], [email protected]
Vol. 6:329-354 (Volume publication date March 2019) https://doi.org/10.1146/annurev-statistics-031017-100213
First published as a Review in Advance on September 27, 2018
Copyright © 2019 by Annual Reviews. All rights reserved

Abstract

Preference data occur when assessors express comparative opinions about a set of items, by rating, ranking, pair comparing, liking, or clicking. The purpose of preference learning is to (a) infer the shared consensus preference of a group of users, a task sometimes called rank aggregation, or (b) estimate each user's individual ranking of the items when the user indicates only incomplete preferences; the latter is an important part of recommender systems. We provide an overview of probabilistic approaches to preference learning, including the Mallows, Plackett–Luce, and Bradley–Terry models and collaborative filtering, and some of their variations. We illustrate, compare, and discuss the use of these methods by means of an experiment in which assessors rank potatoes, and with a simulation. The purpose of this article is not to recommend one best method but to present a palette of different possibilities for different questions and different types of data.

Keywords: Bayesian inference, Bradley–Terry model, Mallows model, Plackett–Luce model, preference learning with uncertainty, recommender systems
