Measuring the importance of user-generated content to search engines

Nicholas Vincent, Isaac Johnson, Patrick Sheehan, Brent Hecht

Research output: Contribution to conferencePaper

1 Scopus citations

Abstract

Search engines are some of the most popular and profitable intelligent technologies in existence. Recent research, however, has suggested that search engines may be surprisingly dependent on user-created content like Wikipedia articles to address user information needs. In this paper, we perform a rigorous audit of the extent to which Google leverages Wikipedia and other user-generated content to respond to queries. Analyzing results for six types of important queries (e.g. most popular, trending, expensive advertising), we observe that Wikipedia appears in over 80% of results pages for some query types and is by far the most prevalent individual content source across all query types. More generally, our results provide empirical information to inform a nascent but rapidly-growing debate surrounding a highly-consequential question: Do users provide enough value to intelligent technologies that they should receive more of the economic benefits from intelligent technologies?

Original languageEnglish (US)
Pages505-516
Number of pages12
StatePublished - Jan 1 2019
Event13th International Conference on Web and Social Media, ICWSM 2019 - Munich, Germany
Duration: Jun 11 2019Jun 14 2019

Conference

Conference13th International Conference on Web and Social Media, ICWSM 2019
CountryGermany
CityMunich
Period6/11/196/14/19

    Fingerprint

ASJC Scopus subject areas

  • Computer Networks and Communications

Cite this

Vincent, N., Johnson, I., Sheehan, P., & Hecht, B. (2019). Measuring the importance of user-generated content to search engines. 505-516. Paper presented at 13th International Conference on Web and Social Media, ICWSM 2019, Munich, Germany.