TY - JOUR
T1 - Cross-evaluation of metrics to estimate the significance of creative works
AU - Wasserman, Max
AU - Zeng, Xiao Han T.
AU - Amaral, Luís A. Nunes
PY - 2015/2/3
Y1 - 2015/2/3
N2 - In a world overflowing with creative works, it is useful to be able to filter out the unimportant works so that the significant ones can be identified and thereby absorbed. An automated method could provide an objective approach for evaluating the significance of works on a universal scale. However, there have been few attempts at creating such a measure, and there are few "ground truths" for validating the effectiveness of potential metrics for significance. For movies, the US Library of Congress's National Film Registry (NFR) contains American films that are "culturally, historically, or aesthetically significant" as chosen through a careful evaluation and deliberation process. By analyzing a network of citations between 15,425 United States-produced films procured from the Internet Movie Database (IMDb), we obtain several automated metrics for significance. The best of these metrics is able to indicate a film's presence in the NFR at least as well as metrics based on aggregated expert opinions or large population surveys. Importantly, automated metrics can easily be applied to older films for which no other rating may be available. Our results may have implications for the evaluation of other creative works such as scientific research.
AB - In a world overflowing with creative works, it is useful to be able to filter out the unimportant works so that the significant ones can be identified and thereby absorbed. An automated method could provide an objective approach for evaluating the significance of works on a universal scale. However, there have been few attempts at creating such a measure, and there are few "ground truths" for validating the effectiveness of potential metrics for significance. For movies, the US Library of Congress's National Film Registry (NFR) contains American films that are "culturally, historically, or aesthetically significant" as chosen through a careful evaluation and deliberation process. By analyzing a network of citations between 15,425 United States-produced films procured from the Internet Movie Database (IMDb), we obtain several automated metrics for significance. The best of these metrics is able to indicate a film's presence in the NFR at least as well as metrics based on aggregated expert opinions or large population surveys. Importantly, automated metrics can easily be applied to older films for which no other rating may be available. Our results may have implications for the evaluation of other creative works such as scientific research.
KW - Citations
KW - Complex networks
KW - Data science
KW - Films
KW - IMDb
UR - http://www.scopus.com/inward/record.url?scp=84922247451&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=84922247451&partnerID=8YFLogxK
U2 - 10.1073/pnas.1412198112
DO - 10.1073/pnas.1412198112
M3 - Article
C2 - 25605881
AN - SCOPUS:84922247451
SN - 0027-8424
VL - 112
SP - 1281
EP - 1286
JO - Proceedings of the National Academy of Sciences of the United States of America
JF - Proceedings of the National Academy of Sciences of the United States of America
IS - 5
ER -