Description
This replication dataset includes code and data to replicate the paper "Communication networks do not predict success in attempts at peer production". The data included are of three types: 1. A zipped tar file of compressed XML files of edits made to wikis. This includes the full text of every revision made to the 1430 wikis that were part of our analysis as of early 2010 (different wikis were collected at different times). Note: Due to the Dataverse's file size limit, this file is in two parts - wiki_com_networks-wiki_dump.tar.xz.partaa and wiki_com_networks-wiki_dump.tar.xz.partab To combine them run: cat wiki_com_networks-wiki_dump.tar.xz.part* > wiki_com_networks-wiki_dump.tar.xz 2. A zipped tar file of the wikiq TSV files with metadata about each edit, created using the wikiq parser (https://code.communitydata.science/mediawiki_dump_tools.git). Those wishing to convert the XML files into TSV files can use the wikiq parser. 3. Summary CSV files with data about the communication network and activity levels for each wiki---in other words, the data used for the analyses in the paper. Code for converting the TSV files into these summary CSV files is included. A more detailed description of how to replicate the figures and analyses from the paper is given in the README file included with the code.
Date made available | 2023 |
---|---|
Publisher | Harvard Dataverse |