TY - JOUR
T1 - A general pipeline for quality and statistical assessment of protein interaction data using R and Bioconductor
AU - Chiang, Tony
AU - Scholtens, Denise
PY - 2009/4/22
Y1 - 2009/4/22
N2 - The systematic mapping of protein interactions by bait-prey techniques, including affinity purification-mass spectrometry or the yeast two-hybrid system, contributes a unique and relevant perspective on the comprehensive picture of cellular machines. We describe here a protocol for statistical analysis of node-and-edge graph representations of these data using R and Bioconductor, recognizing that steps may be added or omitted depending on the data set at hand. The fundamental purpose of such analyses is feature estimation, defined here as the estimation of data-type-specific biological features, such as protein complex composition and the physical interaction integrity of known or estimated complexes. In preparation for feature estimation tasks, we outline a progression through three analytic components common to all bait-prey data types: Preliminary setup, exploratory analysis and quality assessment. The end result is a collection of descriptive and inferred characteristics of the data, ready for biological interpretation in a computationally tractable form.
AB - The systematic mapping of protein interactions by bait-prey techniques, including affinity purification-mass spectrometry or the yeast two-hybrid system, contributes a unique and relevant perspective on the comprehensive picture of cellular machines. We describe here a protocol for statistical analysis of node-and-edge graph representations of these data using R and Bioconductor, recognizing that steps may be added or omitted depending on the data set at hand. The fundamental purpose of such analyses is feature estimation, defined here as the estimation of data-type-specific biological features, such as protein complex composition and the physical interaction integrity of known or estimated complexes. In preparation for feature estimation tasks, we outline a progression through three analytic components common to all bait-prey data types: Preliminary setup, exploratory analysis and quality assessment. The end result is a collection of descriptive and inferred characteristics of the data, ready for biological interpretation in a computationally tractable form.
UR - http://www.scopus.com/inward/record.url?scp=64749102604&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=64749102604&partnerID=8YFLogxK
U2 - 10.1038/nprot.2009.26
DO - 10.1038/nprot.2009.26
M3 - Article
C2 - 19325550
AN - SCOPUS:64749102604
VL - 4
SP - 535
EP - 546
JO - Nature Protocols
JF - Nature Protocols
SN - 1754-2189
IS - 4
ER -