TY - JOUR
T1 - Use of glide-ins in CMS for production and analysis
AU - Bradley, D.
AU - Gutsche, O.
AU - Hahn, K.
AU - Holzman, B.
AU - Padhi, S.
AU - Pi, H.
AU - Spiga, D.
AU - Sfiligoi, I.
AU - Vaandering, E.
AU - Würthwein, F.
PY - 2010
Y1 - 2010
N2 - With the evolution of various grid federations, the Condor glide-ins represent a key feature in providing a homogeneous pool of resources using late-binding technology. The CMS collaboration uses the glide-in based Workload Management System, glideinWMS, for production (ProdAgent) and distributed analysis (CRAB) of the data. The Condor glide-in daemons traverse to the worker nodes, submitted via Condor-G. Once activated, they preserve the Master-Worker relationships, with the worker first validating the execution environment on the worker node before pulling the jobs sequentially until the expiry of their lifetimes. The combination of late-binding and validation significantly reduces the overall failure rate visible to CMS physicists. We discuss the extensive use of the glideinWMS since the computing challenge, CCRC-08, in order to prepare for the forthcoming LHC data-taking period. The key features essential to the success of large-scale production and analysis on CMS resources across major grid federations, including EGEE, OSG and NorduGrid are outlined. Use of glide-ins via the CRAB server mechanism and ProdAgent, as well as first hand experience of using the next generation CREAM computing element within the CMS framework is discussed.
AB - With the evolution of various grid federations, the Condor glide-ins represent a key feature in providing a homogeneous pool of resources using late-binding technology. The CMS collaboration uses the glide-in based Workload Management System, glideinWMS, for production (ProdAgent) and distributed analysis (CRAB) of the data. The Condor glide-in daemons traverse to the worker nodes, submitted via Condor-G. Once activated, they preserve the Master-Worker relationships, with the worker first validating the execution environment on the worker node before pulling the jobs sequentially until the expiry of their lifetimes. The combination of late-binding and validation significantly reduces the overall failure rate visible to CMS physicists. We discuss the extensive use of the glideinWMS since the computing challenge, CCRC-08, in order to prepare for the forthcoming LHC data-taking period. The key features essential to the success of large-scale production and analysis on CMS resources across major grid federations, including EGEE, OSG and NorduGrid are outlined. Use of glide-ins via the CRAB server mechanism and ProdAgent, as well as first hand experience of using the next generation CREAM computing element within the CMS framework is discussed.
UR - http://www.scopus.com/inward/record.url?scp=77955346310&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=77955346310&partnerID=8YFLogxK
U2 - 10.1088/1742-6596/219/7/072013
DO - 10.1088/1742-6596/219/7/072013
M3 - Conference article
AN - SCOPUS:77955346310
SN - 1742-6588
VL - 219
JO - Journal of Physics: Conference Series
JF - Journal of Physics: Conference Series
IS - 1 PART 7
M1 - 072013
T2 - 17th International Conference on Computing in High Energy and Nuclear Physics, CHEP 2009
Y2 - 21 March 2009 through 27 March 2009
ER -