TY - JOUR
T1 - CMS distributed computing workflow experience
AU - Adelman-Mccarthy, Jennifer
AU - Gutsche, Oliver
AU - Haas, Jeffrey D.
AU - Prosper, Harrison B.
AU - Dutta, Valentina
AU - Gomez-Ceballos, Guillelmo
AU - Hahn, Kristian
AU - Klute, Markus
AU - Mohapatra, Ajit
AU - Spinoso, Vincenzo
AU - Kcira, Dorian
AU - Caudron, Julien
AU - Liao, Junhui
AU - Pin, Arnaud
AU - Schul, Nicolas
AU - De Lentdecker, Gilles
AU - McCartin, Joseph
AU - Vanelderen, Lukas
AU - Janssen, Xavier
AU - Tsyganov, Andrey
AU - Barge, Derek
AU - Lahiff, Andrew
PY - 2011
Y1 - 2011
N2 - The vast majority of the CMS Computing capacity, which is organized in a tiered hierarchy, is located away from CERN. The 7 Tier-1 sites archive the LHC proton-proton collision data that is initially processed at CERN. These sites provide access to all recorded and simulated data for the Tier-2 sites, via wide-area network (WAN) transfers. All central data processing workflows are executed at the Tier-1 level, which contain re-reconstruction and skimming workflows of collision data as well as reprocessing of simulated data to adapt to changing detector conditions. This paper describes the operation of the CMS processing infrastructure at the Tier-1 level. The Tier-1 workflows are described in detail. The operational optimization of resource usage is described. In particular, the variation of different workflows during the data taking period of 2010, their efficiencies and latencies as well as their impact on the delivery of physics results is discussed and lessons are drawn from this experience. The simulation of proton-proton collisions for the CMS experiment is primarily carried out at the second tier of the CMS computing infrastructure. Half of the Tier-2 sites of CMS are reserved for central Monte Carlo (MC) production while the other half is available for user analysis. This paper summarizes the large throughput of the MC production operation during the data taking period of 2010 and discusses the latencies and efficiencies of the various types of MC production workflows. We present the operational procedures to optimize the usage of available resources and we the operational model of CMS for including opportunistic resources, such as the larger Tier-3 sites, into the central production operation.
AB - The vast majority of the CMS Computing capacity, which is organized in a tiered hierarchy, is located away from CERN. The 7 Tier-1 sites archive the LHC proton-proton collision data that is initially processed at CERN. These sites provide access to all recorded and simulated data for the Tier-2 sites, via wide-area network (WAN) transfers. All central data processing workflows are executed at the Tier-1 level, which contain re-reconstruction and skimming workflows of collision data as well as reprocessing of simulated data to adapt to changing detector conditions. This paper describes the operation of the CMS processing infrastructure at the Tier-1 level. The Tier-1 workflows are described in detail. The operational optimization of resource usage is described. In particular, the variation of different workflows during the data taking period of 2010, their efficiencies and latencies as well as their impact on the delivery of physics results is discussed and lessons are drawn from this experience. The simulation of proton-proton collisions for the CMS experiment is primarily carried out at the second tier of the CMS computing infrastructure. Half of the Tier-2 sites of CMS are reserved for central Monte Carlo (MC) production while the other half is available for user analysis. This paper summarizes the large throughput of the MC production operation during the data taking period of 2010 and discusses the latencies and efficiencies of the various types of MC production workflows. We present the operational procedures to optimize the usage of available resources and we the operational model of CMS for including opportunistic resources, such as the larger Tier-3 sites, into the central production operation.
UR - http://www.scopus.com/inward/record.url?scp=84858110599&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=84858110599&partnerID=8YFLogxK
U2 - 10.1088/1742-6596/331/7/072019
DO - 10.1088/1742-6596/331/7/072019
M3 - Conference article
AN - SCOPUS:84858110599
SN - 1742-6588
VL - 331
JO - Journal of Physics: Conference Series
JF - Journal of Physics: Conference Series
IS - PART 7
M1 - 72019
T2 - International Conference on Computing in High Energy and Nuclear Physics, CHEP 2010
Y2 - 18 October 2010 through 22 October 2010
ER -