TY - JOUR
T1 - A Reconfigurable Neural Network ASIC for Detector Front-End Data Compression at the HL-LHC
AU - Guglielmo, Giuseppe Di
AU - Fahim, Farah
AU - Herwig, Christian
AU - Valentin, Manuel Blanco
AU - Duarte, Javier
AU - Gingu, Cristian
AU - Harris, Philip
AU - Hirschauer, James
AU - Kwok, Martin
AU - Loncar, Vladimir
AU - Luo, Yingyi
AU - Miranda, Llovizna
AU - Ngadiuba, Jennifer
AU - Noonan, Daniel
AU - Ogrenci-Memik, Seda
AU - Pierini, Maurizio
AU - Summers, Sioni
AU - Tran, Nhan
N1 - Funding Information:
ACKNOWLEDGMENT The authors would like to acknowledge CAD support from Sandeep Garg and Anoop Saha from Mentor Graphics for Catapult high-level synthesis (HLS) and Bruce Cauble and Brent Carlson from Cadence for Innovus and Incisive. They also like to thank the Fermilab application-specific integrated circuit (ASIC) group for incorporating the autoencoder block into the ECON-T ASIC; CMS high-granularity endcap calorimeter (HGCAL) and Jean-Baptiste Sauvan for providing simulated module images for training; and Andre Davide for extensive input on network optimization. They acknowledge the Fast Machine Learning Collective as an open community of multidomain experts and collaborators. This community was important for the development of this project.
Funding Information:
Manuscript received October 31, 2020; revised February 12, 2021 and April 6, 2021; accepted May 23, 2021. Date of publication June 7, 2021; date of current version August 16, 2021. The work of Farah Fahim, Christian Herwig, Cristian Gingu, James Hirschauer, Llovizna Miranda, and Nhan Tran was supported by the Fermi Research Alliance, LLC through the U.S. Department of Energy (DOE), Office of Science, Office of High Energy I. INTRODUCTION Physics under Contract DE-AC02-07CH11359. The work of Javier Duarte B REAKTHROUGHS in the precision and speed of sensing EarlyCareer ResearchProgramunderAwardDE-SC0021187.TheworkofwassupportedbytheDOE,OfficeofScience,OfficeofHighEnergyPhysics instrumentation are impactful on advances in scientific Philip Harris was supported by the Massachusetts Institute of Technology methodologies and theories. Thus, a common paradigm across UniversityGrant.TheworkofVladimirLoncar,MaurizioPierini,andSioni many scientific disciplines in physics has been to increase theEuropeanUnion’sHorizon2020ResearchandInnovationProgramunderSummers wassupportedby theEuropeanResearchCouncil(ERC) through the resolution of the sensing equipment in order to increase Grant 772369. either the robustness or the sensitivity of the experiment itself. Giuseppe Di Guglielmo is with the Computer This demand for increasingly higher sensitivity in experiments, FarahFahimandNhanTranarewith theFermi National AcceleratorColumbiaUniversity,NewYork,NY10027USA. along with advances in the design of state-of-the-art sensing Laboratory, Batavia, IL 60510 USA, and also with the Electrical and Computer systems, has resulted in rapidly growing big data pipelines EngineeringDepartment,NorthwesternUniversity,Evanston,IL60208USA such that transmission of acquired data to the processing unit ChristianHerwig,CristianGingu,JamesHirschauer,andLloviznaMiranda(e-mail:[email protected]). via conventional methods is no longer feasible. Data trans- are with the Fermi National Accelerator Laboratory, Batavia, IL 60510 USA. mission is commonly much less efficient than data process-ManuelBlancoValentin,YingyiLuo,andSedaOgrenci-Memikarewiththe ing. Therefore, placing data compression and processing as Evanston,IL60208USA.ElectricalandComputerEngineering Department, Northwestern University, close as possible to data creation while maintaining physics Javier Duarte is with the performance is a crucial task in modern physics experiments. CA92093USA. At the CERN Large Hadron Collider (LHC) and its high MA02139USA.PhilipHarrisiswiththeMassachusettsInstituteofTechnology,Cambridge, luminosity upgrade (HL-LHC), extreme collision rates present Martin Kwok is with the extreme challenges for data processing and transmission at Providence,RI02912USA. multiple stages in detector readout and trigger systems. As the theInstituteofPhysicsBelgrade,11080Belgrade,Serbia.Vladimir LoncariswithCERN,1211Geneva, Switzerland, initial stage in the data chain, the on-detector (front-end) Jennifer Ngadiuba is with the California Institute of Technology, Pasadena, electronics that readout detector sensors must operate with CA91125USA. low latency and low-power (LP) dissipation in a high-radiation FL32901USA.Daniel Noonan environment, necessitating the use of application-specific Maurizio Pierini integrated circuits (ASICs). In order to mitigate the ini-Switzerland. tial bottleneck of moving data from front-end ASICs to https://doi.org/10.1109/TNS.2021.3087100.Colorversionsof oneormore figures off-detector (back-end) systems based on field-programmable Digital Object Identifier 10.1109/TNS.2021.3087100 gate arrays (FPGAs), front-end ASICs must provide edge com- 0018-9499 © 2021 IEEE. Personal use is permitted, but republication/redistribution requires IEEE permission. See https://www.ieee.org/publications/rights/index.html for more information.
Publisher Copyright:
© 1963-2012 IEEE.
PY - 2021/8
Y1 - 2021/8
N2 - Despite advances in the programmable logic capabilities of modern trigger systems, a significant bottleneck remains in the amount of data to be transported from the detector to off-detector logic where trigger decisions are made. We demonstrate that a neural network (NN) autoencoder model can be implemented in a radiation-tolerant application-specific integrated circuit (ASIC) to perform lossy data compression alleviating the data transmission problem while preserving critical information of the detector energy profile. For our application, we consider the high-granularity calorimeter from the Compact Muon Solenoid (CMS) experiment at the CERN Large Hadron Collider. The advantage of the machine learning approach is in the flexibility and configurability of the algorithm. By changing the NN weights, a unique data compression algorithm can be deployed for each sensor in different detector regions and changing detector or collider conditions. To meet area, performance, and power constraints, we perform quantization-aware training to create an optimized NN hardware implementation. The design is achieved through the use of high-level synthesis tools and the hls4ml framework and was processed through synthesis and physical layout flows based on a low-power (LP)-CMOS 65-nm technology node. The flow anticipates 200 Mrad of ionizing radiation to select gates and reports a total area of 3.6 mm2 and consumes 95 mW of power. The simulated energy consumption per inference is 2.4 nJ. This is the first radiation-tolerant on-detector ASIC implementation of an NN that has been designed for particle physics applications.
AB - Despite advances in the programmable logic capabilities of modern trigger systems, a significant bottleneck remains in the amount of data to be transported from the detector to off-detector logic where trigger decisions are made. We demonstrate that a neural network (NN) autoencoder model can be implemented in a radiation-tolerant application-specific integrated circuit (ASIC) to perform lossy data compression alleviating the data transmission problem while preserving critical information of the detector energy profile. For our application, we consider the high-granularity calorimeter from the Compact Muon Solenoid (CMS) experiment at the CERN Large Hadron Collider. The advantage of the machine learning approach is in the flexibility and configurability of the algorithm. By changing the NN weights, a unique data compression algorithm can be deployed for each sensor in different detector regions and changing detector or collider conditions. To meet area, performance, and power constraints, we perform quantization-aware training to create an optimized NN hardware implementation. The design is achieved through the use of high-level synthesis tools and the hls4ml framework and was processed through synthesis and physical layout flows based on a low-power (LP)-CMOS 65-nm technology node. The flow anticipates 200 Mrad of ionizing radiation to select gates and reports a total area of 3.6 mm2 and consumes 95 mW of power. The simulated energy consumption per inference is 2.4 nJ. This is the first radiation-tolerant on-detector ASIC implementation of an NN that has been designed for particle physics applications.
KW - Application-specific integrated circuit (ASIC)
KW - Large Hadron Collider (LHC)
KW - artificial intelligence (AI)
KW - autoencoder
KW - hardware accelerator
KW - high-level synthesis (HLS)
KW - machine learning (ML)
KW - single-event effect (SEE) mitigation
UR - http://www.scopus.com/inward/record.url?scp=85111010012&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85111010012&partnerID=8YFLogxK
U2 - 10.1109/TNS.2021.3087100
DO - 10.1109/TNS.2021.3087100
M3 - Article
AN - SCOPUS:85111010012
SN - 0018-9499
VL - 68
SP - 2179
EP - 2186
JO - IEEE Transactions on Nuclear Science
JF - IEEE Transactions on Nuclear Science
IS - 8
M1 - 9447722
ER -