Portable Acceleration of Cms Computing Workflows With Coprocessors as a Service
No Thumbnail Available
Date
2024
Authors
Gürpınar Güler, Emine
Güler, Yalçın
Journal Title
Journal ISSN
Volume Title
Publisher
Springer Nature
Open Access Color
HYBRID
Green Open Access
Yes
OpenAIRE Downloads
40
OpenAIRE Views
73
Publicly Funded
Yes
Abstract
Computing demands for large scientific experiments, such as the CMS experiment at the CERN LHC, will increase dramatically in the next decades. To complement the future performance increases of software running on central processing units (CPUs), explorations of coprocessor usage in data processing hold great potential and interest. Coprocessors are a class of computer processors that supplement CPUs, often improving the execution of certain functions due to architectural design choices. We explore the approach of Services for Optimized Network Inference on Coprocessors (SONIC) and study the deployment of this as-a-service approach in large-scale data processing. In the studies, we take a data processing workflow of the CMS experiment and run the main workflow on CPUs, while offloading several machine learning (ML) inference tasks onto either remote or local coprocessors, specifically graphics processing units (GPUs). With experiments performed at Google Cloud, the Purdue Tier-2 computing center, and combinations of the two, we demonstrate the acceleration of these ML algorithms individually on coprocessors and the corresponding throughput improvement for the entire workflow. This approach can be easily generalized to different types of coprocessors and deployed on local CPUs without decreasing the throughput performance. We emphasize that the SONIC approach enables high coprocessor usage and enables the portability to run workflows on different types of coprocessors. © The Author(s) 2024.
Description
Keywords
CMS, Machine learning, Offline and computing, ddc:004, FOS: Computer and information sciences, CERN Lab, Physics - Instrumentation and Detectors, [PHYS.HEXP] Physics [physics]/High Energy Physics - Experiment [hep-ex], cms, CMS; Machine learning; Offline and computing, FOS: Physical sciences, [INFO] Computer Science [cs], programming, High Energy Physics - Experiment, computer: network, Machine Learning, High Energy Physics - Experiment (hep-ex), Machine learning, [PHYS.HEXP]Physics [physics]/High Energy Physics - Experiment [hep-ex], multiprocessor: graphics, cloud, [INFO]Computer Science [cs], [PHYS.PHYS.PHYS-INS-DET]Physics [physics]/Physics [physics]/Instrumentation and Detectors [physics.ins-det], computer, CMS; Machine learning; Offline and computing;, 000, PARTICLE PHYSICS;LARGE HADRON COLLIDER;CMS, CMS, graphics, Research, DATA processing & computer science, acceleration, Instrumentation and Detectors (physics.ins-det), LARGE HADRON COLLIDER, Offline and computing, offline and computing, 004, CERN LHC Coll, machine learning, Computer Science - Distributed, Parallel, and Cluster Computing, [PHYS.PHYS.PHYS-INS-DET] Physics [physics]/Physics [physics]/Instrumentation and Detectors [physics.ins-det], network, Grid computing, microprocessor, PARTICLE PHYSICS, data management, Distributed, Parallel, and Cluster Computing (cs.DC), info:eu-repo/classification/ddc/004, performance, LHC, High energy physics, Experimental particle physics
Turkish CoHE Thesis Center URL
Fields of Science
Citation
WoS Q
N/A
Scopus Q
Q2

OpenCitations Citation Count
N/A
Source
Computing and Software for Big Science
Volume
8
Issue
1
Start Page
End Page
PlumX Metrics
Captures
Mendeley Readers : 16
SCOPUS™ Citations
3
checked on Feb 04, 2026
Google Scholar™

OpenAlex FWCI
3.3474644
Sustainable Development Goals
4
QUALITY EDUCATION

7
AFFORDABLE AND CLEAN ENERGY

9
INDUSTRY, INNOVATION AND INFRASTRUCTURE

11
SUSTAINABLE CITIES AND COMMUNITIES

13
CLIMATE ACTION

15
LIFE ON LAND


