Scalable Data Analysis Applications for High Energy Physics

PIs: Douglas Thain and Kevin Lannon

Scalable Data Analysis Applications for High Energy Physics image

For more than 10 years, we have collaborated with Prof. Kevin Lannon and the CMS physics group at Notre Dame to design and build large scale data analysis applications that interpret data produced by the Compact Muon Solenoid detector at CERN. These applications are both interesting and challenging from a computer science perspective, because they must consume large quantities of data (Terabytes to Petabytes), scale up to thousands of nodes in clusters, and yet also remain reliable and responsive to the end user. Our latest work makes use of the TaskVine framework along with software such as Dask and Coffea as the foundation to create a variety of custom applications, including Lobster, TopEFT, DV4, RS-Triphoton, and more. We continue to innovate at the interface between computer science and physical science.

Related Publications

  1. Reshaping High Energy Physics Applications for Near-Interactive Execution Using TaskVine
    Barry Sly-Delgado, Ben Tovar, Jin Zhou, and Douglas Thain
    In ACM/IEEE Supercomputing, 2024
  2. Shepherd: Seamless Integration of Service Workflows into Task-Based Workflows through Log Monitoring
    Saiful Islam and Douglas Thain
    In Workshop on Workflows at ACM Supercomputing, 2024
  3. Dynamic Task Shaping for High Throughput Data Analysis Applications in High Energy Physics
    Ben Tovar, Ben Lyons, Kelci Mohrman, Barry Sly-Delgado, Kevin Lannon, and Douglas Thain
    In IEEE International Parallel and Distributed Processing Symposium, 2022
    doi: 10.1109/IPDPS53621.2022.00041
  4. Analysis Cyberinfrastructure: Challenges and Opportunities
    Kevin Lannon, Paul Brenner, Michael Hildreth, Kenya Hurtado Anampa, Alan Malta, Rodrigues, Kelci Mohrman, Douglas Thain, and Ben Tovar
    In Snowmass, 2022
  5. Scaling Data Intensive Physics Applications to 10k Cores on Non-Dedicated Clusters with Lobster
    Anna Woodard, Matthias Wolf, Charles Mueller, Nil Valls, Ben Tovar, Patrick Donnelly, Peter Ivie, Kenyi Hurtado Anampa, Paul Brenner, Douglas Thain, Kevin Lannon, and Michael Hildreth
    In IEEE Conference on Cluster Computing, 2015