SRP-HPC Tech Projects in the DOE Supercomputing Facilities

Do you want to learn more about what it takes to build, deploy, operate, maintain, and utilize some of the largest supercomputers in the world? You can get that opportunity by collaborating with staff of the three US Department of Energy (DOE) Office of Science User Facilities:

•          Argonne Leadership Computing Facility (ALCF)

•          National Energy Research Scientific Computing Center (NERSC)

•          Oak Ridge Leadership Computing Facility (OLCF)

 Supported by DOE’s Advanced Scientific Computing Research (ASCR) program, these facilities offer researchers computing and data analysis resources many times more powerful than they can generally access elsewhere. Every year, researchers use these resources to achieve breakthroughs ranging from understanding the smallest building blocks of matter to how supernovae spread elements across the universe. 

Working at one of the ASCR Facilities affords you with numerous opportunities to align your research or technical interests to activities within the Facility.  Whether you want to work on the hardware, software, or contribute to the development of scientific applications, you can do it at a Facility.  Here are just a few opportunities for engagement within the Facilities.   

•          Help port and test scientific applications, libraries, software, and tools onto next-generation systems like Perlmutter, Aurora, and Frontier

•          Help improve algorithms and application performance of large scientific applications

•          Contribute to the development of continuous integration (CI) pipelines for large and distributed scientific applications

•          Contribute to the development of end-to-end scientific workflow technologies

•          Explore emerging large-scale data science and artificial intelligence (AI) methods for scientific applications

•          Identify gaps in the system software and hardware stack and develop, harden, and deploy solutions for file and storage systems, system architecture, non-volatile memory, data management, high-performance networking, intelligent facility operations, data analytics, and more

•          Build and package scientific software using Spack

•          Develop tools for the next-generation systems to optimize the user experience on these cutting-edge resources

•          Contribute to the development of tools for the administration of data management platforms to extract and analyze telemetry, event logs, and system state information to ensure the security of these national resources