
Batch and Cloud Computing System Administrator

SLAC National Accelerator Laboratory is home to a two-mile linear accelerator—the longest in the world. Originally a particle physics research center, SLAC is now a multipurpose laboratory for astrophysics, photon science, accelerator and particle physics research.
The computational science program at SLAC includes data processing and analysis of experimental data from astrophysics, photon science, accelerator and particle physics programs and advanced simulations in accelerator modeling, material science, catalysis and cosmology.

The Computing Division at SLAC is dedicated to providing leadership and support in computing and communications to the laboratory as a whole, and to scientific research in particular.  We’re looking for a System Administrator with experience in administering batch computing farms in a grid/cloud environment. Reporting to the Technical Lead for Scientific Computing Services, the successful candidate will deploy and operate batch computing services and associated services for grid/cloud access.
Specific responsibilities (including but not limited to):
  • Operate batch compute farms at SLAC securely and efficiently.
  • Provide queue configuration
  • Manage the life cycle of batch farms, from procurement and installation through retirement
  • Provide grid/cloud services and assist users with transitions.
  • Participate in Open Science Grid technical forums.
  • Work with peers within the Computing Division to ensure that core infrastructure services such as networking and data center requirements are delivered.
  • Work closely with the SLAC scientific staff
  • Contribute to the development of technical roadmaps for Cloud Computing and batch farms
  • Contribute metrics to a reporting system and dashboard for Computing Division KPIs (Key Performance Indicators)
  
Qualifications:
• Bachelor’s degree in computer science, information technology, or a closely related field, or a combination of training and extensive experience
• Must have 2+ years of experience in high performance computing environments
• Must have demonstrated knowledge of and experience in administering one or more batch queuing systems such as PBS Pro, Condor, Grid Engine or LSF
• Demonstrated knowledge and experience in grid and/or cloud computing
• Proven ability to successfully manage key relationships with internal customers
• Demonstrated knowledge and experience in at least one variant of the UNIX operating system
• Demonstrated knowledge and experience in one or more scripting languages such as Python, Perl, or Ruby
• Must be capable of managing multiple priorities simultaneously under tight timeframes when necessary
• Effective presentation, written and verbal communication skills
• Demonstrated willingness to learn advanced technology for research computing and commitment to continuous improvement
Desired Skills:
• Advanced knowledge of multicore and/or GPU architectures
• Demonstrated technical knowledge in the use of virtualization in an HPC environment
• Experience working with scientific and technical researchers in a consensus-driven environment
• Technical knowledge of large-scale file systems is highly desirable.

Please Note:  The SLAC National Accelerator Laboratory values diversity and is an affirmative action, equal opportunity employer. SLAC confirms employment authorization for all new hires through the E-Verify Program.

Final candidates, with the exception of Staff Scientists, are subject to background checks prior to commencement of employment at the SLAC National Accelerator Laboratory.

