We use cookies. Find out more about it here. By continuing to browse this site you are agreeing to our use of cookies.
#alert
Back to search results

Senior Linux HPC Storage Engineer

Oak Ridge National Laboratory
life insurance, parental leave, 401(k), retirement plan, relocation assistance
United States, Tennessee, Oak Ridge
1 Bethel Valley Road (Show on map)
Oct 15, 2025

Requisition Id15469

Overview:

We are hiring a Senior Linux HPC Storage Engineer to design, operate and maintain clusters, servers, and workstations storage supporting services where science happens at ORNL! This position resides in the Emerging Technologies & Computing team in the Research Computing group in the Information Technology Services Directorate at Oak Ridge National Laboratory (ORNL).

The Emerging Technology Computational Group facilitates ORNL goals through HPC systems engineering, integration, and support for the research community at ORNL. By providing design, deployment, optimization, monitoring, and tooling support across multiple clustered infrastructures, we facilitate Lab-wide R&D projects. Our HPC clusters range in scope from just a handful of nodes to over fifty-thousand cores.

We partner with ORNL research organizations to enable research excellence and delivery. We work with other clustered computing and HPC groups to help research programs identify the best solutions for their needs. When we build our customer's environments, our team collaborates to design, implement, and maintain the systems from inception to retirement.

Major Duties/Responsibilities:

  • Architect, deploy, and manage large-scale HPC storage systems, including parallel file systems such as Lustre, GPFS/Spectrum Scale, BeeGFS and WEKA.
  • Design, implement, and operate large-scale Ceph storage clusters for HPC and research workloads, delivering reliable, high-performance object, block, and file storage services.
  • Ensure the availability, performance, scalability, and security of production storage environments.
  • Administer and optimize enterprise storage platforms such as Qumulo and NetApp in support of HPC and research workloads.
  • Design, deploy, and maintain archival storage solutions including Spectra Logic BlackPearl and large-scale tape libraries to ensure long-term data preservation and accessibility.
  • Integrate high-performance, enterprise, and archival storage layers into cohesive tiered storage architectures that balance cost, scalability, and performance for diverse scientific workflows.
  • Leverage automation and monitoring solutions to minimize day-to-day maintenance while identifying opportunities to optimize system performance and management.
  • Collaborate with researchers and technical POCs to support large data workflows and optimize I/O performance for scientific workloads.
  • Automate storage provisioning, monitoring, and maintenance using scripting and configuration management tools.
  • Diagnose and resolve complex storage and I/O-related issues in high-throughput, low-latency HPC environments.
  • Evaluate emerging storage technologies (NVMe, object storage, hierarchical storage management, burst buffers) and contribute to strategic planning for future HPC systems.
  • Work with 24/7 operations staff to streamline monitoring and troubleshooting, significantly reducing the need for off-hours support.
  • Deliver ORNL's mission by aligning behaviors, priorities, and interactions with our core values of Impact, Integrity, Teamwork, Safety, and Service. Promote equal opportunity by fostering a respectful workplace - in how we treat one another, work together, and measure success.

Basic Qualifications:

  • A BS degree in computer science, computer engineering, information technology, information systems, science, engineering, business, or a related discipline and a minimum of eight (8) to twelve (12) years of aligned professional experience is required for consideration. An overall combination of equivalent education and experience may be considered.
    • Masters and PhD degree holders in the same fields of study are also encouraged to apply:
      • Masters' holders should have a minimum of seven (7) to ten (10) years of relevant and aligned experience.
      • PhD holders should have a minimum of four (4) to six (6) years of relevant and aligned experience.
  • Five (5) or more years managing UNIX/Linux systems.
  • Demonstrated experience managing HPC storage and large-scale enterprise storage systems.
  • Three (3) or more years working with configuration management and automation tools such as Git, Jenkins, Ansible, or Puppet.
  • Proficiency with at least one scripting language (Bash, Python, Perl, etc.).
  • Strong Linux administration and advanced troubleshooting experience.
  • Experience supporting large data systems and/or HPC scientific workloads.
  • Strong desire to innovate and evaluate new technologies for HPC and storage environments.
  • Collaborative approach and ability to become a trusted advisor to research teams.

Preferred Qualifications:

  • Active DOE Q, DoD Top Secret, or TS/SCI clearance is strongly preferred.
  • Solid understanding of multiple operating systems and HPC cluster technologies.
  • Experience with Rocky/CentOS/RHEL, Ubuntu, VMware.
  • Understanding of HPC job schedulers (SLURM) and user support workflows.
  • Experience with container technologies in HPC environments.
  • Experience with multiple system deployment mechanisms (Warewulf, PXEboot, Cobbler, Bright).
  • Experience with GPU clusters (NVIDIA, AMD) for AI/ML and scientific workloads.
  • Deep expertise with high-performance parallel file systems (Lustre, GPFS/Spectrum Scale, BeeGFS, WEKA).
  • Knowledge of storage networking (Infiniband, NVMe-oF, SAN/NAS architectures).
  • Familiarity with RAID, ZFS, and object storage technologies.
  • Strong background in performance monitoring, benchmarking, and I/O optimization.
  • Experience with monitoring systems such as Grafana, CheckMK, Nagios, Zabbix, Ganglia.
  • Previous experience working in a government, scientific, or other highly technical environment.
  • Strong documentation skills and ability to prepare web-based documentation.

Special Requirements:

  • Visa sponsorship is not available for this position.
  • This position requires the ability to obtain and maintain clearance from the Department of Energy. As such, this position is a Workplace Substance Abuse (WSAP) testing designated position. WSAP positions require passing a pre-placement drug test and participation in an ongoing random drug testing program.

Security, Credentialing, and Eligibility Requirements:
For employment at Oak Ridge National Laboratory (ORNL), a Real ID compliant form of identification will be required. Additionally, ORNL is subject to Department of Energy (DOE) access restrictions. All employees must also be able to obtain and maintain a federal Personal Identity Verification (PIV) card as mandated by Homeland Security Presidential Directive 12 (HSPD-12) and Department of Energy (DOE) Order 473.1A, which requires a favorable post-employment background investigation.


To obtain this credential, new employees must successfully complete and pass a Federal Tier 1 background check investigation. This investigation includes a declaration of illegal drug activities, including use, supply, possession, or manufacture within the last year. This includes marijuana and cannabis derivatives, which are still considered illegal under federal law, regardless of state laws.


For foreign national candidates:
If you have not resided in the U.S. for three consecutive years, you are not eligible for the PIV credential and instead will need to obtain a favorable Local Site Specific Only (LSSO) risk determination to maintain employment. Once you meet the three-year residency requirement, you will be required to obtain a PIV credential to maintain employment.

About ORNL:

As a U.S. Department of Energy (DOE) Office of Science national laboratory, ORNL has an impressive 80-year legacy of addressing the nation's most pressing challenges. Our team is made up of over 7,000 dedicated and innovative individuals! Our goal is to create an environment where a variety of perspectives and backgrounds are valued, ensuring ORNL is known as a top choice for employment. These principles are essential for supporting our broader mission to drive scientific breakthroughs and translate them into solutions for energy, environmental, and security challenges facing the nation.

ORNL offers competitive pay and benefits programs to attract and retain individuals who demonstrate exceptional work behaviors. The laboratory provides a range of employee benefits, including medical and retirement plans and flexible work hours, to support the well-being of you and your family.

Employee amenities such as on-site fitness, banking, and cafeteria facilities are also available for added convenience.

Other benefits include the following: Prescription Drug Plan, Dental Plan, Vision Plan, 401(k) Retirement Plan, Contributory Pension Plan, Life Insurance, Disability Benefits, Generous Vacation and Holidays, Parental Leave, Legal Insurance with Identity Theft Protection, Employee Assistance Plan, Flexible Spending Accounts, Health Savings Accounts, Wellness Programs, Educational Assistance, Relocation Assistance, and Employee Discounts.

If you have difficulty using the online application system or need an accommodation to apply due to a disability, please email: ORNLRecruiting@ornl.gov.

#LI-CS1

This position will remain open for a minimum of 5 days after which it will close when a qualified candidate is identified and/or hired.

We accept Word (.doc, .docx), Adobe (unsecured .pdf), Rich Text Format (.rtf), and HTML (.htm, .html) up to 5MB in size. Resumes from third party vendors will not be accepted; these resumes will be deleted and the candidates submitted will not be considered for employment.

If you have trouble applying for a position, please email ORNLRecruiting@ornl.gov.

ORNL is an equal opportunity employer. All qualified applicants, including individuals with disabilities and protected veterans, are encouraged to apply. UT-Battelle is an E-Verify employer.

Applied = 0

(web-675dddd98f-rz56g)