High Performance Computing Engineer/Sr. Systems Administrator

  • The MITRE Corporation
  • 1 Olympic Plaza, Colorado Springs, CO 80909, USA
  • Nov 09, 2020
[Information Technology]

Job Description

Why choose between doing meaningful work and having a fulfilling life? At MITRE, you can have both. That's because MITRE people are committed to tackling our nation's toughest challenges, and we're committed to the long-term well-being of our employees. MITRE is different from most technology companies. We are a not-for-profit corporation chartered to work for the public interest, with no commercial conflicts to influence what we do. The R&D centers we operate for the government create lasting impact in fields as diverse as cybersecurity, healthcare, aviation, defense, and enterprise transformation. We're making a difference every day, working for a safer, healthier, and more secure nation and world. Our workplace reflects our values. We offer competitive benefits, exceptional professional development opportunities, and a culture of innovation that embraces diversity, inclusion, flexibility, collaboration, and career growth. If this sounds like the choice you want to make, then choose MITRE and make a difference with us.

MITRE's Enterprise Technical Computing (ETC) division provides multiple compute services including High Performance Computing (HPC), DevOps Tools, Internal/External Cloud Services, and Protected Computing to MITRE research organizations.

The HPC group is looking for an experienced Linux systems administrator with HPC experience and programming skills to join our team. The HPC team is responsible for purchasing, deploying, and maintaining HPC hardware and user tools for over 450 MITRE employees. The HPC team engages with projects across MITRE to build and support computing environments for Artificial Intelligence, Deep Learning, Machine Learning, Data Analysis, Modeling and Simulation, and more.

Responsibilities:

  • Maintain, monitor, and update the basic operating systems, user access, and the clusters' resource manager.
  • Both lead and collaborate on projects to maintain and enhance system functionality, in areas such as systems monitoring, scheduling and resource management, configuration management, and backups.
  • Work both independently and as part of the group to recognize and diagnose problems, then develop and implement solutions.
  • Leverage tools and develop scripts to implement task automation on the HPC systems.
  • Participate in the team-oriented agile development and management process for HPC systems.
  • Provide technical support to the MITRE HPC user community via the corporate ticketing system.
  • Install software, including programming languages, applications, and device drivers.
  • Conduct and document performance and evaluation tests or experiments on the HPC clusters.
  • Develop and maintain documentation on the clusters and related processes.
  • Maintain the systems in compliance with NIST SP 800-171 security requirements.

Minimum Qualifications:

  • Bachelor's Degree in Computer Science, Computer Engineering, or a similar field.
  • Linux systems administration experience (3+ years).
  • Software development experience in at least one of the following languages: C, C++, Perl, Python, Java, shell scripts.
  • Hands-on experience with basic computer networking and routing, including troubleshooting basic internet protocols (DNS, DHCP, BOOTP, SMTP, S/FTP, HTTP/S, NFS, CIFS/SMB, PKI, TCP, UDP).
  • Applicants selected for this position will be subject to a government security investigation and must meet eligibility requirements for access to classified information. Only US citizens are eligible for a security clearance. For this position, MITRE will consider only applicants with security clearances or applicants who are eligible for security clearances.

Desired Skills:

  • Experience in systems administration on distributed computing systems.
  • Familiarity with HPC-specific resources and technologies, such as GPUs, FPGAs, MPI, InfiniBand, and Omni-Path.
  • Experience with resource managers and schedulers for HPC clusters such as Slurm, Moab, TORQUE, or PBS.
  • Experience with Docker, Singularity, or another container technology.
  • Experience with logging and monitoring tools such as Splunk.
  • Experience with configuration management systems such as Chef or Puppet.
  • Candidates holding current / active US Government security clearance(s) are preferred.

MITRE is proud to be an equal opportunity employer. MITRE recruits, employs, trains, compensates, and promotes regardless of race, religion, color, national origin, gender, gender expression, sexual identity, disability, age, veteran status, or other protected status.

MITRE intends to maintain a website that is fully accessible to all individuals. If you are unable to search or apply for jobs and would like to request a reasonable accommodation for any part of MITRE's employment process, please contact MITRE's Recruiting Help Line at 703-###-#### or email at ...@mitre.org.

Copyright 1997-2020, The MITRE Corporation. All rights reserved. MITRE is a registered trademark of The MITRE Corporation. Material on this site may be copied and distributed with permission only.

