Innovim Career logo

Principal System Administrator - NASA (REMOTE)

Innovim Career
Full-time
Remote

Innovim is seeking a Principal System Administrator to support all phases of system development and operations support on our EED (EOSDIS Evolution and Development) team. These activities include technical analysis and design as well as support of development and operations staff in testing and deploying the system to operational environment. Participation also involves the transition of software from development to production by performing deployment activities within the project life-cycle.

This position may work remotely. Relocation is NOT available. Quarterly travel for a few days may be required for quarterly planning.

Duties & Responsibilities:

Systems Administrators are responsible for effective provisioning, installation, configuration, operation, and maintenance of system COTS, application software and related infrastructure.  Specific duties and responsibilities include:

  • Provisioning physical and virtual servers and application services, peripherals, settings, storage, etc. in accordance with NASA standards and project/operational requirements.
  • Developing and maintaining procedures for installation, configuration, and daily operations of hardware and software.
  • Performing daily system monitoring, verifying the integrity and availability of all hardware, server resources, systems, and key processes, updating monitoring as needed.
  • Repairing and recovering from hardware or software failures. Coordinate and communicate with Operations and impacted end user communities.
  • Applying OS, application, and hardware patches and upgrades on a routine basis, automating where possible.
  • Implementing R&D deployments of new application services, and improvements upon existing services.
  • Performing ongoing performance, system and application tuning, hardware upgrades, and resource optimization as required. Configure CPU, memory, and disk partitions as required.
  • Monitoring virtual computing environment, servers, virtual machines, data stores, networks, with ability to identify preventive measures based on trends.
  • Developing Puppet manifests or Ansible code to automate the configuration and deployments of configurations.
  • Communicating progress and obstacles daily via Scale Agile Framework scrum and Jira trouble-ticketing system.

Required Skills: 

  • Must have a Bachelor’ degree in a technical major, such as engineering or computer science, and at least 10 years of system administration experience.
  • At least 4 years of experience as a Linux System Administrator (Red Hat Enterprise Linux 8/9 or other Red Hat distros).
  • Experience with automation using scripting languages such as Shell, Bash, or Python.
  • Demonstratable skills in analyzing and correcting application and network anomalies.
  • Knowledge of VMware vSphere (version 7 or higher) virtualization platform, in provisioning VMs, virtual storage, and virtual networking resources with ability to automate many simple tasks.
  • Demonstrated ability and commitment to share knowledge via written articles and technical talks with immediate and peer teams.

Desired Qualifications: 

  • Certification as a Red Hat Certified Engineer (RHCE).
  • Experience with database such as Postgres, MySQL or Oracle.
  • Ability to troubleshoot basic network configuration and troubleshooting.
  • Understanding of high availability, data/system replication, and disaster recovery methodologies in the system and database spaces.
  • Knowledgeable of container technologies (Docker, Kubernetes) and how to orchestrate and manage container lifecycles.
  • Experience with automation frameworks such as Puppet, Chef, Ansible, Salt.
  • Experience supporting software development teams, tools, and processes.
  • Experience with Agile development methodologies and the Atlassian Tool Suite (JIRA, Confluence, Bamboo, Bitbucket).
  • Knowledgeable of web service platforms such as Apache, Nginx, Tomcat, Ruby on Rails, Clojure, and Java.
  • Knowledgeable of storage volume management, storage area networks, and file systems.

May have deep knowledge of project management. Advanced knowledge of related disciplines within work area and ability to identify links and potential impact on projects, programs, or systems.

 
Typically requires:

A University Degree or equivalent experience and minimum 10 years prior relevant experience, or An Advanced Degree in a related field and minimum 8 years of experience in Engineering/Other Technical Positions: Typically requires a degree in Science, Technology, Engineering or Mathematics (STEM) and a minimum of 8 years of prior relevant experience unless prohibited by local laws/regulations.

INNOVIM is committed to providing superior work in the fields of science, engineering, data analytics and technology to government agencies. We offer competitive compensation packages, including comprehensive nationwide Medical/Dental/Vision insurance programs, life insurance, matching 401k contribution and Educational/Training support.