MCS Site Reliability Engineer in Aurora, Colorado, United States

Job Information

SciTec MCS Site Reliability Engineer in Aurora, Colorado

SciTec, a wholly owned subsidiary of Firefly Aerospace, is a dynamic non-traditional defense contractor that delivers advanced technologies in support of U.S. National Security and Defense. For more than forty-five years, we have supported Department of Defense customers by developing innovative remote sensing algorithms, tools, and techniques to deliver world-class data exploitation capabilities supporting missile defense; intelligence, surveillance, and reconnaissance; space domain awareness; and aircraft survivability missions.

Important Notice: SciTec exclusively works on U.S. government contracts that require U.S. citizenship for all employees. Applicants who do not meet this requirement will not be considered.

SciTec has an immediate opportunity for a talented engineer to support our programs delivering Next-Generation Missile Warning software. This is a unique opportunity to join a business delivering core capabilities for national defense. You will work within a fast-paced team delivering end-to-end software processing of Overhead Persistent Infrared (OPIR) sensor data for Missile Warning, Missile Defense, Battlespace Awareness, and Technical Intelligence.

We are seeking an MCS Site Reliability Engineer (SRE) to support the reliability, performance, and operational excellence of mission-critical infrastructure services. This role sits within the Infrastructure as a Service (IaaS) team and focuses on availability, scalability, observability, and automation across compute, storage, networking, and platform services deployed at a customer site.

The ideal candidate is a strong systems engineer with an SRE mindset: someone who can troubleshoot complex infrastructure issues, improve system resilience, and reduce operational toil through automation.

Responsibilities

Support the availability, reliability, and performance of IaaS services supporting mission systems
Monitor infrastructure health using metrics, logs, and alerts; respond to and resolve incidents
Perform root-cause analysis for infrastructure and service outages; implement corrective and preventative actions
Improve system reliability through automation, standardization, and proactive engineering
Support capacity planning, performance analysis, and scaling of infrastructure services
Maintain and enhance monitoring, logging, and alerting solutions
Participate in incident response, on-call rotations (as required), and post-incident reviews
Collaborate with network, systems, platform, and application teams to resolve cross-stack issues
Support infrastructure lifecycle activities including upgrades, patches, and configuration changes
Apply security best practices and support compliance requirements in a regulated environment
Develop and maintain runbooks, procedures, and operational documentation
Contribute to CI/CD and Infrastructure-as-Code workflows supporting IaaS services
Participate in Agile ceremonies and operational planning activities
Perform other duties as assigned

Requirements

5+ years of professional experience in systems engineering, SRE, DevOps, or infrastructure operations
Strong experience administering Linux systems
Experience supporting on-prem, cloud, or hybrid infrastructure environments
Hands-on experience with monitoring, logging, and alerting systems
Strong troubleshooting skills across compute, storage, networking, and OS layers
Experience scripting or automating tasks using Bash, Python, or similar languages
Familiarity with Infrastructure as Code concepts and tooling
Strong verbal and written communication skills
Detail-oriented, self-motivated, and able to own issues through resolution
Ability to obtain and maintain a DoD security clearance
Ability to work on-site at the customer location

Candidates who have any of the following skills will be preferred:

Experience working on an IaaS or platform operations team
Experience with virtualization platforms (e.g., VMware vSphere)
Experience supporting container platforms (Kubernetes, OpenShift)
Experience with cloud environments (AWS, Azure, or GovCloud)
Familiarity with SRE concepts such as SLIs, SLOs, error budgets, and toil reduction
Experience with configuration management or automation tools (Ansible, Terraform)
Experience with CI/CD pipelines (GitLab CI, Jenkins, or similar)
Experience operating systems in government or secure environments
Experience with incident management and operational readiness reviews

*Resumes, cover letters, and applications that are generated by AI will not be considered for employment.

Colorado Residents: In any materials you submit, you may redact or remove age-identifying information such as age, date of birth, or dates of school attendance or graduation. You will not be penalized for redacting or removing this information.

Benefits

SciTec offers a highly competitive salary and benefits package, including:

4% Safe Harbor 401(k) match
100% company-paid HSA Medical insurance, with a choice of 2 buy-up options
80% company-paid Dental insurance
100% company-paid Vision insurance
100% company-paid Life insurance
100% company-paid Long-term Disability insurance
Short-term Disability insurance
Annual Profit-Sharing Plan
Discretionary Performance Bonus
Paid Parental Leave
Generous Paid Time Off, including Holiday, Vacation, and Sick Pay
Flexible work hours

The pay range for this position is $146,000 - $175,000 / year. SciTec considers several factors when extending an offer of employment, including but not limited to the role and associated responsibilities, a candidate's work experience, education/training, and key skills. This is not a guarantee of compensation.

SciTec is proud to be an Equal Opportunity employer. VET/Disabled.
