Reliability Engineer
Systems Engineering Solutions Corporation
life insurance, paid time off, 401(k)
Mar 31, 2026
This role supports the U.S. Air Force Cloud One Architecture and Common Shared Services contract and currently has an opening for a Reliability Engineer. The Reliability Engineer is responsible for ensuring the availability, performance, scalability, and resiliency of mission-critical systems. This role applies software engineering principles to infrastructure and operations, with a strong emphasis on automation, monitoring, incident response, and continuous reliability improvement. The Reliability Engineer serves as the bridge between development, operations, and platform teams to ensure production systems consistently meet defined service level objectives (SLOs) while supporting rapid, safe delivery of new capabilities.
Location: This position will be hybrid remote. Candidates will be required to work onsite as needed. Candidates located near Hanscom AFB (Boston, MA) are preferred.
System Reliability & Availability
Monitoring, Observability & Alerting
Incident Response & Problem Management
Automation & Engineering Excellence
Reliability-Focused Engineering
Collaboration & Governance
Required Skills:
* Bachelor's degree and eight (8) or more years of experience, or Master's degree and six (6) or more years of experience. Additional experience may be accepted in lieu of a degree.
* Active Secret clearance, at a minimum, required to start
* US citizenship required
* Experience with cloud platforms (AWS, Azure, OCI, or GCP), including managed services
* Experience with containerized environments (Docker, Kubernetes)
* Familiarity with CI/CD pipelines and deployment automation
* SLOs and error budgets
* Capacity modeling and performance testing
* Strong understanding of:
  * Distributed systems and high-availability architectures
  * Linux/Windows system administration
  * Networking fundamentals (DNS, TCP/IP, load balancing)
* Hands-on experience with:
  * Monitoring and observability tools (e.g., Prometheus, Grafana, ELK/Elastic, Datadog, Azure Monitor)
  * Infrastructure as Code (Terraform, ARM, CloudFormation)
  * Scripting or programming languages (Python, Bash, Go, PowerShell, or similar)
* Experience supporting incident management and on-call operations
Preferred Skills
SES provides a competitive salary and the following benefits:
life insurance, paid time off, 401(k)