Description
The NMCI Service Management Integration and Transport (SMIT) group at Leidos is seeking a talented Site Reliability Engineer to enhance the reliability, performance, and scalability of complex distributed systems. As part of the SMIT Contract, the Leidos team manages the essential infrastructure for the Navy-Marine Corps Intranet, encompassing cybersecurity services, network operations, network engineering, service desk support, seat services, and data transportation.
The SRE will play a crucial role in developing and conducting tests aimed at system resilience, performance under load, and failure scenarios. Collaborating closely with fellow Site Reliability Engineers and development teams, you will create automated testing frameworks that replicate real-world conditions, ensuring system reliability while meeting established service level objectives (SLOs). Your contributions will be vital to building robust and scalable services that operate efficiently in production environments.
Your responsibilities will include maintaining complex computer systems by automating software releases, monitoring systems, and proactively identifying and resolving issues before they impact users. Through these efforts, you will enhance overall site performance and reliability.
This role is also responsible for supporting migration, automation, optimization of software development and deployment processes, infrastructure as code, and advancing the maturity of the Site Reliability Engineering program.
Primary Responsibilities:
Basic Qualifications:
Preferred Qualifications:
If you are driven by challenges and looking for a dynamic environment, we encourage you to apply. Join Leidos, where we think outside the box and drive innovation to meet mission demands.