Unlimited Job Postings Subscription - $99/yr!

Job Details

Senior Site Reliability Engineer

  2026-05-04     Aptino     Houston,TX  
Description:

Job Description: Senior Site Reliability Engineer – Agentic Operations. Lynx is evolving from a traditional, human driven production support model to an automation first, AI assisted reliability platform following its migration from on prem infrastructure to McKesson's Azure cloud. This role builds directly on the existing Site Reliability Engineer responsibilities for Lynx operating, stabilizing, and improving highly available production systems—while extending them to include agentic AI applied specifically to production operations, incident response, and reliability workflows. This is a senior/staff level scope role, focused on owning reliability automation across services, not a single application. Required Qualifications: • 7+ years of Site Reliability Engineering or Production Engineering experience • Demonstrated experience automating infrastructure and operational workflows • Deep understanding of SRE principles (SLIs, SLOs, error budgets) • Strong hands on experience with: Azure cloud infrastructure Kubernetes and Docker Java production systems CI/CD pipelines (GitHub Actions) Observability platforms (Dynatrace strongly preferred)


Apply for this Job

Please use the APPLY HERE link below to view additional details and application instructions.

Apply Here

Back to Search