Job Information
Oracle [Remote] Senior Site Reliability Developer- USC Required in Annapolis, Maryland
Job Description
Come and join us! Building on our cloud momentum, Oracle has formed a new organization— Oracle Health . This team focuses on product deployment, sustainability, troubleshooting, and product strategy while building a modern, automated healthcare platform. This is a net-new line of business with an entrepreneurial spirit, offering a unique opportunity to help build a world-class engineering organization centered on excellence, innovation, and real-world impact.
As a Site Reliability DevOps Engineer , you will play a critical role in operating and scaling a Clinical AI Assistant platform used by healthcare professionals worldwide . This system is designed to improve the quality, safety, and efficiency of care delivery for billions of patients globally . Your work will directly influence the reliability and performance of AI-driven systems that clinicians depend on in high-stakes environments.
This role goes beyond traditional SRE responsibilities—you will have the opportunity to leverage AI/ML techniques and develop AIOps solutions to proactively manage system reliability, detect anomalies, automate remediation, and continuously improve service performance. You will help define how reliability engineering evolves in the context of intelligent, AI-powered healthcare systems.
You will be responsible for architecture, production operations, capacity planning, performance management, deployment, and release engineering, working across cross-functional teams to deliver highly reliable, scalable, and secure services.
Responsibilities
Responsibilities
Own the architecture, design, implementation, and production operations of core platform and AI-driven system services
Ensure the reliability, availability, and performance of the Clinical AI Assistant platform used in real-world healthcare settings
Build and operate AIOps-driven capabilities (e.g., intelligent alerting, anomaly detection, automated remediation, predictive scaling)
Continuously improve systems through automation, self-healing mechanisms, and real-time observability
Design and develop software to enhance system scalability, efficiency, and resilience
Partner with cross-functional teams to prototype and deliver new platform services
Lead efforts in capacity planning, demand forecasting, performance tuning, and cost optimization
Solve complex distributed systems challenges in cloud-native environments and prevent recurrence through engineering rigor
Contribute to platform engineering best practices, including infrastructure as code, CI/CD, and service reliability standards
Stay current with emerging technologies in cloud, distributed systems, and AI/ML-driven operations
Key Requirements / Experience
Must-have:
Ability to obtain and maintain a federal security clearance (US citizenship required)
4-6 years of experience in Site Reliability Engineering, DevOps, or related roles
Proven experience operating large-scale, distributed, production systems with high availability requirements
Strong experience with container orchestration (Kubernetes, Docker, or similar)
Infrastructure as Code expertise (Terraform, Ansible, Chef, Puppet, Packer, etc.)
Experience building and operating CI/CD pipelines (Git, Jenkins, GitLab, Rundeck, etc.)
Proficiency in scripting and automation (Bash, Python, PowerShell, etc.)
Experience with at least one major cloud provider (OCI, AWS, Azure, etc.)
Strong Linux systems expertise
Experience with observability tooling (monitoring, logging, tracing) and performance optimization
Nice-to-have :
Experience supporting or operating AI/ML or LLM-based systems in production
Exposure to AIOps, intelligent automation, or ML-driven observability
Experience in healthcare or other regulated environments (HIPAA, security, compliance)
Background in high-throughput, low-latency systems supporting mission-critical workloads
Software engineering experience in Java, Python, C++, or similar languages
Why This Role Matters
You will be building and operating infrastructure that directly supports clinicians and healthcare providers around the world. The systems you design and maintain will impact clinical decision-making, operational efficiency, and ultimately patient outcomes at global scale . This is a rare opportunity to apply advanced SRE practices and AI-driven operations to one of the most meaningful domains—healthcare.
Disclaimer:
Certain US customer or client-facing roles may be required to comply with applicable requirements, such as immunization and occupational health mandates.
Range and benefit information provided in this posting are specific to the stated locations only
US: Hiring Range in USD from: $79,100 to $158,200 per annum. May be eligible for bonus and equity.
Oracle maintains broad salary ranges for its roles in order to account for variations in knowledge, skills, experience, market conditions and locations, as well as reflect Oracle's differing products, industries and lines of business.
Candidates are typically placed into the range based on the preceding factors as well as internal peer equity.
Oracle US offers a comprehensive benefits package which includes the following:
Medical, dental, and vision insurance, including expert medical opinion
Short term disability and long term disability
Life insurance and AD&D
Supplemental life insurance (Employee/Spouse/Child)
Health care and dependent care Flexible Spending Accounts
Pre-tax commuter and parking benefits
401(k) Savings and Investment Plan with company match
Paid time off: Flexible Vacation is provided to all eligible employees assigned to a salaried (non-overtime eligible) position. Accrued Vacation is provided to all other employees eligible for vacation benefits. For employees working at least 35 hours per week, the vacation accrual rate is 13 days annually for the first three years of employment and 18 days annually for subsequent years of employment. Vacation accrual is prorated for employees working between 20 and 34 hours per week. Employees working fewer than 20 hours per week are not eligible for vacation.
11 paid holidays
Paid sick leave: 72 hours of paid sick leave upon date of hire. Refreshes each calendar year. Unused balance will carry over each year up to a maximum cap of 112 hours.
Paid parental leave
Adoption assistance
Employee Stock Purchase Plan
Financial planning and group legal
Voluntary benefits including auto, homeowner and pet insurance
The role will generally accept applications for at least three calendar days from the posting date or as long as the job remains posted.
Career Level - IC3
About Us
Only Oracle brings together the data, infrastructure, applications, and expertise to power everything from industry innovations to life-saving care. And with AI embedded across our products and services, we help customers turn that promise into a better future for all. Discover your potential at a company leading the way in AI and cloud solutions that impact billions of lives.
True innovation starts when everyone is empowered to contribute. That’s why we’re committed to growing a workforce that promotes opportunities for all with competitive benefits that support our people with flexible medical, life insurance, and retirement options. We also encourage employees to give back to their communities through our volunteer programs.
We’re committed to including people with disabilities at all stages of the employment process. If you require accessibility assistance or accommodation for a disability at any point, let us know by emailing accommodation-request_mb@oracle.com or by calling 1-888-404-2494 in the United States.
Oracle is an Equal Employment Opportunity Employer. All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, national origin, sexual orientation, gender identity, disability and protected veterans’ status, or any other characteristic protected by law. Oracle will consider for employment qualified applicants with arrest and conviction records pursuant to applicable law.