Job Information
UnitedHealth Group Software Engineering Lead in Bangalore, India
Optum is a global organization that delivers care, aided by technology, to help millions of people live healthier lives. The work you do with our team will directly improve health outcomes by connecting people with the care, pharmacy benefits, data and resources they need to feel their best. Here, you will find a culture guided by inclusion, talented peers, comprehensive benefits and career development opportunities. Come make an impact on the communities we serve as you help us advance health optimization on a global scale. Join us to start Caring. Connecting. Growing together.
We are seeking highly skilled Senior Developers with solid expertise in monitoring and supporting production systems and applications to ensure high availability and performance. The required skill set includes Java, Spring Boot, cloud-native development, Databricks, and Azure. The ideal candidate will be hands-on, capable of independently delivering complex modules, collaborating closely with architects and leads, and ensuring high-quality, scalable engineering solutions.
Primary Responsibilities:
Incident & Issue Management
Monitor production systems and applications to ensure high availability and performance
Respond to production incidents within defined SLAs and drive resolution
Perform root cause analysis (RCA) for recurring or critical issues
Escalate issues to appropriate engineering or vendor teams when required
Application & System Support
Provide Level 2 / Level 3 support for production applications and services
Troubleshoot application, database, and infrastructure issues
Support batch jobs, schedulers, and data pipelines; ensure successful daily operations
Validate fixes and coordinate deployments to production environments
Monitoring & Stability
Actively monitor alerts, logs, and dashboards to proactively detect issues
Improve system stability by identifying and mitigating potential risks
Participate in on-call rotations and weekend or after-hours support as needed
Change & Release Support
Support production releases, hotfixes, and configuration changes
Validate change requests and ensure adherence to change management processes
Perform post-deployment verification and rollback support if required
Automation & Continuous Improvement
Identify opportunities to automate manual operational tasks
Create or enhance monitoring, alerting, and self-healing mechanisms
Continuously improve incident response and operational processes
Documentation & Knowledge Management
Maintain runbooks and operational documentation
Document known issues, workaround procedures, and lessons learned
Share knowledge with team members to improve overall support efficiency
Collaboration & Communication
Coordinate with development, QA, infrastructure, and business teams
Provide clear production status updates to stakeholders and leadership
Participate in incident post-mortems and continuous improvement discussions
Compliance & Security
Ensure production support activities comply with security, audit, and regulatory requirements
Support audits and implement required controls in production environments
Comply with the terms and conditions of the employment contract, company policies and procedures, and any and all directives (such as, but not limited to, transfer and/or re-assignment to different work locations, change in teams and/or work shifts, policies regarding flexibility of work benefits and/or work environment, alternative work arrangements, and other decisions that may arise due to the changing business environment). The Company may adopt, vary or rescind these policies and directives in its absolute discretion and without any limitation (implied or otherwise) on its ability to do so.
Required Qualifications:
Full-time graduation degree or equivalent experience
Solid hands-on experience in Java, Spring Boot, Angular, Azure, Python, microservices, Kafka, APIs, and Databricks
Experience with Databricks, Azure SQL Managed Instance (SQL MI), and RDBMS platforms
Solid understanding of Azure cloud services and cloud-native development
Good understanding of Kafka and real-time data processing concepts
Proven ability to monitor production systems and applications to ensure high availability and performance
Proven ability to use AI tools for incident triage, root cause analysis, and troubleshooting
Proven ability to apply AI to log analysis, monitoring, and anomaly detection
Proven ability to leverage AI for automated alerts, diagnostics, and remediation support
Proven ability to ensure production stability and code quality using AI-assisted insights
Solid analytical and problem-solving skills
Proven ability to work independently as well as collaboratively in agile teams
Excellent communication and documentation skills
Preferred Qualification:
Exposure to AI/ML workflows or end-to-end model integration
At UnitedHealth Group, our mission is to help people live healthier lives and make the health system work better for everyone. We believe everyone, of every race, gender, sexuality, age, location and income, deserves the opportunity to live their healthiest life. Today, however, there are still far too many barriers to good health, which are disproportionately experienced by people of color, historically marginalized groups and those with lower incomes. We are committed to mitigating our impact on the environment and to enabling and delivering equitable care that addresses health disparities and improves health outcomes, an enterprise priority reflected in our mission.