OneMain Financial Jobs

Job Information

Amazon Senior Systems Development Engineer, Global Edge Compute (Amazon Robotics) in Austin, Texas

Description

We're seeking a Senior Systems Development Engineer to join the Unified Workcell Compute (UWC) team, reporting directly to UWC leadership. This is a high-ownership, high-impact role where you'll architect and build foundational systems that manage Amazon's edge device fleet - over a million devices across thousands of locations worldwide. You'll work at the intersection of cloud infrastructure, device management, robotics systems, and operational excellence, solving significantly complex technical problems that enable Amazon's robotics and fulfillment operations to scale globally.

As a Senior SysDE, you'll be a technical leader who defines technical strategy, drives architectural decisions, and builds systems that enable robotics and automation teams to deploy and manage their edge compute solutions with the same ease as deploying to AWS. You'll work with ambiguity, translating undefined business problems into concrete technical solutions while balancing short-term tactical needs with long-term strategic vision. This role requires deep technical expertise across multiple domains - Linux systems, AWS services, IoT platforms, robotics compute infrastructure, and large-scale distributed systems - combined with the ability to influence and mentor engineers across the organization.

Key job responsibilities

• Build and deploy highly resilient, scalable distributed systems that operate at Amazon scale, managing robotics device fleets across thousands of sites with 99.99%+ availability requirements

• Lead the technical strategy for your team's systems within the UWC architecture, influencing decisions around hyperscale deployments, robotics compute patterns, fleet management, and edge device automation.

• Drive architectural reviews and design discussions across UWC and robotics customer teams, providing technical guidance on device lifecycle management, software distribution, multi-compute workcell support, and operational excellence patterns.

• Develop automation solutions using Python, Rust, CDK, and AWS services that eliminate entire classes of operational load and enable self-service for robotics solution teams.

• Implement and optimize Linux-based systems, OS image creation pipelines (Yocto/mkosi), and BSP solutions for diverse robotics hardware platforms including x86, ARM, NVIDIA GPU systems, and embedded devices.

• Create tooling and frameworks that enable robotics teams to provision, configure, and manage their edge compute fleets - from AI perception systems to manipulation robotics - with minimal hands-on-keyboard time.

• Define and apply standards for engineering, testing, and operational excellence best practices for your team and partner teams within the UWC and robotics organizations.

• Lead projects that streamline or eliminate excess processes, improving agility and reducing complexity for robotics teams building on UWC.

• Proactively identify and mitigate risks at the product and service level, improving the resilience, performance, and cost efficiency of UWC systems assisting critical robotics operations.

• Troubleshoot complex production issues across the full stack - from robotics device hardware and Linux kernel to AWS cloud services - extrapolating patterns and implementing general solutions that prevent future incidents.

• Partner with robotics solution teams (Amazon Robotics, manipulation systems, AI perception, workcell automation) to understand their device management challenges and architect solutions that meet their specific requirements.

• Partner with Product Managers, Technical Program Managers, and Software Development Managers across UWC and robotics teams to translate business requirements into technical solutions and drive execution.

• Mentor and develop engineers on your team and across the organization, raising the technical bar and building engineering capabilities that support Amazon's robotics strategy.

• Lead by example, delivering solutions that are inventive, resilient, and extensible, making it easier for robotics teams to build on UWC.

• Participate in hiring, technical assessments for promotions, and performance discussions to strengthen the engineering organization supporting Amazon's robotics initiatives.

A day in the life

Your day might start by investigating a critical issue where 5,000 robotics devices across multiple fulfillment centers are experiencing intermittent kernel panics during high-load operations. You dive deep into kernel logs, memory dumps, and device telemetry, correlating the failures with a recent driver update for NVIDIA GPU systems. You quickly develop a Python or Rust-based diagnostic tool to capture more granular system metrics and work with your team to roll back the problematic driver version while engineering a proper fix that addresses the underlying memory management issue.

Mid-morning, you're troubleshooting why a new OS image isn't booting correctly on ARM-based manipulation robotics devices. You boot into a recovery environment, examine the initramfs, trace through systemd unit dependencies, and discover a race condition in the device initialization sequence. You modify the Yocto recipe to fix the boot ordering, test across multiple hardware variants, and document the pattern for other teams building custom images. You then lead a quick sync with an Amazon Robotics team to help them debug why their software components are failing to deploy - walking through IoT certificate validation, network connectivity from the edge device, and AWS IAM permissions until you identify a misconfigured security group.

After lunch, you're deep in code review for a new credential rotation service providing written feedback on error handling patterns, memory safety, and how to better structure the state machine for resilience. You spend time optimizing a Linux system configuration that's causing performance bottlenecks on AI perception systems - configuring and optimizing Linux system parameters to drive high-performance compute workloads at scale . You mentor a mid-level engineer who's struggling with a complex Yocto build failure, helping them understand layer dependencies and BitBake recipe inheritance while teaching them debugging techniques they can apply independently.

The afternoon includes responding to an urgent page where devices in a specific building can't connect to AWS IoT Core. You systematically eliminate possibilities - checking DNS resolution, testing TLS handshakes, examining certificate chains, and analyzing network packet captures - until you discover a misconfigured firewall rule blocking MQTT traffic. You implement a monitoring enhancement to detect this class of issue proactively across all sites. You then write a technical design document proposing improvements to UWC's device provisioning workflow that will reduce provisioning time from 20 minutes to under 10 minutes by parallelizing certificate generation and optimizing the Linux boot sequence. You'll end your day reviewing system metrics across the fleet, identifying devices with degraded disk I/O that need proactive maintenance, and ensuring your team is unblocked for tomorrow's work.

About the team

The Unified Workcell Compute (UWC) team is at the forefront of Amazon's robotics and automation efforts, building and operating the foundational device management platform for Amazon's on-premise edge compute fleet. Our services manage over a million robotic devices across thousands of locations worldwide - from the latest NVIDIA GPU offerings supporting AI perception efforts to bleeding-edge manipulation robotics systems, industrial PCs, thin clients, Drive Units, and embedded devices across Amazon's global fulfillment network.

Our mission is to enable robotics solution teams to deploy to Operations buildings with the same self-service, ownership, and accountability as deploying to AWS cloud. We're revolutionizing Amazon's logistics and fulfillment operations by pushing the boundaries of what's possible in automation and compute management at unprecedented scale.

We're a team of builders who value automation, operational excellence, and customer obsession. We own a critical technology ecosystem that powers device provisioning, software distribution, credential management, and fleet operations for robotics workcells and fulfillment systems. Our work directly impacts millions of customer orders and enables Amazon's promise to fast, reliable delivery. We're solving problems that few organizations face, building systems that have never existed before, and defining the future of edge compute management for robotics at Amazon scale.

We foster a culture that encourages personal and professional growth, empowering our team members to continually expand their skills and knowledge. Work-life balance is a priority for us, and we strive to create an environment where our team can thrive both professionally and personally.

Basic Qualifications

  • Experience leading the design, automation, deployment, and support of large-scale infrastructure

  • Experience programming with at least one modern language such as Python, Ruby, Golang, Java, C++, C#, Rust

  • Experience with Linux/Unix

  • Experience with CI/CD pipelines build processes

Preferred Qualifications

  • Experience with distributed systems at scale

Amazon is an equal opportunity employer and does not discriminate on the basis of protected veteran status, disability, or other legally protected status.

Our inclusive culture empowers Amazonians to deliver the best results for our customers. If you have a disability and need a workplace accommodation or adjustment during the application and hiring process, including support for the interview or onboarding process, please visit https://amazon.jobs/content/en/how-we-hire/accommodations for more information. If the country/region you’re applying in isn’t listed, please contact your Recruiting Partner.

The base salary range for this position is listed below. Your Amazon package will include sign-on payments and restricted stock units (RSUs). Final compensation will be determined based on factors including experience, qualifications, and location. Amazon also offers comprehensive benefits including health insurance (medical, dental, vision, prescription, Basic Life & AD&D insurance and option for Supplemental life plans, EAP, Mental Health Support, Medical Advice Line, Flexible Spending Accounts, Adoption and Surrogacy Reimbursement coverage), 401(k) matching, paid time off, and parental leave. Learn more about our benefits at https://amazon.jobs/en/benefits .

USA, TX, Austin - 151,200.00 - 204,600.00 USD annually

DirectEmployers