Job Information
Figma, Inc. Manager, Software Engineering Observability in NEW YORK, New York
As the Engineering Manager for Observability, you will lead a team of five engineers responsible for shaping the future of visibility and efficiency at Figma. Youll define the strategy for instrumentation standards and cost transparency, drive initiatives to optimize observability footprint and spend, and explore innovative AI-driven approaches to anomaly detection and operational automation. This role is well-suited for a leader with strong distributed systems experience who is motivated by platform leverage, cross-functional impact, and building systems that enable every engineering team to operate with confidence and precision.This is a full time role that can be held from one of our US hubs or remotely in the United States.#### What youll do at Figma: * Lead and grow a team of engineers responsible for the reliability, scalability, and evolution of Figmas observability and cost engineering platforms * Own and operate Figmas core observability stack, including vendor platforms such as Datadog, ensuring high availability, strong data quality, and effective signal-to-noise across metrics, logs, and traces * Define and drive the technical strategy for instrumentation standards, observability libraries, agents, and operators used to monitor internal and external facing services * Explore and implement innovative, AI-driven approaches to anomaly detection, root cause analysis, signal correlation, and operational automation * Establish clear frameworks for cost attribution, budgeting, forecasting, and alerting across infrastructure and observability spend, enabling teams to make informed tradeoffs * Partner with infrastructure, product engineering, finance, and security teams to improve visibility into system health and cost efficiency at scale * Lead initiatives to optimize observability footprint and spend, balancing depth of insight with performance and cost considerations * Coach and mentor engineers through career development, performance feedback, and technical leadership, fostering a culture of ownership, collaboration, and high quality execution#### We'd love to hear from you if you have: * 4+ years of experience leading infrastructure, observability, or platform engineering teams, with a track record of delivering highly reliable production systems * Deep hands-on experience with modern observability platforms (e.g., Datadog, OpenTelemetry) across metrics, logs, and distributed tracing * Strong understanding of distributed systems, instrumentation best practices, SLO design, and incident response workflows * Experience driving cost transparency and accountability initiatives, including cost attribution, budgeting, forecasting, and alerting in cloud environments * Demonstrated ability to set technical direction, drive cross-functional alignment (Engineering, Finance, Security), and make sound architectural decisions in complex environments#### While not required, its an added plus if you also have: * Experience designing or evolving company-wide observability standards, shared libraries, and agent/operator-based integrations * Background in cost optimization for infrastructure or observability tooling, including vendor negotiations and usage modeling * Experience applying AI or machine learning techniques to anomaly detection, root cause analysis, or operational automation * Familiarity with OpenTelemetry and modern instrumentation frameworks across multiple programming languages * Experience scaling and mentoring high-performing engineering teams through platform expansion or significant architectural changeAt Figma, one of our values is Grow as you go. We believe in hiring smart, curious people who are excited to learn and develop their skills. If youre excited about this role but your past experience doesnt align perfectly with the points outlined in the job description, we encourage you to apply anyways. You may be just the right candidate for this or other