OneMain Financial Jobs

Job Information

Meta Data Center Network Engineer, AI Repair in Richmond, Virginia

Summary:

Are you passionate about cutting-edge technology and its implementation on a global scale? This exciting role within the Edge and Network Services (ENS) Foundation team offers a unique opportunity to tackle the challenges of introducing innovative compute and networking technologies across Meta's global data centers. You will collaborate with cross-functional teams such as Production AI engineers, Network and Hardware Engineering, Data Center Connectivity, Facility Engineering/Operations, and SiteOps to execute and support ENS's repair and operational support of the largest AI clusters. This collaboration ensures that new network technologies can be deployed and managed at scale. In this role, your focus will be on network hardware within integrated AI system rack infrastructure and related IP systems. You will ensure that ENS operations processes and tooling are well-defined and executable for these new technologies.

Required Skills:

Data Center Network Engineer, AI Repair Responsibilities:

  1. Work cross functionally to maintain AI and DC network health while leading long term initiatives to drive for better repair and greater efficiencies

  2. Contribute to organizational level strategy and establish team roadmaps and goals that align with current business priorities and organizational strategy

  3. Accountable for driving improvements in technical references, NPI process, and deployment/operations documentation standards in support of continuous improvement initiatives

  4. Facilitate clear communication of technical requirements, risks, and escalations to leadership and cross-functional partners

  5. Integrate new networking technologies into ENS operations processes to efficiently scale Meta’s AI, Compute, and Network capabilities

  6. Develop new operational support models for deploying and operating new data center infrastructure

  7. Influence design of data center, network, server, and applications to ensure seamless integration

  8. Publish technical reference, process, and training documentation for a global network deployment and operations teams

  9. Build and nurture business relationships with key stakeholders, partners, and vendors

  10. 15 to 20% travel required based on project demand

Minimum Qualifications:

Minimum Qualifications:

  1. Bachelor's degree in Computer Science, Computer Engineering, relevant technical field, or equivalent practical experience

  2. 10+ years of work experience with designing and deploying large-scale data center network infrastructure

  3. Experience with data center design, structured cabling, and fiber optic network infrastructure

  4. Demonstrated knowledge of NICs, optical transceivers, AOC, and DAC for high-speed interconnects

  5. Demonstrated knowledge of TCP, IPv4/6, Routing Protocols, and related network services (DHCP, DNS)

  6. Experience with implementing tooling and automation for network configuration and monitoring

  7. Track record of solving complex problems, executing tactically, and delivering on infrastructure projects

  8. Experience to work independently, stay organized, multitask, prioritize, and communicate effectively

Preferred Qualifications:

Preferred Qualifications:

  1. Demonstrated experience working with scaled AI network solutions for training and inference use cases

  2. Operating HPC/AI systems across global locations

Public Compensation:

$193,000/year to $271,000/year + bonus + equity + benefits

Industry: Internet

Equal Opportunity:

Meta is proud to be an Equal Employment Opportunity and Affirmative Action employer. We do not discriminate based upon race, religion, color, national origin, sex (including pregnancy, childbirth, or related medical conditions), sexual orientation, gender, gender identity, gender expression, transgender status, sexual stereotypes, age, status as a protected veteran, status as an individual with a disability, or other applicable legally protected characteristics. We also consider qualified applicants with criminal histories, consistent with applicable federal, state and local law. Meta participates in the E-Verify program in certain locations, as required by law. Please note that Meta may leverage artificial intelligence and machine learning technologies in connection with applications for employment.

Meta is committed to providing reasonable accommodations for candidates with disabilities in our recruiting process. If you need any assistance or accommodations due to a disability, please let us know at accommodations-ext@meta.com.

DirectEmployers