Job Information
NTT America, Inc. Unix & Linux - Systems Administration - Hybrid / Partially Client Onsite in Montreal, Quebec
Req ID: 369180
NTT DATA strives to hire exceptional, innovative and passionate individuals who want to grow with us. If you want to be part of an inclusive, adaptable, and forward-thinking organization, apply now.
We are currently seeking a Unix & Linux - Systems Administration - Hybrid / Partially Client Onsite to join our team in Montreal, Quebec (CA-QC), Canada (CA).
Day to Day / Job Function
The role is critical to our day-to-day incident management function with primary responsibilities for:
Diagnosis and resolution of immediate production impacting issues in the compute and storage plant
Working with other infrastructure teams including networking, database administration and hosted solution teams for outage resolution, as well as customers aligned with the business users of our plant to determine scope, impact, and appropriate resolution path
Carry out proactive health and hygiene tasks to maintain operational stability and compliance for risk & control programs to ensure the production environment is not put at risk
Collaborate with engineering teams to test and certify new hardware and software products
Occasional weekend project work responsibilities to on-board new UNIX assets for growth or large programs such as new datacenter build outs
Required Skills:
5 to 7 years of experience in a similar role
Must have strong knowledge and experience with Linux, preferably RedHat, and/or any other Linux distributions
Strong knowledge and experience of various services (i.e. DNS, DHCP, NTP, Kerberos, SSHD, PXE, SFTP, HTTPD, Docker, etc.)
Knowledge of various enterprise server hardware models (blades, rackmount, standalone) networking, routers and switches
Must be able to read, understand and write intermediate to complex scripts using KSH, Bash, Perl, Python, etc.
Good understanding and workings of configuration management tools, RedHat Satellite servers, Quattor, Puppet, Chef, etc.
Good knowldege and understanding of Clustering, Virtualization, NAS, NFS and SAN
Excellent communication and written skills. Being able to explain technical problems to non-technical audience
Available for on-call (1 week out of ever 4-6 weeks), rotated weekly within the team, and become a point person for any production issues
Ability to work in a global distributed team
Nice to have:
Experience with troubleshooting incidents involving compute resources, network problems, remote storage related problems (SAN, NAS), etc.
Experience with analyzing and diagnosing kernel crash/core dumps, network packet captures and identifying the root cause of problems
Sound knowledge of networking, TCP/IP, Layer 2/3 network design, firewall, switches and routers, etc.
Experience working in a DevOps environment
Knowlege and experience with various server hardware models and vendors (i.e. IBM, Dell, HP, etc.)
Ability to identify performance bottlenecks and tune the system parameters to provide more throughput
Good understanding and knowledge of Load Balancing, High Availability and BC
#LI-NorthAmerica
Fonctions quotidiennes / Responsabilités principales
Ce rôle est essentiel à notre fonction quotidienne de gestion des incidents, avec les responsabilités principales suivantes :
Diagnostic et résolution des problèmes de production immédiats ayant un impact sur l’infrastructure de calcul et de stockage
Collaboration avec les autres équipes d’infrastructure, notamment les équipes réseau, administration de bases de données et solutions hébergées, pour la résolution des pannes, ainsi qu’avec les clients et les utilisateurs métiers de notre infrastructure afin de déterminer la portée, l’impact et la solution appropriée
Mise en œuvre de mesures proactives de maintenance et de sécurité pour maintenir la stabilité opérationnelle et la conformité aux programmes de gestion des risques et de contrôle, afin de garantir la sécurité de l’environnement de production
Collaboration avec les équipes d’ingénierie pour tester et certifier les nouveaux produits matériels et logiciels
Participation occasionnelle à des projets le week-end, notamment l’intégration de nouveaux équipements UNIX pour la croissance ou les grands projets tels que la construction de nouveaux centres de données
Compétences requises :
5 à 7 ans d’expérience dans un poste similaire
Excellente connaissance et expérience de Linux, de préférence Red Hat, et/ou d’autres distributions Linux
Excellente connaissance et expérience de divers services (DNS, DHCP, NTP, Kerberos, SSHD, etc.) PXE, SFTP, HTTPD, Docker, etc.
Connaissance des différents modèles de serveurs d'entreprise (lames, racks, serveurs autonomes), des réseaux, routeurs et commutateurs.
Maîtrise de la lecture, de la compréhension et de l'écriture de scripts de niveau intermédiaire à complexe en KSH, Bash, Perl, Python, etc.
Bonne compréhension et fonctionnement des outils de gestion de configuration, des serveurs Red Hat Satellite, de Quattor, de Puppet, de Chef, etc.
Bonne connaissance et compréhension du clustering, de la virtualisation, du NAS, du NFS et du SAN.
Excellentes compétences en communication écrite et orale. Capacité à expliquer des problèmes techniques à un public non technique
Disponibilité pour les astreintes (1 semaine toutes les 4 à 6 semaines), rotation hebdomadaire au sein de l'équipe, et rôle de référent pour tout problème de production
Capacité à travailler au sein d'une équipe internationale et distribuée
Atouts :
Expérience du dépannage d'incidents liés aux ressources de calcul, aux problèmes réseau, aux problèmes de stockage distant (SAN, NAS), etc.
Expérience de l'analyse et du diagnostic des plantages du noyau/vidages mémoire, des captures de paquets réseau et de l'identification de la cause racine des problèmes
Solides connaissances en réseaux, TCP/IP, architecture réseau de couche 2/3, pare-feu, commutateurs et routeurs, etc.
Expérience en environnement DevOps
Connaissance et expérience des différents modèles et fournisseurs de serveurs (IBM, Dell, HP, etc.)
Capacité à identifier les goulots d'étranglement des performances et à optimiser les paramètres système pour améliorer le débit
Bonne compréhension et connaissance de l'équilibrage de charge, de la haute disponibilité et de la continuité d'activité
About NTT DATA
NTT DATA is a $30 billion business and technology services leader, serving 75% of the Fortune Global 100. We are committed to accelerating client success and positively impacting society through responsible innovation. We are one of the world's leading AI and digital infrastructure providers, with unmatched capabilities in enterprise-scale AI, cloud, security, connectivity, data centers and application services. our consulting and Industry solutions help organizations and society move confidently and sustainably into the digital future. As a Global Top Employer, we have experts in more than 50 countries. We also offer clients access to a robust ecosystem of innovation centers as well as established and start-up partners. NTT DATA is a part of NTT Group, which invests over $3 billion each year in R&D.
Whenever possible, we hire locally to NTT DATA offices or client sites. This ensures we can provide timely and effective support tailored to each client’s needs. While many positions offer remote or hybrid work options, these arrangements are subject to change based on client requirements. For employees near an NTT DATA office or client site, in-office attendance may be required for meetings or events, depending on business needs. At NTT DATA, we are committed to staying flexible and meeting the evolving needs of both our clients and employees. NTT DATA recruiters will never ask for payment or banking information and will only use @nttdata.com and @talent.nttdataservices.com email addresses. If you are requested to provide payment or disclose banking information, please submit a contact us form, https://us.nttdata.com/en/contact-us .
NTT DATA endeavors to make https://us.nttdata.com accessible to any and all users. If you would like to contact us regarding the accessibility of our website or need assistance completing the application process, please contact us at https://us.nttdata.com/en/contact-us . This contact information is for accommodation requests only and cannot be used to inquire about the status of applications. NTT DATA is an equal opportunity employer. Qualified applicants will receive consideration for employment without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, disability or protected veteran status. For our EEO Policy Statement, please click here (http://us.nttdata.com/en/compliance#eeos) . If you'd like more information on your EEO rights under the law, please click here (http://us.nttdata.com/en/compliance#know-your-rights) . For Pay Transparency information, please click here (http://us.nttdata.com/en/compliance#ppnp) .