We are seeking a Senior Data Center Infra Engineer to support large-scale infrastructure operations and drive automation and efficiency across multi-cloud environments. This role is ideal for professionals with deep technical expertise and a passion for operational excellence.
Key Responsibilities:
• Act as a Subject Matter Expert (SME) for infrastructure operations, sharing best practices and conducting root-cause analyses for high-impact incidents.
• Lead incident management, coordinate responses, and maintain proactive communication during outages and production issues.
• Facilitate regular stakeholder meetings to provide updates, clarify concerns, and gather feedback.
• Analyze operational metrics to support data-driven decisions and continuous improvement.
• Develop and enhance operational tools and automation solutions to reduce manual effort.
• Document operational procedures, configurations, and environment setups.
• Identify and eliminate operational toil through process optimization and automation.
• Mentor junior engineers across various technical domains.
• Participate in a 24x7 shifting rotation.
Qualifications:
• Bachelor’s degree in Information Technology, Engineering, or a related field.
• At least 5 years of experience supporting high-availability production environments with a focus on automation and operational improvements.
• Proficiency in at least 1–2 tools per domain:
o Linux Systems Administration: RHEL, CentOS, Ubuntu, or similar Unix- based OS
o Version Control: Git, GitHub, GitLab
o Networking: Core networking principles, load balancing, reverse proxies (e.g., Nginx), SDN, DHCP, DNS
Preferred Qualifications:
• Relevant certifications (e.g., CKA, CKAD, AWS Certified).
• Experience working in cross-functional teams using modern DevOps practices.
• Strong background in automation using Bash, Python, or similar scripting languages to improve reliability and reduce manual tasks.