Job Description
We are seeking a Senior Data Center Infra Engineer to support large-scale infrastructure operations and drive automation and efficiency across multi-cloud environments. This role is ideal for professionals with deep technical expertise and a passion for operational excellence.
Key Responsibilities:
- Act as a Subject Matter Expert (SME) for infrastructure operations, sharing best practices and conducting root-cause analyses for high-impact incidents.
- Lead incident management, coordinate responses, and maintain proactive communication during outages and production issues.
- Facilitate regular stakeholder meetings to provide updates, clarify concerns, and gather feedback.
- Analyze operational metrics to support data-driven decisions and continuous improvement.
- Develop and enhance operational tools and automation solutions to reduce manual effort.
- Document operational procedures, configurations, and environment setups.
- Identify and eliminate operational toil through process optimization and automation.
- Mentor junior engineers across various technical domains.
- Participate in a 24x7 shift rotation.
Qualifications:
- Bachelor’s degree in Information Technology, Engineering, or a related field.
- At least 5 years of experience supporting high-availability production environments with a focus on automation and operational improvements.
- Proficiency in at least one or two tools in each of the following domains:
  - Linux Systems Administration: RHEL, CentOS, Ubuntu, or a similar Unix-like OS
  - Version Control: Git, GitHub, GitLab
  - Networking: Core networking principles, load balancing, reverse proxies (e.g., Nginx), SDN, DHCP, DNS
Preferred Qualifications:
- Relevant certifications (e.g., CKA, CKAD, or an AWS certification).
- Experience working in cross-functional teams using modern DevOps practices.
- Strong background in automation using Bash, Python, or similar scripting languages to improve reliability and reduce manual tasks.