We are seeking a highly skilled and motivated Sr DevOps Engineer to join our growing team. The ideal candidate will have strong problem-solving abilities, proficiency in Infrastructure as Code (IaC) using Terraform and Ansible, experience in automating GitLab pipelines, experience in platform Engineering, and experience on building scalable platforms from scratch. This role requires a deep understanding of IaC code automation, infrastructure, advanced level Linux Troubleshooting skills, Kubernetes hands-on experience, networking, AWS, and Azure cloud services, and development skills in automation with shell & python.
Key Responsibilities:
- Design, Develop, and Implement IaC: Create and maintain Infrastructure as Code using Terraform and Ansible to ensure efficient and reliable deployment of resources.
- Platform Engineering: Hands-on experience in building platforms from scratch and building operational excellence.
- Pipeline Automation: Develop and manage GitLab CI/CD pipelines to automate the build, test, and deployment processes, ensuring high-quality software delivery.
- Cloud Management: Architect, deploy, and manage scalable, secure, and highly available infrastructure on AWS, Azure or on-prem BareMetal/VMs
- Scripting and Automation: Write and maintain shell and Python scripts to automate routine tasks, improve system efficiency, and support operational processes.
- Compliance Assurance: Ensure that infrastructure and deployments comply with industry standards and regulations, implementing necessary controls and documentation to maintain compliance
- Collaboration and Support: Work closely with development, QA, and operations teams to troubleshoot issues, optimize performance, and ensure seamless integration and deployment.
- Monitoring and Optimization: Implement and maintain monitoring solutions to ensure system health and performance, and proactively address potential issues.
- Documentation and Best Practices: Document processes, configurations, and procedures, and promote best practices in infrastructure and deployment management.
- BCP / DR: Business continuity and DR plan for platform contingency
Continuous Improvement: Stay updated with industry trends, tools, and technologies to continuously improve the DevOps practices and infrastructure.
Required Skills and Qualifications:
- Problem-Solving Skills: Strong analytical and problem-solving skills to efficiently identify, diagnose, and resolve technical issues.
- IaC Proficiency: Hands-on experience with Infrastructure as Code tools such as Terraform and Ansible.
- CI/CD Expertise: Extensive experience with GitLab CI/CD pipeline automation.
- Cloud Platforms: Proficient in managing AWS and Azure cloud environments, including services like EC2, S3, RDS, Azure VMs, Azure Blob Storage, etc.
- Scripting Languages: Proficiency in shell scripting and Python for automation and system management tasks.
- Self-motivated and Collaborative: Highly self-motivated with a strong collaborative mindset to work effectively in a team-oriented environment.
- Communication Skills: Excellent verbal and written communication skills to articulate technical concepts and solutions clearly.
Good to have:
- Gitlab CI/CD
- OpenShift Kubernetes
- Mentioning platforms like Grafana & Prometheus