About the job
Job Title: DevOps Engineer
Employment Type: Full-time
Location: Remote
Shift: Night Shift
About the Role
Join our dynamic Hadoop/Big Data Infrastructure Team as a talented DevOps Engineer. In this role, you will manage and support expansive distributed systems while automating infrastructure operations across our global data centers. Your contributions will be crucial in ensuring the reliability, scalability, and performance of over 5000 compute, storage, and distributed systems.
Key Responsibilities
Deliver operational support for Hadoop clusters and associated Big Data tools including HBase, Scylla, Hive/Hue, Kafka, and Storm.
Automate deployment processes and operations to improve efficiency and reliability.
Execute upgrades for Hadoop and related tools as necessary.
Enhance configurations using management tools such as Puppet and InfraDB (our proprietary Ruby on Rails CM Database).
Coordinate deployments and services with Ansible.
Establish continuous integration processes adhering to change management protocols utilizing Git and Gerrit.
Create monitoring and alerting scripts with Python and Bash/Shell.
Oversee system health using tools like Nagios, Graphite, Grafana, OpenTSDB, and Prometheus.
Diagnose and optimize system and service performance.
Collaborate with various teams including Data Center Operations, NOC, Network, Database, and Engineering.
Track and resolve incidents, conduct root cause analysis, and maintain comprehensive runbooks and documentation.
