Manage large scale multi-nodes Kafka cluster environments residing on on-perm and AWS.
Handle all Kafka environment builds, including design, capacity planning, cluster setup, performance tuning and ongoing monitoring.
High availability cluster setup, maintenance, and ongoing support
Hands-on experience with confluent Kafka clusters and control center hosted on on-prem and Amazon cloud is a plus.
Perform high-level, day-to-day operational maintenance, support, and upgrades for the Kafka Cluster.
Creation of key performance metrics, measuring the utilization, performance, and overall health of the cluster.
Capacity planning and implementation of new/upgraded hardware and software releases as well as for storage infrastructure.
Research and recommend innovative, and where possible, automated approaches for system administration tasks using Ansible and Chef.
Create topics, setup redundancy cluster, deploy monitoring tools, alerts using New Relic and control center.
Proactively monitor and setup alerting mechanism for Kafka Cluster and supporting hardware to ensure system health and maximum availability
Experience of Kafka Producer and Consumer Microservices concepts and Kafka distributed Architecture
Knowledge of best practices related to security, performance, and disaster recovery.
Good hands-on experience on Red hat Linux enterprise System administration and trouble shooting.
Understanding of high-availability and patching production systems while minimizing downtime
Experience in RHEL Patching and kernel upgrade
Develop and maintain Ansible playbooks to patch Linux enterprise System
Bachelors
B.Tech
Kafka,
IT-Software- Software services