Role and responsibilities:

  • Production Management for Kafka on Kubernetes and MQ plants
  • Deep-diving into complex troubleshoots, implementing changes, and serving as an escalation point for the Level 2 teams
  • Creating and maintaining best practices / policies and ensuring that they are followed
  • Work with external vendors and internal Project Managers to develop the execution plans for changes
  • Participate in the weekly review meetings and actively engage with various engineering teams to review the infrastructure
  • Actively participate in the weekly change management meetings to explain the changes and answer any questions from the change board
  • Incident and problem management
  • Build automation to replace manual tasks with tools
  • Build tools to enable self-service operations by application users and L 1/2 staff to reduce the overhead involved in day to day operations