• Managed Kubernetes clusters to orchestrate containerized applications, ensuring scalability, resilience, and efficient resource utilization.
• Implemented and maintained Azure Kubernetes Service (AKS) and Azure Container Instances for containerized deployments, optimizing infrastructure for real-time data processing.
• Utilized MongoDB for efficient storage and retrieval of structured and unstructured data, ensuring seamless integration with real-time data aggregation systems.
• Employed Podman and Docker for container management and deployment, streamlining the packaging and distribution of applications across various environments.
• Automated deployment and configuration tasks using Ansible, enhancing efficiency and consistency in system setup and maintenance processes.
• Implemented monitoring solutions such as Prometheus and Grafana to visualize system performance metrics and to detect and resolve issues proactively.
• Conducted root cause analysis (RCA) and maintained comprehensive documentation to support continuous improvement and prevent recurring incidents.
• Managed incident response processes, ensuring timely resolution of issues impacting data aggregation, transmission, and RTOC applications.
• Logged and tracked issues and enhancement requests to drive continuous optimization of systems and processes.
• Provided technical support to RTOC and Company end users, troubleshooting and resolving hardware, software, and network issues.
• Collaborated closely with the Company IT department to resolve hardware, software, and network challenges and to share knowledge.
• Supported 24/7 operations by working 12-hour shifts in a shift-based coverage model, ensuring continuous support and availability of critical systems.