Job Description
Role Overview We’re looking for a highly analytical and performance-driven L3 Application Support Engineer with deep expertise in SRE, DevOps, and enterprise infrastructure. The role focuses on ensuring high availability for critical banking/corporate systems, driving automation, and leading complex incident resolution with minimal business impact. Key Responsibilities System Reliability & Incident Management Provide L3 application support for 24/7 production environments with strict SLA adherence Rapid triage, root cause analysis, and resolution of high-severity incidents for banking/corporate apps Lead DR drills, post-mortems, and implement resilience improvements Proactive health checks, patching, and performance monitoring to prevent outages Manage CI/CD pipelines using Jenkins/GitLab, Bitbucket, and Azure Automation Runbooks Implement Infrastructure as Code and automated deployments across Dev, UAT, Prod Build scripts in PowerShell, Bash, Python, KQL, T-SQL to reduce manual toil and streamline operations Track EOS/EOL components and drive technical upgrades for compliance Platform & Database Administration Administer Azure SQL PaaS/IaaS, PostgreSQL, MongoDB, Redis, Elasticsearch, MariaDB Manage Windows Server, RHEL8/9, OpenShift, and AWS S3 workloads Configure monitoring and dashboards using Azure Log Analytics, ELK Stack, Grafana, OAPM Security & Collaboration Remediate vulnerabilities using Microsoft Defender for Cloud/SQL and ensure compliance Partner with Dev, Security, Compliance, and Vendor teams for long-term fixes Own change/release management and maintain environment consistency Create runbooks, SOPs, and knowledge transfer docs for L1/L2 teams Required Skills & Qualifications Cloud & Infrastructure: Azure Portal, Azure SQL, Logic Apps, VM Mgmt, RHEL8/9, OpenShift, AWS S3 Databases: SQL Server 2005-2019, MongoDB, Redis, Elasticsearch, PostgreSQL, MariaDB DevOps/Tools: Jenkins/GitLab CI/CD, Bitbucket, Jira, Confluence, ELK, Grafana Automation: PowerShel