SalaryPeak

SeniorSite Reliability Engineer (SRE)

ADVANCED E-SOLUTIONS PTE. LTD.
Singapore 5+ years Posted 3w ago

Salary Range

SGD 84,000 - SGD 102,000 /year

SGD 7,000 - SGD 8,500/month

Skills Required

Incident ResponseDevOpsIAMAutomation Systems MaintenanceLoad BalancingReplicationCloud ServicesDockerCapacity PlanningDatabasesApplication DeploymentProxy

Job Description

POSITION OVERVIEW : Software Development Senior Specialist


POSITION GENERAL DUTIES AND TASKS :
Role Summary

The Senior Site Reliability Engineer (L7) is a hands‑on technical engineeringrole responsible for building, automating, scaling, and maintaining highlyreliable, secure, and resilient cloud and hybrid infrastructure platforms.
The role focuses on cloud infrastructure engineering, container orchestration,Infrastructure‑as‑Code (IaC), observability, incident response, and platform‑levelapplication development.

Key Responsibilities

Deploy, configure, and maintain AWS resources including EC2, ECS, EKS, VPC,IAM, NAT, and networking components.
• Build secure and scalable cloud networking (VPCs, subnets, routing, VPN,firewalls).
• Work with load balancers, reverse proxies, API gateways, DNS management, andnetwork routing.• Build CI/CD pipelines using Jenkins, GitLab CI, or GitHubActions.
• Support application releases and coordinate deployments across environments.
• Implement logging/monitoring using Prometheus, Grafana, Datadog, Splunk, orCloudWatch.
• Participate in incident response, troubleshooting, on-call rotation, andpost-incident RCA.
• Perform system performance tuning, patching, capacity planning, andoptimization.
• Improve system reliability through automation, redundancy, and engineeringbest practices.
• Implement and maintain IaC using Terraform or CloudFormation.
• Automate provisioning, configuration, and environment setup using scripting(Python, Bash, Go).
• Develop reusable automation modules, templates, pipelines, and cloudengineering patterns.
• Build, deploy, and manage containerized applications using Docker.
• Operate and optimize Kubernetes clusters (EKS or on‑prem).
• Implement autoscaling, service mesh, pod security, and workload monitoring.
• Develop automation services, internal tooling, and platform utilities usingCore Java, Spring Boot, Quartz, and Erlang.
• Build wrappers/services for IBM MQ and RabbitMQ messaging flows.
• Create schedulers, orchestration components, and internal micro‑services foroperational tasks.
• Write integrations, connectors, and event-driven components forinfra-automation.
• Build custom alerts, webhook handlers, log processors, and reliabilitytooling.





Technologies / Tools:

Operating Systems & Virtualization

Enterprise Linux, VMware, OVM, X86 server clusters

Containerization & Orchestration

Kubernetes, Docker

Application Development (Platform)

Core Java1.8, Spring, Spring Boot, Quartz, Erlang

Messaging Platforms

IBM MQ, RabbitMQ, Erlang/Mnesia

IaC & Automation

Terraform, Ansible, CloudFormation, Chef

Scripting Languages

Python, Go, Bash

CI/CD Tooling

Jenkins, GitLab CI, GitHub Actions

Observability & Logging

Prometheus, Grafana, Datadog, Splunk

Databases & Storage

Oracle, HA DB clusters, NFS, HPE Nimble, DataDomain

Load Balancing & Networking

F5 LTM/ASM/ASR, DNS, network routing, proxies

File Transfer & Directory Services

GoAnywhere, Tivoli Directory Server

Cloud Platforms

AWS, Azure, GCP

Security Technologies

Hardware Security Modules (Payshield or equivalent)

Experience Requirements:

5+ years of experience as an SRE, DevOps Engineer, Cloud Engineer, or PlatformEngineer.
Strong hands‑on expertise with AWS cloud services (EC2, ECS, EKS).
Practical experience with IaC tools such as Terraform and CloudFormation.
Deep working knowledge of Kubernetes, Docker, cloud networking, load balancers,and proxies.
Hands‑on experience with CI/CD pipelines, release engineering, observabilitytooling, and monitoring stacks.
Experience supporting databases including partitioning, replication, sharding,and high availability setups.
Prior involvement in incident response, production support, and reliabilityengineering practices.

Desirable:
Good Understanding on infrastructure, F5, network
knowledge of ISO20022, ISO8583, and Swift MT formats
Experience in shell scripting, Python.
Experience within payments processing systems or finance/banking industry.
Experience in supporting applications using different languages and/orcharacter sets.