Salary Range
SGD 60,000 - SGD 114,000/year
SGD 5,000 - SGD 9,500/month
Skills Required
Job Description
Job Title: Game Operations Engineer / Game SRE
【Responsibilities】
1. Game Operations & Assurance
Lifecycle Management: Responsible for the full lifecycle management of game servers, including resource planning, environment provisioning, version deployment/patching, server merges, auto-scaling, and daily maintenance.
Stability & Monitoring: Build and refine the monitoring and alerting systems for game services (covering both system-level and business-level metrics). Ensure rapid response to production incidents to guarantee 24/7 high availability and stability of the game.
2. Cloud-Native Architecture & Automation
Infrastructure as Code (IaC): Manage and maintain infrastructure on AWS or GCP platforms using IaC tools such as Terraform.
Automation & Containerization: Drive the containerization process (Docker/K8s) for game services. Develop Ansible playbooks or scripts in Python/Go/Shell to standardize and automate operational workflows.
3. Optimization & Troubleshooting
Performance Tuning: Gain a deep understanding of game business logic; collaborate with the R&D team to optimize system architecture and resolve production performance bottlenecks.
Incident Management: Lead the troubleshooting, post-mortem analysis, and remediation of production incidents. Produce technical documentation to build and maintain the operations knowledge base.
【Requirements】
1. Infrastructure & Cloud Platforms
Linux & Networking: Deep understanding of Linux system internals and network protocols (TCP/IP, HTTP/HTTPS).
Cloud Expertise: Familiarity with AWS or GCP public cloud services (EC2/GCE, S3/GCS, VPC, IAM, etc.). Experience with multi-cloud operations is a plus.
2. Containerization & Automation
K8s Expertise: Proficient in the Kubernetes (K8s) and Docker ecosystems, with proven experience maintaining large-scale clusters in production.
Automation Tools: Skilled in using automation tools such as Ansible and Terraform.
Scripting/Coding: Strong scripting capabilities. Proficient in at least one language among Python, Shell, or Go. Capable of independently developing internal operations tools or platforms.
3. Monitoring & Engineering Best Practices
Observability: Familiar with mainstream monitoring and logging stacks (Prometheus, Grafana, ELK, Zabbix, etc.), with experience in designing custom monitoring metrics.
DevOps Mindset: Some development background (Java/Go/Node.js), with the ability to assist in code-level troubleshooting or to develop simple utilities.
CI/CD: Value engineering quality; familiar with CI/CD methodologies and related tools (Jenkins, GitLab CI, etc.).
4. Bilingual in Chinese and English
Please send your resume and cover letter to [email protected]