Site Reliability Engineer (Infrastructure)

What you do:

  • Manage multiple Kubernetes clusters and container workloads.
  • Troubleshoot infrastructure issues, identify their root causes, and fix them.
  • Implement open source infrastructure stacks, specifically RabbitMQ, Redis, Elasticsearch, Thanos, and ELK.
  • Plan capacity and scale infrastructure resources.

What you need to succeed in this role:

  • Good understanding of Linux and network administration, applied to troubleshooting.
  • Experience automating infrastructure provisioning with tools such as Ansible and Terraform.
  • In-depth, hands-on experience implementing Kubernetes.
  • Ability to debug issues with Kubernetes clusters and complex applications hosted on k8s.


It would be great if you have:

  • Hands-on experience with Linux debugging and performance tuning.
  • Good understanding of a cloud platform (AWS or Google Cloud).
  • Experience with monitoring, logging, and tracing systems such as ELK, Prometheus, Thanos, Vector, and OpenTelemetry.
  • Experience designing and managing MongoDB and MySQL databases.
  • Experience writing software in one or more languages such as Go, Java, or Python.
  • Knowledge of security.

Benefits & Perks:

  • Flexible hours that suit your pace
  • Breakfast and lunch provided
  • Work from home or anywhere
  • Barista service available daily
  • MacBook provided for everyone
  • After-work beer available daily
  • No cap on your annual leave
  • Yearly company outing
Copyright © 2021 LINE MAN Wongnai. All rights reserved.