Senior/Principal DevOps

Aghanim
Lisbon

Job details

Full-time

Full job description

Aghanim is an integrated commerce, liveops automation, community engagement, and payments platform for video games.

Mobile games have traditionally depended on app stores for distribution, payments, and player relationships. We believe there is a better way. Aghanim helps game studios build direct relationships with players, sell directly, and build their future on their own terms. Today, more than 100 games worldwide are already building this future with Aghanim.

Our team brings together people across Los Angeles, New York, Seoul, Beijing, London, Lisbon, Belgrade and other locations around the globe, with deep expertise in gaming, fintech and technology. We move quickly, keep communication direct, and focus on getting things done. We believe the best people thrive when they have autonomy, ownership, and a stake in the company's success.

We’re looking for a Senior/Principal DevOps engineer to own our cloud-only platform and keep it reliable under high load and bursty traffic. Our services run entirely on GCP, fronted by Cloudflare, with deep observability in Datadog and CI/CD in GitHub Actions.

This is a hands-on role with real ownership: ensuring we meet our SLAs/SLOs, scaling fast (10–50×), and keeping infrastructure efficient and cost-conscious as the company grows and microservices multiply.

Key Responsibilities

Platform Reliability and Scalability

Own and evolve production infrastructure on GCP and Cloudflare
Ensure platform availability, performance, and reliability in line with SLA/SLO targets
Design and operate systems capable of handling rapid traffic spikes and high-load scenarios
Drive capacity planning, autoscaling, bottleneck removal, and resilience improvements

Kubernetes and Infrastructure

Build and maintain Infrastructure as Code using Terraform (and Terragrunt where applicable)
Own Kubernetes operations on GKE, including upgrades, scaling, and operational hardening
Maintain platform tooling, deployment infrastructure, and operational standards

Observability and Incident Management

Build and improve observability using Datadog, including monitoring, logging, and APM
Lead incident response, root cause analysis, and long-term reliability improvements
Reduce operational noise and improve detection of critical system issues

Security and Delivery

Operate core security tooling and coordinate remediation efforts across teams
Improve deployment reliability and developer experience through CI/CD automation and platform guardrails

Cost Optimization

Own infrastructure cost visibility, optimization, and FinOps initiatives

Required Qualifications

5+ years of experience in DevOps, SRE, or Platform Engineering roles
Proven experience operating production SaaS infrastructure with defined reliability targets and SLA/SLO ownership
Strong expertise in GCP, Kubernetes (GKE), and Infrastructure as Code using Terraform
Experience with Cloudflare, observability platforms (Datadog or similar), and modern CI/CD practices
Strong automation and scripting skills with a reliability-first mindset
Experience designing and operating scalable, high-availability distributed systems
Familiarity with SOC 2 / PCI-DSS audits and security architecture requirements

Preferred Qualifications

Experience in game development or similarly bursty, high-load consumer products
Mature SRE practices: error budgets, on-call maturity, runbooks, proactive incident prevention

Why Join Us

World-class team – work alongside experienced professionals from around the globe who have built products used by millions of players
High growth, high impact – be part of a fast-growing company where ideas turn into products and reach customers in days, not months
Autonomy and ownership – we trust people to make decisions, take initiative, and drive results
Modern tools and technology – use AI, automation, and modern tools as part of your everyday work
Equity – participate in the company's growth and long-term success
