Inworld AI Logo

Inworld AI

Staff Cloud DevOps/Site Reliability Engineer (SRE) - Canada

Posted 24 Days Ago
Be an Early Applicant
Vancouver, BC
Senior level
Vancouver, BC
Senior level
The Staff Cloud DevOps/Site Reliability Engineer will manage the infrastructure, DevOps, and Site Reliability of the platform, maintain Infrastructure-as-Code using Terraform, orchestrate CI/CD pipelines with tools like GitHub Actions and ArgoCD, and ensure the scalability of microservices through Kubernetes administration. The role includes monitoring the availability and health of services and managing incident responses.
The summary above was generated by AI

view open roles

Why Join Inworld

Inworld is the best-funded startup in AI and games with a $500 million valuation and backing from top tier investors including Intel Capital, Microsoft’s M12 fund, Lightspeed Venture Partners, Section 32, BITKRAFT Ventures, Kleiner Perkins, Founders Fund, and First Spark Ventures.

Inworld is the leading AI engine for games and interactive media. Inworld’s suite of AI components enables developers to build interactive, responsive, and personalized AI gaming experiences, orchestrate models to create intelligent game behaviors, and unlock enhanced productivity with AI-generated content. Inworld powers experiences built by Ubisoft, NVIDIA, Niantic, NetEase Games and LG, among others, and has partnerships with key industry players such as Microsoft Xbox, Epic Games, and Unity. 

Inworld was recognized by CB Insights as one of the 100 most promising AI companies in the world in 2024 and was also named among LinkedIn's Top Startups of 2024 in the USA.

Our Technical Operations team manages the infrastructure, DevOps, and Site Reliability of our platform. We are looking for a Staff Cloud DevOps/Site Reliability Engineer to join our team. 

Qualifications

  • Bachelor's degree in Computer Science, Engineering, or a related field
  • 7+ years of experience as a DevOps, Infrastructure, Operations, or Site Reliability Engineer (or as a software engineer with relevant experience).
  • At least 2 years experience each with:
    • Terraform
    • Helm
    • Kubernetes
    • AWS, Azure, or GCP
    • CI/CD using modern tools (GitOps)
  • Optional (not required but considered a plus):
    • MLOps (building, orchestrating, and maintaining Machine Learning Pipelines)
    • Prometheus / Grafana
    • Multi-cloud deployments (2 or more)
    • ArgoCD
    • Network management and VPNs

Responsibilities

  • Infrastructure: Maintain and contribute to Infrastructure-as-Code (Terraform)
  • DevOps and CI/CD Pipelines: Orchestrate pipelines using Github Actions, Helm, ArgoCD
  • Microservices scalability: Kubernetes Administration
  • Cloud Administration
  • Site Reliability: Measure and monitor availability, latency, and overall service health, drive incident management and post-mortem analysis

Work location: British Columbia, Canada.

The base salary range for this full-time position is CAD $170,000 - $220,000. In addition to base pay, total compensation includes bonus, equity and benefits. Within the range, individual pay is determined by work location and additional factors, including competencies and experience.


Inworld Jobs Privacy

Top Skills

AWS
Azure
GCP
Helm
Kubernetes
Terraform

Similar Jobs

Be an Early Applicant
9 Days Ago
Vancouver, BC, CAN
Hybrid
6,500 Employees
Senior level
6,500 Employees
Senior level
Gaming • Information Technology • Mobile • Software
The Senior Site Reliability Engineer will support infrastructure, monitoring, and tooling needs. Responsibilities include developing automated scalable cloud infrastructure, monitoring systems, diagnosing technical issues, and enhancing CI/CD pipelines. The SRE will collaborate with engineers to maintain highly available services across various technologies, while participating in an on-call rotation for live service issues.
Be an Early Applicant
2 Days Ago
Vancouver, BC, CAN
145,454 Employees
Entry level
145,454 Employees
Entry level
Computer Vision • Hardware • Mobile • Software • Semiconductor
As a Jr DevOps Engineer at Samsung, you will support and enhance the cloud infrastructure, collaborating with teams to implement cloud technologies, and ensure system reliability. You will develop tools for efficiency and participate in an on-call rotation.
Be an Early Applicant
6 Days Ago
Vancouver, BC, CAN
13,285 Employees
Senior level
13,285 Employees
Senior level
Big Data • Cloud • Digital Media • Machine Learning • Mobile • Software • Industrial
As a Senior Site Reliability Engineer at Autodesk, you will oversee the Autodesk Identity service, enhance cloud infrastructure for millions of users, automate systems for reliability, monitor performance, and lead incident response efforts while ensuring security compliance. You'll also mentor teams and foster continuous learning.

What you need to know about the Vancouver Tech Scene

Raincouver, Vancity, The Big Smoke — Vancouver is known by many names, and in recent years, it has gained a reputation as a growing hub for both tech and sustainability. Renowned for its natural beauty, the city has become a magnet for professionals eager to create environmental solutions, and with an emphasis on clean technology, renewable energy and environmental innovation, it's attracted companies across various industries, all working toward a shared goal: advancing clean technology.

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account