Camunda

Senior Site Reliability Engineer (East Coast-Remote)

Reposted 21 Days Ago
Remote
3 Locations
Senior level
As a Site Reliability Engineer, you'll design and maintain Kubernetes-based infrastructure, enhance monitoring tools, support critical incidents, and drive automation to improve operational efficiency.
The summary above was generated by AI

Camunda is the leader in enterprise agentic automation, orchestrating complex business processes, including high-value knowledge work, across agents, people, and systems. By creating production-ready, enterprise-grade agents with built-in governance, Camunda uniquely delivers trusted AI agents for business-critical processes. Over 700 leading innovators, like Atlassian, ING, and Vodafone, rely on Camunda to slash time-to-value from months to days, boost operational efficiency, and elevate customer experiences.


As a fully remote, global company, we’re rewriting the rules of modern business. Named to GP Bullhound’s 2024 Top 100 Next Unicorn list, certified as a Great Place to Work, and recognized by Flexa for true flexibility, we’re growing fast and looking for top talent to join our team. If you’re excited to do meaningful work and make a real impact, keep reading: this role could be the one you’ve been waiting for.

About the role:

At Camunda, we believe in empowering businesses to automate their processes – and that starts with building incredibly reliable platforms. As a Senior Site Reliability Engineer, you’ll be at the heart of this mission! You'll play a crucial role in designing, maintaining, and improving our Kubernetes-based infrastructure and multi-cloud platform. You’ll work alongside talented engineers across teams to ensure Camunda runs smoothly for our customers worldwide, proactively identifying opportunities for improvement and contributing to a culture of continuous learning and operational excellence. This isn't just about keeping things running; it's about shaping the future of how we deliver value through automation.

Curious about the kind of challenges you’ll work on at Camunda? Watch this quick 30-minute talk from our engineers to learn more about the new Camunda Exporter and how we’re solving complex problems at scale.

What You’ll Be Doing:

  • Architect & Maintain Our Platform: Design, build, and maintain our Kubernetes-based infrastructure and multi-cloud platform, focusing on availability, scalability, and fault tolerance. You will be directly involved in expanding Camunda SaaS capabilities by playing an important role in upcoming projects like:

    • Making our service available as a multi-region offering

    • Expanding the availability of our service to new regions and cloud providers

  • Champion Observability: Implement and enhance our monitoring tools to provide clear visibility into the health and performance of our entire stack – for both SREs and developers. You will play an instrumental part in evolving Camunda’s monitoring and observability practice to support a multi-cloud, multi-region product.

  • Collaborate & Innovate: Work closely with cross-functional teams (development, product, etc.) to define, improve, and efficiently ship new features. Bring your experience to bear on how we can innovate and automate our processes further. You will be directly involved in developing new capabilities for Camunda SaaS.

  • Be a Trusted Resource: Provide third-level support for critical incidents and participate in our on-call rotation, ensuring rapid response and resolution. You will directly assist our customers and partners, helping to deliver a world-class SaaS experience.

  • Drive Automation: Identify opportunities to automate manual tasks and improve operational efficiency across the platform. You will help Camunda:

    • Continue to scale operations with automation

    • Evolve operational strategy to uplevel Camunda as a world-class SaaS provider

What You Bring:

  • Must Haves:

    • 5+ years of experience in Site Reliability Engineering (SRE) or a similar role, with a strong focus on cloud infrastructure.

    • Deep understanding and practical experience with Kubernetes and containerization technologies (Docker, etc.).

    • Proficiency in at least one scripting language (Python, Go, Bash) for automation and tooling development.

    • Experience with monitoring and observability tools (Prometheus, Grafana, ELK stack, Datadog, New Relic – or similar).

  • Nice to Haves:

    • Experience working in a multi-cloud environment (AWS, Azure, GCP).

    • Familiarity with Infrastructure as Code (IaC) tools like Terraform or CloudFormation.

 


What We Have to Offer:

Compensation

We offer competitive, fair, and transparent compensation. Salary ranges are location-based, with Standard and Major markets (global tech hubs) reflecting local competition.

The Annual Total Target Cash (base salary + 100% variable target, where applicable) shown below spans from the minimum in a Standard market to the maximum in a Major market. Final offers depend on skills, experience, and location, and we typically hire in the first half of the range to allow room for growth:

  • United States: $149,800 to $247,200

  • Germany: €96,800 to €160,100

  • United Kingdom: £94,100 to £154,700

  • Singapore: S$186,100 to S$279,100

If you’re based elsewhere, you’ll be hired via Remote.com (our global employer partner), and your Talent Acquisition Partner will provide a personalized Total Rewards Calculator after your first interview.

Equity: We also offer equity (where applicable) through our Virtual Stock Option Plan (VSOP).

 

Benefits & Perks

We invest in your wellbeing, growth, and ability to connect, along with perks that support you no matter where you’re based. Our benefits are globally designed and locally delivered where applicable.

  • Remote & Flexible: Work from anywhere with the setup that suits you: home office budget, co-working space support, and flexible time off to recharge when you need it.

  • In-Person Connection: We invest in meaningful face time through our Annual Kickoff (Vienna in 2025, Madrid in 2026!), team offsites, and Camundi Connection Budgets, which contribute to meetups while travelling and local gatherings with fellow Camundi.

  • Health & Wellbeing: Access locally tailored healthcare, Modern Health for global mental wellbeing, and an annual fitness reimbursement.

  • Financial Security: Retirement and pension plans (often with company contributions), plus life and disability insurance where relevant.

  • Professional Growth: Up to $/€/£1,000 per year for self-driven learning, such as courses, certifications, and books. You decide!

  • More of what we offer globally & in your country can be found here.

“Everyone is welcome at Camunda” is a celebrated component of our culture. We strive to create an inclusive environment that empowers our people. At Camunda, we honour diverse cultures and backgrounds and are proud to be an equal opportunity employer. All qualified applicants will receive consideration without regard to gender, race, ethnicity, religion, belief, sexual orientation, age, disability or any other protected characteristics under applicable law. We are looking forward to your application!

Come join us and be part of Camunda’s incredible journey: Make an impact at a pivotal moment in our story!

Top Skills

Bash
Datadog
Docker
Elk Stack
Go
Grafana
Kubernetes
New Relic
Prometheus
Python
