Docker, Inc

Senior Infrastructure Engineer

Posted 3 Days Ago
Remote
9 Locations
Senior level
As a Senior Infrastructure Engineer, you will design, develop, and operate cloud services, focusing on Kubernetes and automation using Infrastructure as Code, while driving operational excellence and collaboration across teams.
The summary above was generated by AI

At Docker, we make app development easier so developers can focus on what matters. Our remote-first team spans the globe, united by a passion for innovation and great developer experiences. With over 20 million monthly users and 20 billion image pulls, Docker is the #1 tool for building, sharing, and running apps—trusted by startups and Fortune 100s alike. We’re growing fast and just getting started. Come join us for a whale of a ride!

Role Summary

Our Infrastructure Engineering team writes software and operates the cloud‑native platform that powers Docker products such as Docker Hub, Docker Build Cloud, and Docker Scout. We design resilient services, automate everything, and measure what matters so hundreds of engineers can ship features to millions of users every day.

You will:

  • Build and run internal platform services (provisioning APIs, cost‑optimisation tools, observability pipelines) on AWS.

  • Evolve our multi‑tenant Kubernetes environment and networking layer to deliver secure, reliable, and cost‑effective compute at global scale.

  • Drive reliability through code, embracing GitOps, Infrastructure as Code, and SLO‑based operations.

How We Work

  • Code first: we tackle infra problems with software, design docs, and rigorous code review.

  • Async & remote‑first: decisions are documented in RFCs; incident reviews are blameless and written.

  • Cross‑functional: platform, product, and security engineers collaborate daily to unblock each other.

  • Continuous improvement: we ship small, measure impact, and iterate quickly.

Responsibilities

1. Ship & Operate Cloud Services
  • Design, develop, and ship internal platform services (e.g. provisioning, cost insights, rate‑limiting) in Go or Python.

  • Partner with product and engineering teams to provide paved‑road patterns for deployment, observability, and security.

2. Infrastructure as Code & Reliability
  • Codify infrastructure with Terraform and Go; champion GitOps best practices.

  • Define SLOs, lead on‑call rotations, conduct blameless post‑mortems, and implement preventive actions.

3. Platform Foundations (Kubernetes & Networking)
  • Evolve Docker’s ingress stack—Envoy Gateway, ALB/NLB, AWS VPC CNI—to deliver secure, reliable, and cost‑efficient request routing.

  • Operate and scale multi‑tenant EKS clusters; guide the evaluation and adoption of new infrastructure technologies.

Qualifications

Core Engineering Skills (must‑have)
  • Strong software development skills in Go, Python, or similar (design, testing, and code review).

  • Significant experience shipping and operating cloud applications/services in production (typically 5+ years of relevant work).

  • Solid foundation in Linux, networking, and cloud security.

  • Excellent written and verbal communication in a remote environment.

Depth in one or more of the following (nice‑to‑have)
  • Kubernetes ecosystem (EKS, ingress, CNI, service mesh).

  • Observability tooling (OpenTelemetry, Prometheus, Grafana).

  • CI/CD & release automation (GitHub Actions, Argo CD).

  • Cost optimisation at scale (FinOps, capacity modelling).

  • Distributed systems, containers, and Go‑based platform tooling.

Demonstrated expertise in at least one of these areas is welcome; we don’t expect candidates to be experts in all.

What to Expect

First 30 Days
  • Complete Docker onboarding and meet teammates across Engineering, Security, and Product.

  • Ship your first change to a Terraform module or internal service and shadow on‑call.

  • Gain a deep understanding of our platform architecture, SLOs, and current reliability initiatives.

First 90 Days
  • Take ownership of a critical service or infrastructure component and lead a performance‑oriented project from design to production.

  • Rotate fully into the on‑call schedule, leading incident response when needed.

  • Contribute to refining our platform roadmap and advocate for improvements that reduce toil and accelerate delivery.

First Year
  • Lead the design and rollout of a major infrastructure initiative.

  • Become a go‑to subject matter expert within Docker for cloud platform and networking.

  • Mentor newer engineers and influence engineering culture through technical leadership and continuous improvement.

We use Covey as part of our hiring and/or promotional process for jobs in NYC, and certain features may qualify it as an automated employment decision tool (AEDT). As part of the evaluation process, we provide Covey with job requirements and candidate-submitted applications. We began using Covey Scout for Inbound on April 13, 2024.

Please see the independent bias audit report covering our use of Covey here.

Perks

  • Freedom & flexibility; fit your work around your life

  • Designated quarterly Whaleness Days

  • Home office setup; we want you comfortable while you work

  • 16 weeks of paid parental leave

  • Technology stipend equivalent to $100 net/month

  • PTO plan that encourages you to take time to do the things you enjoy

  • Quarterly, company-wide hackathons

  • Training stipend for conferences, courses and classes

  • Equity; we are a growing start-up and want all employees to have a share in the success of the company

  • Docker Swag

  • Medical benefits, retirement and holidays vary by country

Docker embraces diversity and equal opportunity. We are committed to building a team that represents a variety of backgrounds, perspectives, and skills. The more inclusive we are, the better our company will be.

Due to the remote nature of this role, we are unable to provide visa sponsorship.

Top Skills

Argo CD
AWS
Envoy
GitHub Actions
GitOps
Go
Grafana
Kubernetes
OpenTelemetry
Prometheus
Python
Terraform
