WitnessAI

Senior DevOps Engineer - GCP

Posted 7 Days Ago
In-Office
7 Locations
Senior level
The Senior DevOps Engineer will design and maintain infrastructure on GCP for AI workloads, focusing on security and performance, while collaborating with multiple teams.
The summary above was generated by AI

Job Title: Senior DevOps Engineer – GCP
Location: SF Bay Area
Type: Full-time
Team: Platform Engineering
Reports To: Head of Platform Engineering

About Us

WitnessAI is a fast-growing SaaS startup on a mission to enable enterprises to adopt AI, safely. We're building a product that provides security and governance guardrails for public and private LLMs.

About the Role

We’re looking for a Sr. DevOps Engineer who will take ownership of designing, securing, and scaling the cloud backbone of our AI security platform. You’ll be responsible for infrastructure architecture in Google Cloud Platform (GCP), with a deep focus on networking, VPC security, interconnectivity, and service reliability.

You’ll work closely with ML engineers, security researchers, and backend developers to build highly reliable, secure, and performant environments for running AI workloads and security tooling.

What You’ll Do
  • Design, implement, and maintain GCP-based infrastructure for secure AI workloads and APIs

  • Build and manage scalable, low-latency Internet and cloud networking strategies, e.g., Anycast+, route optimization, VPC networks, private VPC peering, private service access, and firewall configurations

  • Develop secure ingress/egress patterns, service meshes, and zero-trust networking topologies

  • Automate infrastructure provisioning using Terraform, Helm, and CI/CD workflows

  • Collaborate on platform observability: logging, monitoring, alerting, and incident response

  • Harden cloud infrastructure against threats using IAM best practices, organization policies, and GCP security controls

  • Work cross-functionally with engineering, data science, and security teams to optimize environment reliability and cost

  • Help establish infrastructure SLAs, SLOs, and runbooks as the platform scales

What We’re Looking For
  • 7+ years of experience in infrastructure engineering, site reliability, or DevOps

  • Deep expertise in Google Cloud Platform, including VPC networking, Cloud NAT, private services, and inter-project connectivity

  • Strong knowledge of Terraform and infrastructure-as-code practices

  • Proficiency with container orchestration (Kubernetes / GKE preferred)

  • Experience designing for high availability, scalability, and secure access across cloud environments

  • Familiarity with service mesh tools (Istio, Linkerd, etc.) and API gateways

  • Solid understanding of Linux, DNS, TLS, load balancing, and network security principles

  • Comfort working in a fast-paced startup environment with ownership and autonomy

  • Bonus: Experience in regulated or high-security environments (SOC 2, FedRAMP, HIPAA, etc.)

  • Bonus: Exposure to ML infrastructure, GPU workloads, or data pipelines

Benefits:

  • Hybrid work environment

  • Competitive salary, health, dental, and vision insurance

  • 401(k) plan

  • Opportunities for professional development and growth

  • Generous vacation policy

Salary range:

$168K-$225K (Bay Area)

Top Skills

CI/CD
DNS
Google Cloud Platform
Helm
Istio
Kubernetes
Linkerd
Linux
Terraform
TLS
