Cloudflare Logo

Cloudflare

Network Reliability Engineer

Reposted 4 Days Ago
Hybrid
7 Locations
Senior level
Hybrid
7 Locations
Senior level
The Network Reliability Engineer will manage Cloudflare's core data center network, automate tasks, and enhance network resilience with strong software development skills.
The summary above was generated by AI
Available Locations:
  • US:
    • Atlanta, GA
    • Austin, TX
    • Denver, CO
    • New York, NY
    • Seattle, WA
    • Washington, DC
  • Canada:
    • Toronto, ON

About the Role
Cloudflare operates a large global network spanning hundreds of cities (data centers). You will join a team of talented network engineers who are building software solutions to improve network resilience and reduce operational toil.This position will be responsible for the technical operation and engineering of the Cloudflare's core data center network, including the planning, installation and management of the hardware and software as well as the day-to-day operations of the network. The core network supports our critical internal needs such as databases, high volume logging, and internal application clusters. This is an opportunity to be part of the team that is building a high -performance network that is accessible to any web property online.
You will build tools to automate operational tasks, streamline deployment processes and provide a platform for other engineering teams to build upon. You will nurture a passion for an "automate everything" approach that makes systems failure-resistant and ready-to-scale. Furthermore, you will be required to play a key role in system design and demonstrate the ability to bring an idea from design all the way to production.
Examples of desirable skills, knowledge and experience
  • 5+ years of relevant Network/Site Reliability Engineering experience
  • BA/BS in Computer Science or equivalent experience
  • Solid foundation on configuration management frameworks: Saltstack, Ansible, Chef
  • Experience with NX-OS, JUNOS, EOS, Cumulus, or Sonic Network Operating Systems
  • Solid Linux systems administration experience
  • Linux networking - iproute2, Traffic Control, Devlink, etc.
  • Strong software development skills in Go and Python

Bonus Points
  • Deep knowledge of BGP and other routing protocols
  • Workflow Management (AirFlow, Temporal)
  • Open Source Routing Daemons (FRR, Bird, GoBGP)
  • Experience with bare metal switching
  • Experience with network programming in C, C++ or rust
  • Experience with the Linux kernel and Linux software packaging
  • Strong tooling and automations development experience
  • Time series databases (Prometheus, Grafana, Thanos, Clickhouse)
  • Other Tools - Kubernetes, Docker, Prometheus, Consul

Compensation
Compensation may be adjusted depending on work location and level.
  • For Colorado and Texas based hires: Estimated annual salary of $137,000 - $187,000.

Equity
This role is eligible to participate in Cloudflare's equity plan.
Benefits
Cloudflare offers a complete package of benefits and programs to support you and your family. Our benefits programs can help you pay health care expenses, support caregiving, build capital for the future and make life a little easier and fun! The below is a description of our benefits for employees in the United States, and benefits may vary for employees based outside the U.S.
Health & Welfare Benefits
  • Medical/Rx Insurance
  • Dental Insurance
  • Vision Insurance
  • Flexible Spending Accounts
  • Commuter Spending Accounts
  • Fertility & Family Forming Benefits
  • On-demand mental health support and Employee Assistance Program
  • Global Travel Medical Insurance

Financial Benefits
  • Short and Long Term Disability Insurance
  • Life & Accident Insurance
  • 401(k) Retirement Savings Plan
  • Employee Stock Participation Plan

Time Off
  • Flexible paid time off covering vacation and sick leave
  • Leave programs, including parental, pregnancy health, medical, and bereavement leave

Top Skills

Ansible
Chef
Clickhouse
Cumulus
Docker
Eos
Go
Grafana
Junos
Kubernetes
Linux
Nx-Os
Prometheus
Python
Saltstack
Sonic
Thanos

Similar Jobs at Cloudflare

6 Hours Ago
Hybrid
Austin, TX, USA
Senior level
Senior level
Cloud • Information Technology • Security • Software • Cybersecurity
Lead the redesign of ERP infrastructure lifecycle, ensuring data quality, integrating processes, and driving improvements for a new ERP system.
Top Skills: Asset Management Data ModelCmdbData Center Asset Lifecycle SolutionsDcimErp SystemsOracle ErpPimPlm
Yesterday
Hybrid
Austin, TX, USA
Entry level
Entry level
Cloud • Information Technology • Security • Software • Cybersecurity
The Project Coordinator will manage resource activities and coordinate customer engagements to ensure project success and satisfaction.
Yesterday
Hybrid
Austin, TX, USA
Mid level
Mid level
Cloud • Information Technology • Security • Software • Cybersecurity
Join a talented team as a Systems Reliability Engineer to enhance the Cloudflare platform's availability and performance using automation and monitoring tools.
Top Skills: AnsibleApache AirflowChefConsulDockerGoGrafanaLinuxNginxNomadPostgresPrometheusPuppetPythonRustSaltstackSQLTemporal

What you need to know about the Vancouver Tech Scene

Raincouver, Vancity, The Big Smoke — Vancouver is known by many names, and in recent years, it has gained a reputation as a growing hub for both tech and sustainability. Renowned for its natural beauty, the city has become a magnet for professionals eager to create environmental solutions, and with an emphasis on clean technology, renewable energy and environmental innovation, it's attracted companies across various industries, all working toward a shared goal: advancing clean technology.

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account