Hiive Jobs

Site Reliability Engineer

Hiive

Site Reliability Engineer

Reposted 6 Days Ago

Hybrid

Vancouver, BC, CAN

Mid level

Hybrid

Vancouver, BC, CAN

Mid level

As a Site Reliability Engineer at Hiive, you will enhance platform reliability and performance, support AI/ML workloads, automate processes, and assist with incident responses.

The summary above was generated by AI

Hiive is redefining how private companies and their shareholders access liquidity. Through its institutional-grade platform, Hiive brings together buyers, sellers, and issuers to facilitate secondary transactions in venture-backed, pre-IPO companies, introducing efficiency, transparency, and standardization to an otherwise opaque asset class.

Recognized as one of Canada’s fastest-growing companies and backed by leading U.S. investors, Hiive is profitable, well-capitalized, and building a high-performance team to meet growing demand and pursue new market opportunities.

Interested in learning more about life at Hiive? Check out our careers page to see how you can grow with us!

As a Site Reliability Engineer at Hiive, you will be responsible for ensuring the reliability, availability, and performance of our platform. You’ll join our small but growing infrastructure team, working closely with the DevOps team and engineering leadership. As a hands-on contributor, you will build scalable and resilient infrastructure, automate processes, and respond to incidents efficiently and effectively.

You will help implement security and compliance measures, act as a trusted resource to your colleagues, and collaborate across teams to continuously improve our platform’s performance and reliability. You’ll also contribute to fostering an excellent, supportive engineering culture.

As Hiive continues to expand its use of AI across the platform, you will play a key role in building and operating the infrastructure that powers these systems. This includes supporting AI/ML workloads, improving observability into model performance and system behavior, and ensuring these services are reliable, scalable, and cost-efficient in production.

In this role, your responsibilities would include:

Maintain and improve our platform's uptime and availability
Optimize and maintain our infrastructure to improve reliability, performance, and security
Proactively identify and resolve scaling and reliability issues before they impact users or business metrics
Partner with product engineers to troubleshoot performance issues and implement effective solutions
Configure and maintain monitoring, alerting, and observability systems across our stack
Assist with incident response, including investigation, mitigation, and postmortems; develop and maintain incident runbooks
Participate in an on-call rotation shared across the engineering organization
Support and scale infrastructure for AI/ML systems, including model-serving workloads, data pipelines, and batch/async processing
Improve observability for AI systems (latency, cost, drift, failures) and help define reliability standards for these workloads

Required Skills:

Experience in a Site Reliability Engineering or similar role
Experience working with (writing or deploying) Elixir, or a strong desire to learn
Experience operating production Kubernetes clusters
Proficiency building infrastructure with Terraform
Strong experience with AWS (especially EKS, RDS, and VPC) and Vercel
Experience working with and optimizing PostgreSQL
Experience with Datadog or similar observability tools

Preferred Skills:

Experience working in regulated or high-compliance environments
Experience with CI/CD systems such as GitHub Actions
Experience supporting SOC 2 or similar certifications
Experience working with Cloudflare
Hands-on development experience in one or more programming languages
Experience supporting AI/ML systems in production (e.g., model serving, vector databases, or data pipelines)

Compensation, Benefits & Perks:

Highly competitive salary commensurate with experience and contribution.
Opportunity to participate in ownership of a rapidly growing company through our employee stock option plan.
Comprehensive 100% employer-paid health & dental premiums and a Health/Personal Spending Account for Canadian employees. (An employer-subsidized benefits program is available for US-based team members).
If you are based in Vancouver, enjoy a dedicated desk in our Vancouver, BC HQ, in the heart of downtown, with a fridge stocked with healthy snacks and drinks, an onsite gym, and a gorgeous rooftop amenity.
Enjoy a $20-per-day commuter benefit for every day you work in our Vancouver HQ.
An engaging social calendar, including bi-weekly catered lunches, bi-weekly “Friday bar,” team workouts, annual summer party, and holiday party, two “onsite” all-team retreats each year, semi-annual team-building events, and Hiive Women’s Network events.
Significant opportunities for growth into team leadership and management roles.
Entrepreneurial culture and a small and dynamic team.
Sponsorship, immigration, and relocation for exceptional candidates.

Hiive is committed to fostering an inclusive workplace where all individuals have an opportunity to succeed.

AI, automated tools, and applicant privacy notice:

As part of our recruitment and hiring process, Hiive may use automated tools, including artificial intelligence (AI), to assist in screening applications, evaluating candidate qualifications, and supporting interview processes. These tools are designed to support and inform human decision-making and are not used as the sole basis for any employment decision.

We may collect, use, and analyze personal information you provide in connection with your application, including generating insights or inferences to assess job-related qualifications. This information is used for recruitment, evaluation, and compliance purposes in accordance with applicable law.

We take reasonable steps to evaluate and monitor our hiring tools and practices to promote fairness, consistency, and non-discrimination. Where required by applicable law - including in Ontario, Quebec, New York City, Illinois, and California - we conduct or rely on assessments such as bias audits, honor rights related to automated decision-making, and provide additional disclosures on request.

Depending on your location, you may have certain rights with respect to your personal information and the use of automated processing, including the right to request access to, correction of, or deletion of your information, or to receive additional information about our data practices. We honor such rights where required by applicable law.

For accommodation requests or questions about this notice, contact [email protected].

34 W 8th Ave, Vancouver, British Columbia, Canada, V5Y 1M7

Similar Jobs

MongoDB

Site Reliability Engineer

13 Days Ago

Easy Apply

Remote or Hybrid

Easy Apply

Senior level

Big Data • Cloud • Software • Database

Maintain and improve multi-cloud Kubernetes infrastructure, CI/CD (Argo Workflows/ArgoCD), observability, and networking. Build reliable continuous deployment tooling and onboarding flows, provide internal support, collaborate across Platform Engineering, contribute upstream (open-source/operators), and participate in a 24/7 on-call rotation to resolve deployment infrastructure issues.

Top Skills: AlertingArgo WorkflowsArgocdAWSAzureCi/CdContainersDnsGCPGoKubernetesLinuxLoad BalancerObservabilityPythonService MeshTcp/IpTls

Fortinet

Site Reliability Engineer

2 Hours Ago

In-Office

Burnaby, BC, CAN

Senior level

Security • Software • Cybersecurity

Operate, maintain, automate, and improve OpenStack-based private cloud platforms with GitOps-driven lifecycle management. Lead SRE/DevOps practices, optimize CI/CD pipelines and observability, troubleshoot platform issues, provide technical leadership, and participate in on-call support to ensure reliability and performance.

Top Skills: AnsibleAwxBashCentos StreamDnsDockerElkForemanGitlab Ci/CdGitlab RunnersGitopsGoGrafanaHttpsKvmLdapLokiOpenstackPowershellPrometheusPythonRhelSAMLSmtpTcp/IpTerraformUbuntu

Fortinet

Site Reliability Engineer

2 Hours Ago

In-Office

Burnaby, BC, CAN

Senior level

Security • Software • Cybersecurity

Hands-on SRE Specialist responsible for building, running, and improving global consumer-facing services on OpenStack, Kubernetes, VMware and physical servers. Duties include automation (Ansible/Bash/Python), CI/CD with GitLab, monitoring (Zabbix/Grafana/ELK), incident response and on-call, Linux and network administration, security/compliance support (SOC 2/ISO 27001), and documentation.

Top Skills: AnsibleBashDnsDockerElk StackGitGitlabGrafanaIptablesKubernetesLdapLinux (Red Hat/Centos/Ubuntu)MySQLOpenstackPythonSmtpVMwareZabbix

What you need to know about the Vancouver Tech Scene

Raincouver, Vancity, The Big Smoke — Vancouver is known by many names, and in recent years, it has gained a reputation as a growing hub for both tech and sustainability. Renowned for its natural beauty, the city has become a magnet for professionals eager to create environmental solutions, and with an emphasis on clean technology, renewable energy and environmental innovation, it's attracted companies across various industries, all working toward a shared goal: advancing clean technology.

Hiive

Site Reliability Engineer

Hiive Vancouver, British Columbia, CAN Office

Similar Jobs

Site Reliability Engineer

Site Reliability Engineer

Site Reliability Engineer

What you need to know about the Vancouver Tech Scene