Buzz Solutions Logo

Buzz Solutions

Applied Machine Learning Platform Engineer

Reposted 10 Hours Ago
Remote
Hiring Remotely in CAN
Mid level
Remote
Hiring Remotely in CAN
Mid level
As an Applied Machine Learning Platform Engineer, you'll design and maintain training infrastructure, manage distributed pipelines, and optimize data workflows for machine learning models.
The summary above was generated by AI

About Us

Buzz is revolutionizing the analytics and maintenance of power grid infrastructure through our advanced AI solutions. Our computer vision systems analyze critical infrastructure to enhance safety, reliability, and operational efficiency across the power grid network.

Job Description 

We're looking for an entry/mid-level Applied Machine Learning Platform Engineer to join our computer vision team and help improve the databases, cloud infrastructure, and tooling our team builds on. You'll build tooling and infrastructure to help scale our training and data pipelines. You'll work within a team of experienced ML engineers with the autonomy to drive your own projects and the support to keep growing.


Responsibilities

  • Design, build, and maintain scalable training infrastructure for computer vision workloads
  • Implement and manage distributed training pipelines (multi-GPU, multi-node) to support large-scale model training and hyperparameter tuning
  • Build and maintain robust data pipelines for ML development
  • Design database schemas and storage strategies for managing large training datasets, annotations, and model artifacts
  • Implement and manage feature stores, data versioning, and experiment tracking to support reliable model iteration
  • Automate existing analysis workflows
  • Maintain clear documentation for platform components, data contracts, and deployment processes
  • Communicate infrastructure decisions, tradeoffs, and system limitations clearly to ML engineers and stakeholders
  • Conduct thorough code reviews and write integration tests for ML pipelines

Qualifications & Experience

  • 2-4 years of industry experience in platform, backend, data, or MLOps engineering roles
  • Python proficiency — idiomatic code, type hints, async patterns, packaging, and performance-aware implementation
  • Strong software engineering fundamentals — testing, code review, API design, component-level system design
  • Hands-on experience building and operating distributed cloud machine learning infrastructure
  • Designing and maintaining scalable training infrastructure, managing ML platform reliability, optimizing data pipelines for throughput at scale
  • Experience with database design and data systems for ML workloads — schema design, query optimization, and storage strategies for large-scale datasets
  • Excels at workflow orchestration and automation
  • Solid proficiency in Python and core ML tooling:
    • Python ecosystem: Pytest, UV, FastAPI, Pydantic
    • Tooling: Git, Docker, UV
    • Tracking: MLflow, Weights & Biases, or equivalent
    • Automation: Github Actions, CI/CD, Prefect or equivalent
    • Infrastructure: AWS, GCP, Kubernetes, Helm, Terraform or equivalent
    • Databases: postgres, DynamoDB, Bigtable

* Buzz Solutions does not provide Visa sponsorship for work authorizations in the United States at this time *

Similar Jobs

Senior level
Big Data • Food • Hardware • Machine Learning • Retail • Automation • Manufacturing
Perform monthly commodity analyses for grains and vegetable oils using supply/demand data and market forecasts; build and manage market intelligence databases; develop and improve constraint-optimization price models and long-range forecasts; support CPRM hedging and coverage strategy decisions; track team KPIs and deliver insights to inform pricing and risk management.
Top Skills: Artificial IntelligenceExcelPythonR
36 Minutes Ago
Easy Apply
Remote
Canada
Easy Apply
Mid level
Mid level
Cloud • Security • Software • Cybersecurity • Automation
Build, ship, and maintain backend features enabling AI agents to interact with GitLab. Design GraphQL/REST APIs, extend tests (RSpec), work with PostgreSQL, troubleshoot production issues, and collaborate cross-functionally to integrate AI tooling responsibly.
Top Skills: Background JobsGitlabGraphQLPostgresRest ApisRspecRubyRuby On RailsSQL
36 Minutes Ago
Easy Apply
Remote
Canada
Easy Apply
Senior level
Senior level
Cloud • Security • Software • Cybersecurity • Automation
Ownership of backend features for Agentic Tools: design and implement GraphQL/REST APIs, build secure scalable Ruby on Rails services, improve RSpec automated tests, collaborate across product and AI teams, participate in Tier 2 on-call, and shape architecture for AI agent interactions with GitLab.
Top Skills: Gitlab McpGraphQLPythonRestRspecRuby On RailsVue

What you need to know about the Vancouver Tech Scene

Raincouver, Vancity, The Big Smoke — Vancouver is known by many names, and in recent years, it has gained a reputation as a growing hub for both tech and sustainability. Renowned for its natural beauty, the city has become a magnet for professionals eager to create environmental solutions, and with an emphasis on clean technology, renewable energy and environmental innovation, it's attracted companies across various industries, all working toward a shared goal: advancing clean technology.

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account