Labelbox Logo

Labelbox

Frontier Data Engineer

Sorry, this job was removed at 06:08 p.m. (PST) on Tuesday, Feb 04, 2025
Be an Early Applicant
7 Locations
7 Locations

Labelbox is the data factory for generative AI, providing the highest quality training data for frontier and task-specific models. Labelbox’s comprehensive platform combines on-demand labeling services powered by the Alignerr community of highly-educated experts, who span all major languages and a diverse range of advanced subjects. They are available on-demand to rapidly generate new data for supervised fine-tuning, RLHF, and more. Labelbox’s software-first approach delivers unmatched control and transparency into the labeling process, leading to the generation of high-quality, consistent data at scale.

Labelbox is backed by leading investors including SoftBank, Andreessen Horowitz, B Capital, Gradient Ventures, Databricks Ventures, and Kleiner Perkins. Our customers include Fortune 500 enterprises and leading AI labs.

About the Role

This role requires an entrepreneurial mindset. You will operate like a technical cofounder of an AI startup. Maybe that’s what you will become after cutting your teeth at Labelbox!

You will use a unique mix of engineering, product and sales to deliver data on high stakes projects for Frontier AI labs - think Google, Meta, Amazon. You will work with human data teams or AI researchers in customer organizations to produce the best datasets for model Evals, Supervised Finetuning, or ground truth data for advanced and emerging Reinforcement Learning techniques

This role is based in our dedicated tech hub in San Francisco, CA. We use a hybrid work model of 2 days in the office per week.

Your Day to Day

  • Understand what data Frontier AI labs need to measure LLM performance and improve model capabilities.

  • Build and operate human data pipelines to produce the best data.

  • Write code for pipelines and analyze data quality.

  • Write specs for engineering to improve human data tools and workflows.

About You

  • Master’s degree or above (Computer Science, Engineering, Mathematics, AI)

  • Proficiency in Python

  • Ability to communicate precisely and clearly

  • Excellent project management skills

  • Interest in the intersection of product, engineering and customer facing role

Labelbox strives to ensure pay parity across the organization and discuss compensation transparently.  The expected annual base salary range for United States-based candidates is below. This range is not inclusive of any potential equity packages or additional benefits. Exact compensation varies based on a variety of factors, including skills and competencies, experience, and geographical location.

Annual base salary range

$140,000$200,000 USD

Your Personal Data Privacy: Any personal information you provide Labelbox as a part of your application will be processed in accordance with Labelbox’s Job Applicant Privacy notice.

Any emails from Labelbox team members will originate from a @labelbox.com email address. If you encounter anything that raises suspicions during your interactions, we encourage you to exercise caution and suspend or discontinue communications.

Similar Jobs

15 Days Ago
Easy Apply
Hybrid
Toronto, ON, CAN
Easy Apply
Expert/Leader
Expert/Leader
Artificial Intelligence • Marketing Tech • Software
As a Principal Data Engineer, you will drive the direction of the Data Warehouse, enabling data access across departments and designing data pipelines to handle massive data ingestion and ensure compliance with data regulations. You will mentor less experienced team members and optimize pipeline performance.
Top Skills: Apache AirflowApache KafkaApache PulsarAWSBigQueryFireboltPythonRedshiftSnowflakeSQL
15 Days Ago
Hybrid
Toronto, ON, CAN
Senior level
Senior level
Cloud • Mobile • Software
As a Data Engineer at BuildOps, you will design and maintain data pipelines, focusing on data processing, automation, and machine learning enhancement. Collaboration with various teams to resolve technical data challenges is key, as is the continuous improvement of data workflows and ETL infrastructure.
5 Days Ago
7 Locations
Mid level
Mid level
Artificial Intelligence • Information Technology • Machine Learning
As a Frontier Data Engineer at Labelbox, you will work with AI labs to create high-quality datasets for machine learning models. Your role involves building data pipelines, analyzing data quality, and collaborating with engineering to enhance data tools and workflows.
Top Skills: Project ManagementPython

What you need to know about the Vancouver Tech Scene

Raincouver, Vancity, The Big Smoke — Vancouver is known by many names, and in recent years, it has gained a reputation as a growing hub for both tech and sustainability. Renowned for its natural beauty, the city has become a magnet for professionals eager to create environmental solutions, and with an emphasis on clean technology, renewable energy and environmental innovation, it's attracted companies across various industries, all working toward a shared goal: advancing clean technology.

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account