Labelbox Logo

Labelbox

Frontier Data Engineer

Sorry, this job was removed at 06:08 p.m. (PST) on Tuesday, Feb 04, 2025
Be an Early Applicant
7 Locations
7 Locations

Labelbox is the data factory for generative AI, providing the highest quality training data for frontier and task-specific models. Labelbox’s comprehensive platform combines on-demand labeling services powered by the Alignerr community of highly-educated experts, who span all major languages and a diverse range of advanced subjects. They are available on-demand to rapidly generate new data for supervised fine-tuning, RLHF, and more. Labelbox’s software-first approach delivers unmatched control and transparency into the labeling process, leading to the generation of high-quality, consistent data at scale.

Labelbox is backed by leading investors including SoftBank, Andreessen Horowitz, B Capital, Gradient Ventures, Databricks Ventures, and Kleiner Perkins. Our customers include Fortune 500 enterprises and leading AI labs.

About the Role

This role requires an entrepreneurial mindset. You will operate like a technical cofounder of an AI startup. Maybe that’s what you will become after cutting your teeth at Labelbox!

You will use a unique mix of engineering, product and sales to deliver data on high stakes projects for Frontier AI labs - think Google, Meta, Amazon. You will work with human data teams or AI researchers in customer organizations to produce the best datasets for model Evals, Supervised Finetuning, or ground truth data for advanced and emerging Reinforcement Learning techniques

This role is based in our dedicated tech hub in San Francisco, CA. We use a hybrid work model of 2 days in the office per week.

Your Day to Day

  • Understand what data Frontier AI labs need to measure LLM performance and improve model capabilities.

  • Build and operate human data pipelines to produce the best data.

  • Write code for pipelines and analyze data quality.

  • Write specs for engineering to improve human data tools and workflows.

About You

  • Master’s degree or above (Computer Science, Engineering, Mathematics, AI)

  • Proficiency in Python

  • Ability to communicate precisely and clearly

  • Excellent project management skills

  • Interest in the intersection of product, engineering and customer facing role

Labelbox strives to ensure pay parity across the organization and discuss compensation transparently.  The expected annual base salary range for United States-based candidates is below. This range is not inclusive of any potential equity packages or additional benefits. Exact compensation varies based on a variety of factors, including skills and competencies, experience, and geographical location.

Annual base salary range

$140,000$200,000 USD

Your Personal Data Privacy: Any personal information you provide Labelbox as a part of your application will be processed in accordance with Labelbox’s Job Applicant Privacy notice.

Any emails from Labelbox team members will originate from a @labelbox.com email address. If you encounter anything that raises suspicions during your interactions, we encourage you to exercise caution and suspend or discontinue communications.

Similar Jobs

4 Days Ago
Toronto, ON, CAN
Senior level
Senior level
Cloud • Fintech • HR Tech
The Sr. Data Engineer will build scalable data solutions and robust data pipelines, collaborating with various teams to drive data-driven decisions.
Top Skills: AirflowAWSKafkaMongoDBPythonRedshiftSagemakerScalaSnaplogicSnowflakeSparkTableau
8 Days Ago
Hybrid
Vancouver, BC, CAN
Senior level
Senior level
Insurance • Software
The Senior Data Engineer will develop scalable systems and infrastructure, enhancing data handling processes, and collaborating with technical teams.
Top Skills: AIAutomationCloud ComputingData Infrastructure
8 Days Ago
Toronto, ON, CAN
Senior level
Senior level
Payments
The Senior Data Engineer will design, develop, and optimize data solutions and pipelines, collaborating with other teams to improve data systems and implement ETL processes.
Top Skills: Amazon GlueAWSDocumentdbEventbridgeHadoopMongoDBMqlPostgresPythonRedshiftSparkSQLSQL Server

What you need to know about the Vancouver Tech Scene

Raincouver, Vancity, The Big Smoke — Vancouver is known by many names, and in recent years, it has gained a reputation as a growing hub for both tech and sustainability. Renowned for its natural beauty, the city has become a magnet for professionals eager to create environmental solutions, and with an emphasis on clean technology, renewable energy and environmental innovation, it's attracted companies across various industries, all working toward a shared goal: advancing clean technology.

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account