Labelbox Logo

Labelbox

Applied Research Engineer

Posted 17 Days Ago
Be an Early Applicant
7 Locations
Mid level
7 Locations
Mid level
As an Applied Research Engineer, you will develop systems for high-quality human-in-the-loop data, working on AI training processes like RLHF. Responsibilities include designing algorithms, measuring data quality, creating AI tools, and publishing research findings while collaborating with teams and engaging with the AI community.
The summary above was generated by AI

Labelbox is the data factory for generative AI, providing the highest quality training data for frontier and task-specific models. Labelbox’s comprehensive platform combines on-demand labeling services powered by the Alignerr community of highly-educated experts, who span all major languages and a diverse range of advanced subjects. They are available on-demand to rapidly generate new data for supervised fine-tuning, RLHF, and more. Labelbox’s software-first approach delivers unmatched control and transparency into the labeling process, leading to the generation of high-quality, consistent data at scale.

Labelbox is backed by leading investors including SoftBank, Andreessen Horowitz, B Capital, Gradient Ventures, Databricks Ventures, and Kleiner Perkins. Our customers include Fortune 500 enterprises and leading AI labs.

About the Role

As an Applied Research Engineer at Labelbox, you will be at the forefront of developing cutting-edge systems and methods to create, analyze, and leverage high-quality human-in-the-loop data for frontier model developers. Your role will involve designing and implementing advanced systems that align human feedback into AI training processes, such as Reinforcement Learning from Human Feedback (RLHF), Direct Preference Optimization (DPO), etc. You will also work on innovative techniques to measure and improve human data quality, and develop AI-assisted tools to enhance the data labeling process. Your expertise in machine learning, frontier model training, and advanced human data alignment techniques will be crucial in pushing the boundaries of AI capabilities and delivering state-of-the-art solutions to meet the evolving needs of our customers.

This role is based in our dedicated tech hub in San Francisco, CA. We use a hybrid work model of 2 days in the office per week.

Your Day to Day

  • Conduct cutting-edge research on advanced methods for aligning human preferences with AI systems, including RLHF and other novel approaches.

  • Design and develop rigorous systems to measure, enhance, and leverage the quality of human-in-the-loop data for AI training.

  • Create AI-assisted tools that incorporate active learning and adaptive sampling techniques to increase the efficiency and effectiveness of the human data labeling process.

  • Investigate the impact of different types of human feedback (e.g., demonstrations, preferences, critiques) on model performance and alignment.

  • Develop and implement novel algorithms for learning from human preferences and for optimizing the human feedback collection process.

  • Collaborate with engineering and product teams to integrate research findings into Labelbox's product suite, focusing on scalable and practical applications of human-AI alignment techniques.

  • Engage with customers and the AI community to understand evolving human data needs for frontier models and to share insights on best practices.

  • Publish research findings in top-tier academic journals and present at leading AI conferences.

  • Stay at the forefront of advancements in AI, particularly in areas related to human data quality, human-AI collaboration, and AI alignment.

  • Contribute to technical documentation, blog posts, and educational content to establish Labelbox as a thought leader in human-centric AI development.

About You

  • Ph.D. or Master's degree in Computer Science, Machine Learning, AI, or related field

  • At least 3 years of experience addressing sophisticated ML problems with successful delivery to customers

  • Track record of designing robust data quality measurement and refinement systems for improving model performance

  • Deep understanding of frontier models (e.g., large language models, multimodal models), state-of-the-art post-training methods, and their human data requirements

  • Proficiency in programming languages such as Python, and experience with deep learning frameworks (e.g., PyTorch, JAX, TensorFlow)

  • Excellent research skills with a track record of publications in top-tier AI/ML venues (e.g., ACL, EMNLP, NAACL, NeurIPS, ICML, ICLR, etc.)

  • Adept at interpreting research literature and quickly turning new ideas into prototypes

  • Strong analytical and problem-solving abilities

  • Excellent communication skills and ability to collaborate in a multidisciplinary team

Labelbox Applied Research

At Labelbox Applied Research, we're committed to pushing the boundaries of AI and data-centric machine learning, with a particular focus on advanced human-AI interaction techniques. We believe that high-quality human data and sophisticated human feedback integration methods are key to unlocking the next generation of AI capabilities. Our research team works at the intersection of machine learning, human-computer interaction, and AI ethics to develop innovative solutions that can be practically applied in real-world scenarios.

We foster an environment of intellectual curiosity, collaboration, and innovation. We encourage our researchers to explore new ideas, engage in open discussions, and contribute to the wider AI community through publications and conference presentations. Our goal is to be at the forefront of human-centric AI development, setting new standards for how AI systems learn from and interact with humans.

Labelbox strives to ensure pay parity across the organization and discuss compensation transparently.  The expected annual base salary range for United States-based candidates is below. This range is not inclusive of any potential equity packages or additional benefits. Exact compensation varies based on a variety of factors, including skills and competencies, experience, and geographical location.

Annual base salary range

$220,000$300,000 USD

Your Personal Data Privacy: Any personal information you provide Labelbox as a part of your application will be processed in accordance with Labelbox’s Job Applicant Privacy notice.

Any emails from Labelbox team members will originate from a @labelbox.com email address. If you encounter anything that raises suspicions during your interactions, we encourage you to exercise caution and suspend or discontinue communications.

Top Skills

Python

Similar Jobs

15 Minutes Ago
Hybrid
Ingersoll, ON, CAN
Senior level
Senior level
Automotive • Big Data • Information Technology • Robotics • Software • Transportation • Manufacturing
In this role, you will manage quality improvements in metrics like eFTQ and Warranty, fix supplier and engineering issues affecting vehicle assembly, and support new product launches. You'll utilize problem-solving tools, develop quality standards, and collaborate with teams to prevent defects and ensure process improvements.
Top Skills: Engineering
An Hour Ago
Hybrid
Toronto, ON, CAN
Mid level
Mid level
Artificial Intelligence • Hardware • Information Technology • Security • Software • Cybersecurity • Big Data Analytics
The Full Stack Angular Developer will develop web user interfaces using Angular, design backend services with Node.js, and create hybrid mobile apps. Responsibilities include collaborating with design teams, creating internal tools, troubleshooting customer issues, and implementing new technologies for improving user experiences.
Top Skills: AngularNode.jsTypescript
3 Hours Ago
Remote
Hybrid
8 Locations
Senior level
Senior level
eCommerce • Fintech • Hardware • Payments • Software • Financial Services
Lead engineering projects for Square's mobile applications, focusing on building remote settings and configuration management systems. Collaborate with design and product teams to enhance user experience, ensure high availability in systems, and respond to customer feedback for product improvement.
Top Skills: Objective-CSwift

What you need to know about the Vancouver Tech Scene

Raincouver, Vancity, The Big Smoke — Vancouver is known by many names, and in recent years, it has gained a reputation as a growing hub for both tech and sustainability. Renowned for its natural beauty, the city has become a magnet for professionals eager to create environmental solutions, and with an emphasis on clean technology, renewable energy and environmental innovation, it's attracted companies across various industries, all working toward a shared goal: advancing clean technology.

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account