Outlier AI Logo

Outlier AI

Humanity's Finest - Referrals

Posted 14 Days Ago
Be an Early Applicant
Remote
8 Locations
Expert/Leader
Remote
8 Locations
Expert/Leader
Assemble a team of experts to create PhD-level problems that challenge state-of-the-art AI models. Collaborate on datasets for research and co-author a research paper analyzing model reasoning enhancements. Contributors can earn up to $540 for each problem-solution pair submitted.
The summary above was generated by AI

Note: At the moment, this opportunity is not available to applicants from California and New York.

We'll cut to the chase: we're looking for the world's best experts to take on the world's smartest models.

In Humanity’s Last Exam, we introduced the most challenging reasoning benchmark for frontier AI models. So far, the highest-performing system—OpenAI’s Deep Research—has achieved only 26% accuracy. We are collaborating with leading AI labs to identify the most effective data to improve AI reasoning capabilities in expert domains, and we aim to publish a new paper presenting our findings.

To do this, we are assembling a team of elite individuals who are the utmost experts in their respective fields. Our shared goal is to create PhD+ level problems that current state-of-the-art LLMs cannot correctly solve. This team will work collaboratively to produce datasets that will be available to our partner research groups, the world’s most advanced AI laboratories.

We're looking for unicorns to help us write some of the hardest problems intelligence has ever seen — do you think you can do this?

Why Join?

  • Have the opportunity to co-author a research paper analyzing how effectively this data enhances model reasoning.
  • Get access to an exclusive community of world-class researchers in a variety of domains; Each task submission will be open to peer review.
  • Receive up to $540 for each contributed problem and solution pair

What’s Next?

  • Still unsure? Watch this quick video of what comes after you hit "Apply Now" below
  • If you're up for the task, apply below!

PLEASE NOTE: We collect, retain and use personal data for our professional business purposes, including notifying you of opportunities that may be of interest and sharing with our affiliates. We limit the personal data we collect to that which we believe is appropriate and necessary to manage applicants’ needs, provide our services, and comply with applicable laws. Any information we collect in connection with your application will be treated in accordance with the Outlier Privacy Policy and our internal policies and programs designed to protect personal data.

This is a 1099 contract opportunity on the Outlier.ai platform. Because this is a freelance opportunity, we do not offer internships, sponsorship, or employment. You must be authorized to work in your country of residence. If you are an international student, you may be able to sign up for Outlier if you are on a visa. You should contact your tax and/or immigration advisor with specific questions regarding your circumstances.

Top Skills

AI
Llms
Research

Similar Jobs

10 Hours Ago
Remote
Hybrid
India
Senior level
Senior level
Fintech • Professional Services • Consulting • Energy • Financial Services • Cybersecurity • Generative AI
The Business Analyst will drive the global implementation of corporate card products, manage procurement processes, and collaborate with stakeholders across various regions. This role includes project management, stakeholder engagement, and operational efficiency initiatives.
Top Skills: Erp SystemsOracle
15 Hours Ago
Remote
Bengaluru, Karnataka, IND
Senior level
Senior level
Cloud • Information Technology • Productivity • Security • Software • App development • Automation
As a Senior Data Engineer, you'll build scalable data products, ingest data into data lakes, improve data product efficiency, and architect self-serve capabilities.
Top Skills: AirflowAmazon Web ServicesDatabricksEmrJavaKinesisPythonRdsS3ScalaSparkSQLSqs
Yesterday
Remote
Hybrid
India
Senior level
Senior level
Fintech • Professional Services • Consulting • Energy • Financial Services • Cybersecurity • Generative AI
This role involves analyzing credit risk in banking, utilizing programming skills and financial frameworks, to provide insights and solutions.
Top Skills: PysparkPythonQlik SenseSQLTableau

What you need to know about the Vancouver Tech Scene

Raincouver, Vancity, The Big Smoke — Vancouver is known by many names, and in recent years, it has gained a reputation as a growing hub for both tech and sustainability. Renowned for its natural beauty, the city has become a magnet for professionals eager to create environmental solutions, and with an emphasis on clean technology, renewable energy and environmental innovation, it's attracted companies across various industries, all working toward a shared goal: advancing clean technology.

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account