About the job
Mercor connects elite creative and technical talent with leading AI research labs. Headquartered in San Francisco, our investors include Benchmark, General Catalyst, Peter Thiel, Adam D'Angelo, Larry Summers, and Jack Dorsey.
Position: AI Model Evaluation Specialist
Type: Contract
Compensation: $25–$35/hour
Commitment: 20 hours/week
Role Responsibilities
Write realistic prompts reflecting professional and consumer domain-specific guidance.Evaluate AI-generated responses for factual accuracy and practical usefulness.Identify fabricated claims and misleading reasoning in model outputs.Score and rank model responses using structured rubrics.Provide written justifications with specific evidence for evaluations.Qualifications
Must-Have
Professional experience applying domain expertise in a practitioner or advisory capacity.Familiarity with industry-specific standards, regulations, or clinical guidelines.Strong written communication and critical reasoning skills.Application Process (Takes 20–30 mins to complete)
Submit your resume to begin.Complete the Model Response Evaluation assessment.Resources & Support
For details about the interview process and platform information, please check: https://talent.docs.mercor.com/welcome/welcomeFor any help or support, reach out to: support@mercor.comPS: Our team reviews applications daily. Please complete your AI interview and application steps to be considered for this opportunity.
Read LessAbout the job
Mercor connects elite creative and technical talent with leading AI research labs. Headquartered in San Francisco, our investors include Benchmark, General Catalyst, Peter Thiel, Adam D'Angelo, Larry Summers, and Jack Dorsey.
Position: Software Engineer
Type: Contract
Compensation: $80–$110/hour
Location: Remote; MPK if possible for those who want to be onsite
Role Responsibilities
Design, build, and maintain scalable web applications using React/TypeScript and Python.Leverage LLM-based coding tools like Cursor, Copilot, and Claude to accelerate development velocity.Quickly iterate on new product ideas, going from 0 to 1 in short timeframes.Build robust APIs and integrate with various internal and external systems.Create interactive dashboards and data visualizations using tools like Plotly, D3, or Streamlit.Develop AI-native applications, engineer prompts, and work with LLM APIs.Qualifications
Must-Have
3+ years of experience building web applications with React/TypeScript and Python or similar technologies.Demonstrated experience using LLM-based coding tools to accelerate development.A track record of rapidly prototyping and shipping products.Proficiency in API design, systems integration, and data visualization.Strong proficiency in Python.Experience building AI-native applications, prompt engineering, or working with LLM APIs.Familiarity with the Model Context Protocol (MCP) or similar agentic frameworks (LangChain, AutoGen, CrewAI, etc.).Prior experience building workforce management, scheduling, or operations platforms.Compensation & Legal
Hourly contractorPursuant to the California Fair Chance Act and related ordinances, qualified applicants will be considered for assignment with arrest and conviction records.Application Process (Takes 20–30 mins to complete)
Upload resumeAI interview based on your resumeSubmit formResources & Support
For details about the interview process and platform information, please check: https://talent.docs.mercor.com/welcome/welcomeFor any help or support, reach out to: support@mercor.comPS: Our team reviews applications daily. Please complete your AI interview and application steps to be considered for this opportunity.
Read Less