We are the leading independent AI benchmarking and insights company. We support engineers and enterprises to understand AI capabilities and make critical decisions about their AI strategies. We are the go-to authority for understanding AI, from AI labs and enterprises to media, investors, and policymakers. Our benchmarks don't just measure the cutting edge of AI, they are actively shaping the frontier.
Our benchmarks and analysis are trusted by hundreds of thousands of users and are the go-to reference for leading AI labs including OpenAI, Google, Meta, NVIDIA and Anthropic, and major publications including the Wall Street Journal, Bloomberg, the Financial Times and The Economist.
We are a team of 30+, on track to double by mid-year, backed by Nat Friedman (Github, Meta), Daniel Gross (SSI), Andrew Ng (Google Brain, Deeplearning.ai, Amazon), Adam D’Angelo (Quora, Poe, OpenAI), Clem Delangue (Hugging Face) and other industry leaders.
We’re hiring a Solutions Engineer to manage our media generation benchmarking pipeline. You’ll run image and video generation evaluations, manage human preference studies, and serve as a technical point of contact for media generation model providers. This is a process-driven, operational role suited to someone who is detail-oriented, comfortable with Python, and can manage pipelines reliably day-to-day.
What You’ll Do
Generate image and video outputs across models according to standardized evaluation protocols
Set up and manage human preference evaluation studies, including study design, participant management, and quality control
Process and analyze preference vote data to produce benchmark results
Manage the end-to-end pipeline: from prompt execution through to published results
Serve as a technical point of contact for media generation model providers — communicating results, explaining methodology, and handling queries
Monitor data quality, flag anomalies, and ensure consistency across evaluation rounds
Maintain documentation of processes and configurations
Stay current with new image and video model releases
Tech stackPythonRequirements:
3+ years of experience in a technical, client-facing role such as support engineering (Mandatory)
Proficient in Python scripting and working with APIs (Mandatory)
Background in data analysis, research operations, or other process heavy roles (Mandatory)
Experience with AI, particularly image/video generation models (Nice-to-have)
Comfortable in a purely operational/execution-focused role, not a builder role (Mandatory)
Read LessWe are the leading independent AI benchmarking and insights company. We support engineers and enterprises to understand AI capabilities and make critical decisions about their AI strategies. We are the go-to authority for understanding AI, from AI labs and enterprises to media, investors, and policymakers. Our benchmarks don't just measure the cutting edge of AI, they are actively shaping the frontier.
Our benchmarks and analysis are trusted by hundreds of thousands of users and are the go-to reference for leading AI labs including OpenAI, Google, Meta, NVIDIA and Anthropic, and major publications including the Wall Street Journal, Bloomberg, the Financial Times and The Economist.
We are a team of 30+, on track to double by mid-year, backed by Nat Friedman (Github, Meta), Daniel Gross (SSI), Andrew Ng (Google Brain, Deeplearning.ai, Amazon), Adam D’Angelo (Quora, Poe, OpenAI), Clem Delangue (Hugging Face) and other industry leaders.
We’re hiring a Solutions Engineer to manage our media generation benchmarking pipeline. You’ll run image and video generation evaluations, manage human preference studies, and serve as a technical point of contact for media generation model providers. This is a process-driven, operational role suited to someone who is detail-oriented, comfortable with Python, and can manage pipelines reliably day-to-day.
What You’ll Do
Generate image and video outputs across models according to standardized evaluation protocols
Set up and manage human preference evaluation studies, including study design, participant management, and quality control
Process and analyze preference vote data to produce benchmark results
Manage the end-to-end pipeline: from prompt execution through to published results
Serve as a technical point of contact for media generation model providers — communicating results, explaining methodology, and handling queries
Monitor data quality, flag anomalies, and ensure consistency across evaluation rounds
Maintain documentation of processes and configurations
Stay current with new image and video model releases
Tech stackPythonRequirements:
3+ years of experience in a technical, client-facing role such as support engineering (Mandatory)
Proficient in Python scripting and working with APIs (Mandatory)
Background in data analysis, research operations, or other process heavy roles (Mandatory)
Experience with AI, particularly image/video generation models (Nice-to-have)
Comfortable in a purely operational/execution-focused role, not a builder role (Mandatory)
Read LessWe are the leading independent AI benchmarking and insights company. We support engineers and enterprises to understand AI capabilities and make critical decisions about their AI strategies. We are the go-to authority for understanding AI, from AI labs and enterprises to media, investors, and policymakers. Our benchmarks don't just measure the cutting edge of AI, they are actively shaping the frontier.
Our benchmarks and analysis are trusted by hundreds of thousands of users and are the go-to reference for leading AI labs including OpenAI, Google, Meta, NVIDIA and Anthropic, and major publications including the Wall Street Journal, Bloomberg, the Financial Times and The Economist.
We are a team of 30+, on track to double by mid-year, backed by Nat Friedman (Github, Meta), Daniel Gross (SSI), Andrew Ng (Google Brain, Deeplearning.ai, Amazon), Adam D’Angelo (Quora, Poe, OpenAI), Clem Delangue (Hugging Face) and other industry leaders.
We’re hiring a Solutions Engineer to manage our media generation benchmarking pipeline. You’ll run image and video generation evaluations, manage human preference studies, and serve as a technical point of contact for media generation model providers. This is a process-driven, operational role suited to someone who is detail-oriented, comfortable with Python, and can manage pipelines reliably day-to-day.
What You’ll Do
Generate image and video outputs across models according to standardized evaluation protocols
Set up and manage human preference evaluation studies, including study design, participant management, and quality control
Process and analyze preference vote data to produce benchmark results
Manage the end-to-end pipeline: from prompt execution through to published results
Serve as a technical point of contact for media generation model providers — communicating results, explaining methodology, and handling queries
Monitor data quality, flag anomalies, and ensure consistency across evaluation rounds
Maintain documentation of processes and configurations
Stay current with new image and video model releases
Tech stackPythonRequirements:
3+ years of experience in a technical, client-facing role such as support engineering (Mandatory)
Proficient in Python scripting and working with APIs (Mandatory)
Background in data analysis, research operations, or other process heavy roles (Mandatory)
Experience with AI, particularly image/video generation models (Nice-to-have)
Comfortable in a purely operational/execution-focused role, not a builder role (Mandatory)
Read LessWe are the leading independent AI benchmarking and insights company. We support engineers and enterprises to understand AI capabilities and make critical decisions about their AI strategies. We are the go-to authority for understanding AI, from AI labs and enterprises to media, investors, and policymakers. Our benchmarks don't just measure the cutting edge of AI, they are actively shaping the frontier.
Our benchmarks and analysis are trusted by hundreds of thousands of users and are the go-to reference for leading AI labs including OpenAI, Google, Meta, NVIDIA and Anthropic, and major publications including the Wall Street Journal, Bloomberg, the Financial Times and The Economist.
We are a team of 30+, on track to double by mid-year, backed by Nat Friedman (Github, Meta), Daniel Gross (SSI), Andrew Ng (Google Brain, Deeplearning.ai, Amazon), Adam D’Angelo (Quora, Poe, OpenAI), Clem Delangue (Hugging Face) and other industry leaders.
We’re hiring a Solutions Engineer to manage our media generation benchmarking pipeline. You’ll run image and video generation evaluations, manage human preference studies, and serve as a technical point of contact for media generation model providers. This is a process-driven, operational role suited to someone who is detail-oriented, comfortable with Python, and can manage pipelines reliably day-to-day.
What You’ll Do
Generate image and video outputs across models according to standardized evaluation protocols
Set up and manage human preference evaluation studies, including study design, participant management, and quality control
Process and analyze preference vote data to produce benchmark results
Manage the end-to-end pipeline: from prompt execution through to published results
Serve as a technical point of contact for media generation model providers — communicating results, explaining methodology, and handling queries
Monitor data quality, flag anomalies, and ensure consistency across evaluation rounds
Maintain documentation of processes and configurations
Stay current with new image and video model releases
Tech stackPythonRequirements:
3+ years of experience in a technical, client-facing role such as support engineering (Mandatory)
Proficient in Python scripting and working with APIs (Mandatory)
Background in data analysis, research operations, or other process heavy roles (Mandatory)
Experience with AI, particularly image/video generation models (Nice-to-have)
Comfortable in a purely operational/execution-focused role, not a builder role (Mandatory)
Read LessWe are the leading independent AI benchmarking and insights company. We support engineers and enterprises to understand AI capabilities and make critical decisions about their AI strategies. We are the go-to authority for understanding AI, from AI labs and enterprises to media, investors, and policymakers. Our benchmarks don't just measure the cutting edge of AI, they are actively shaping the frontier.
Our benchmarks and analysis are trusted by hundreds of thousands of users and are the go-to reference for leading AI labs including OpenAI, Google, Meta, NVIDIA and Anthropic, and major publications including the Wall Street Journal, Bloomberg, the Financial Times and The Economist.
We are a team of 30+, on track to double by mid-year, backed by Nat Friedman (Github, Meta), Daniel Gross (SSI), Andrew Ng (Google Brain, Deeplearning.ai, Amazon), Adam D’Angelo (Quora, Poe, OpenAI), Clem Delangue (Hugging Face) and other industry leaders.
We’re hiring a Solutions Engineer to manage our media generation benchmarking pipeline. You’ll run image and video generation evaluations, manage human preference studies, and serve as a technical point of contact for media generation model providers. This is a process-driven, operational role suited to someone who is detail-oriented, comfortable with Python, and can manage pipelines reliably day-to-day.
What You’ll Do
Generate image and video outputs across models according to standardized evaluation protocols
Set up and manage human preference evaluation studies, including study design, participant management, and quality control
Process and analyze preference vote data to produce benchmark results
Manage the end-to-end pipeline: from prompt execution through to published results
Serve as a technical point of contact for media generation model providers — communicating results, explaining methodology, and handling queries
Monitor data quality, flag anomalies, and ensure consistency across evaluation rounds
Maintain documentation of processes and configurations
Stay current with new image and video model releases
Tech stackPythonRequirements:
3+ years of experience in a technical, client-facing role such as support engineering (Mandatory)
Proficient in Python scripting and working with APIs (Mandatory)
Background in data analysis, research operations, or other process heavy roles (Mandatory)
Experience with AI, particularly image/video generation models (Nice-to-have)
Comfortable in a purely operational/execution-focused role, not a builder role (Mandatory)
Read LessClear signal of excellence: whether that is a top school, top companies or high growth startups, participating and winning hackathons, etc. (Mandatory)
5 - 8 years of experience of full stack engineering experience (senior+ level) (Mandatory)
Part of a team that experienced high growth and building scalable software (Mandatory)
Experience as an IC (Individual Contributor) writing code, not an Engineering Manager (Mandatory)
React.js w/ Typescript (Nice-to-have)
Computer Science / Computer Engineering / Math / STEM Degree from Top University (Mandatory)
Comfortable working across the full-stack (Mandatory)
Experience in Python (FastAPI, concurrency, asyncio) (Nice-to-have)
Experience in Cloud / DevOps (Azure, Kubernetes, Docker) (Nice-to-have)
Great frontend design sense, experience in Figma (Nice-to-have)
We are looking for a creative back-end focused engineer to help us architect, build, and maintain our increasingly dynamic web creative tooling experiences . You are a driven developer who enjoys critically thinking about the best ways to build a new product, take full ownership in creating a brand new extensible codebase, and bring fresh new ideas to our experience. You’ll build and own core platform services, design scalable APIs, shape our data infrastructure, and contribute across the stack when needed. You’re an endless experimenter with a focus on AI coding tools, agents, and automation to amplify engineering velocity. Finally and most importantly, you love collaborating with a team of talented domain experts and relish startup pace, ambiguity, and iterating through problems.
What you'll do:Build and maintain core backend services, APIs, and data layers that power the TITLES experience.
Partner with product, design, and data to define and deliver features end-to-end.
Design scalable, secure, and observable systems capable of supporting rapid user growth.
Contribute across the stack, including frontend applications (e.g. React) with a focus on user experience.
Drive CI/CD, infrastructure automation, and developer tooling to improve engineering velocity.
Leverage and experiment with AI-assisted coding tools, agents, and workflows to accelerate delivery, quality, and testing.
Help evolve our architecture with strong opinions on modular design, APIs, and service boundaries.
Your skills:4+ years of professional engineering experience at startups where you’ve shipped product and taken ownership from idea to reality.
Strong backend experience in Node.js / TypeScript with real practice building APIs and services.
Proficiency with modern databases (e.g., PostgreSQL), data modeling, and performance optimization.
Experience with front-end technologies (e.g., React, modern JS) and ability to deliver across the full stack.
Comfortable with cloud infrastructure (AWS / GCP / serverless / containers) and CI/CD pipelines.
A history of using, customizing, and integrating AI coding tools, code generation, agents, or developer automation in production workflows.
Excellent communicator who can articulate technical trade-offs and collaborate across disciplines.
Startup DNA: you enjoy autonomy, ownership, and meaningful impact.
Compensation & PackageWe pay well and give our team members skin in the game with equity. In addition we provide quality medical, dental, and vision insurance along with flexible time off and the equipment you need to be successful.
Read LessWe are looking for a creative back-end focused engineer to help us architect, build, and maintain our increasingly dynamic web creative tooling experiences . You are a driven developer who enjoys critically thinking about the best ways to build a new product, take full ownership in creating a brand new extensible codebase, and bring fresh new ideas to our experience. You’ll build and own core platform services, design scalable APIs, shape our data infrastructure, and contribute across the stack when needed. You’re an endless experimenter with a focus on AI coding tools, agents, and automation to amplify engineering velocity. Finally and most importantly, you love collaborating with a team of talented domain experts and relish startup pace, ambiguity, and iterating through problems.
What you'll do:Build and maintain core backend services, APIs, and data layers that power the TITLES experience.
Partner with product, design, and data to define and deliver features end-to-end.
Design scalable, secure, and observable systems capable of supporting rapid user growth.
Contribute across the stack, including frontend applications (e.g. React) with a focus on user experience.
Drive CI/CD, infrastructure automation, and developer tooling to improve engineering velocity.
Leverage and experiment with AI-assisted coding tools, agents, and workflows to accelerate delivery, quality, and testing.
Help evolve our architecture with strong opinions on modular design, APIs, and service boundaries.
Your skills:4+ years of professional engineering experience at startups where you’ve shipped product and taken ownership from idea to reality.
Strong backend experience in Node.js / TypeScript with real practice building APIs and services.
Proficiency with modern databases (e.g., PostgreSQL), data modeling, and performance optimization.
Experience with front-end technologies (e.g., React, modern JS) and ability to deliver across the full stack.
Comfortable with cloud infrastructure (AWS / GCP / serverless / containers) and CI/CD pipelines.
A history of using, customizing, and integrating AI coding tools, code generation, agents, or developer automation in production workflows.
Excellent communicator who can articulate technical trade-offs and collaborate across disciplines.
Startup DNA: you enjoy autonomy, ownership, and meaningful impact.
Compensation & PackageWe pay well and give our team members skin in the game with equity. In addition we provide quality medical, dental, and vision insurance along with flexible time off and the equipment you need to be successful.
Read LessWe are looking for a creative back-end focused engineer to help us architect, build, and maintain our increasingly dynamic web creative tooling experiences . You are a driven developer who enjoys critically thinking about the best ways to build a new product, take full ownership in creating a brand new extensible codebase, and bring fresh new ideas to our experience. You’ll build and own core platform services, design scalable APIs, shape our data infrastructure, and contribute across the stack when needed. You’re an endless experimenter with a focus on AI coding tools, agents, and automation to amplify engineering velocity. Finally and most importantly, you love collaborating with a team of talented domain experts and relish startup pace, ambiguity, and iterating through problems.
What you'll do:Build and maintain core backend services, APIs, and data layers that power the TITLES experience.
Partner with product, design, and data to define and deliver features end-to-end.
Design scalable, secure, and observable systems capable of supporting rapid user growth.
Contribute across the stack, including frontend applications (e.g. React) with a focus on user experience.
Drive CI/CD, infrastructure automation, and developer tooling to improve engineering velocity.
Leverage and experiment with AI-assisted coding tools, agents, and workflows to accelerate delivery, quality, and testing.
Help evolve our architecture with strong opinions on modular design, APIs, and service boundaries.
Your skills:4+ years of professional engineering experience at startups where you’ve shipped product and taken ownership from idea to reality.
Strong backend experience in Node.js / TypeScript with real practice building APIs and services.
Proficiency with modern databases (e.g., PostgreSQL), data modeling, and performance optimization.
Experience with front-end technologies (e.g., React, modern JS) and ability to deliver across the full stack.
Comfortable with cloud infrastructure (AWS / GCP / serverless / containers) and CI/CD pipelines.
A history of using, customizing, and integrating AI coding tools, code generation, agents, or developer automation in production workflows.
Excellent communicator who can articulate technical trade-offs and collaborate across disciplines.
Startup DNA: you enjoy autonomy, ownership, and meaningful impact.
Compensation & PackageWe pay well and give our team members skin in the game with equity. In addition we provide quality medical, dental, and vision insurance along with flexible time off and the equipment you need to be successful.
Read LessWe are looking for a creative back-end focused engineer to help us architect, build, and maintain our increasingly dynamic web creative tooling experiences . You are a driven developer who enjoys critically thinking about the best ways to build a new product, take full ownership in creating a brand new extensible codebase, and bring fresh new ideas to our experience. You’ll build and own core platform services, design scalable APIs, shape our data infrastructure, and contribute across the stack when needed. You’re an endless experimenter with a focus on AI coding tools, agents, and automation to amplify engineering velocity. Finally and most importantly, you love collaborating with a team of talented domain experts and relish startup pace, ambiguity, and iterating through problems.
What you'll do:Build and maintain core backend services, APIs, and data layers that power the TITLES experience.
Partner with product, design, and data to define and deliver features end-to-end.
Design scalable, secure, and observable systems capable of supporting rapid user growth.
Contribute across the stack, including frontend applications (e.g. React) with a focus on user experience.
Drive CI/CD, infrastructure automation, and developer tooling to improve engineering velocity.
Leverage and experiment with AI-assisted coding tools, agents, and workflows to accelerate delivery, quality, and testing.
Help evolve our architecture with strong opinions on modular design, APIs, and service boundaries.
Your skills:4+ years of professional engineering experience at startups where you’ve shipped product and taken ownership from idea to reality.
Strong backend experience in Node.js / TypeScript with real practice building APIs and services.
Proficiency with modern databases (e.g., PostgreSQL), data modeling, and performance optimization.
Experience with front-end technologies (e.g., React, modern JS) and ability to deliver across the full stack.
Comfortable with cloud infrastructure (AWS / GCP / serverless / containers) and CI/CD pipelines.
A history of using, customizing, and integrating AI coding tools, code generation, agents, or developer automation in production workflows.
Excellent communicator who can articulate technical trade-offs and collaborate across disciplines.
Startup DNA: you enjoy autonomy, ownership, and meaningful impact.
Compensation & PackageWe pay well and give our team members skin in the game with equity. In addition we provide quality medical, dental, and vision insurance along with flexible time off and the equipment you need to be successful.
Read Less