As an AI Researcher at AIUC, you will develop and expand the evaluation methods that sit at the heart of our work. You will identify the most pressing problems in our evaluation stack, scope and lead projects to address them, and push the frontier of what rigorous AI assessment looks like.

Your work spans three horizons. On the product side, you'll improve our scale and accuracy by building better LLM judges, tightening our pipelines, and making our evaluations faster and more reliable. On the research side, you'll deepen the quality of what we evaluate by designing new attack vectors, implementing techniques from the latest research, and building agentic automations that extend our capabilities. And on the longer horizon, you'll take on moonshot projects such as fully dynamic attackers, self-expanding libraries of attacks, and novel approaches to evaluation that don't yet exist.

From the outset you will:

Identify and scope the highest-leverage problems in our evaluation system, then lead projects end-to-end to address them.
Build novel approaches to AI evaluation by implementing research papers, replicating attack techniques, and experimenting with new methods.
Lead and coordinate research teams, managing complex multi-person projects with clear ownership and delivery.
Communicate findings internally and externally through technical blog posts, papers, and direct engagement with client partners.
Feed insights back to the product, shaping our roadmap based on what you learn on the frontier.


See more open positions at Artificial Intelligence Underwriting Company
