The recruiting industry stopped innovating years ago. Bloated platforms, slow workflows, zero intelligence. Staffer is what happens when you rebuild hiring tools from scratch with AI at the core. It learns your company, writes roles, searches 850M+ profiles, scores candidates, and handles outreach, with full transparency for candidates. Legacy ATS vendors had their run. We're taking their market.
You will design the systems that define quality.
Shape the metrics behind sourcing intelligence and the evaluation engine behind our LLM agents, turning subjective feedback into measurable truth and driving continuous improvement across the platform.
Design and Own Sourcing Metrics
You will turn subjective product feedback into structured, quantitative signals that drive improvement.
Partner with product and engineering to define sourcing quality metrics (relevance, match accuracy, diversity)
Build and validate measurement frameworks from scratch
Translate qualitative feedback into SQL-based metrics
Communicate metrics and analytic logic across teams
Own the feedback metric business insight loop
Build LLM Evaluation Systems
We run multiple LLM-driven agents and need a systematic way to track quality and improvement.
Define evaluation matrices and success criteria (hallucination rates, tool accuracy, consistency)
Implement evaluation frameworks using Langfuse (or similar tools)
Build monitoring, baselines, and continuous improvement processes
Contribute Technically
Write production Clojure / ClojureScript where needed
Collaborate with senior full-stack engineers
Maintain high code standards and quality
Expert-level PostgreSQL and SQL optimization skills
Experience designing end-to-end metrics frameworks from ambiguous requirements
Strong data modeling skills
Experience with LLM systems in production and evaluation methodologies
Familiarity with Langfuse or similar LLM operations tools preferred
Excellent communicator with the ability to translate technical logic to product and engineering stakeholders
Experience with semantic search (embeddings, vector databases)
Machine learning experience
Experience in recruiting or HR-tech domain
Month 1: Defined and implemented sourcing metrics framework
Month 2: Product team using structured metrics; LLM evaluation baseline established
Month 3: Clear improvements measured in sourcing quality and agent performance
You are a systems thinker who understands how to go from ambiguity to a working measurement system
You are a translator between product intuition and engineering precision
You own problems end-to-end
You ship pragmatic working solutions and iterate based on data
A pure BI analyst
A research-only ML scientist
A prompt tinkerer without systematic evaluation experience
A performance-only engineering specialist
Staffer.ai is part of the Vergence Group - a remote-first, product-centric engineering org with a strong engineering culture and Clojure DNA. You’ll join a small, senior team where your work has direct impact on product outcomes.
We're building something that will change how companies hire, and we need exceptional people to do it.
Remote-first setup with real flexibility.
Competitive pay with equity so you win when we win.
Every tool you need on us.
Freedom to own your domain completely. No hand holding, just impact.