Codility

Name: Codility
Brand: Codility

Velocity2.5

Technical recruiting and coding assessments

www.codility.com ↗

Codility rebuilds technical assessment around the AI-era engineer

technical-assessmentai-skillsbenchmarkinghiringcode-qualityintegrity

◆Current state

Codility has reoriented its technical-assessment platform around AI on two fronts: measuring how candidates work with AI (an AI Copilot inside interviews, an AI Readiness assessment framework) and establishing authority on evaluating AI-generated code (the COMPASS benchmark). Recent posts also cover the defensive side — detecting AI-assisted cheating and holding to SIOP validity standards.

◆Where it's heading

The company is shifting from testing raw coding skill toward measuring AI-era engineering skill, betting that judgment about AI collaboration and code quality is the durable value as raw code generation commoditizes. COMPASS doubles as research credibility and a positioning moat.

◆Prediction

Expect AI Copilot and AI-readiness assessments to move from blog and research framing into packaged product offerings, and COMPASS to expand its problem set or get repositioned as a buyer-facing tool for comparing models.

◆Recent moves

1mo ago
How we hold ourselves to the SIOP standard for AI-based assessment
A conference-recap post on holding AI-based assessment to SIOP validity standards. Thought-leadership positioning that reinforces the AI-assessment arc but isn't itself a product change.
View source ↗
4mo ago
AI Readiness Assessment: Does Your Team Have the Skills They Need?
Introduces an AI Readiness assessment framework spanning four skill areas (AI Literacy, Evaluation, Application, Building) to find team skill gaps — a concrete extension of Codility's pivot toward measuring AI-era skills.
View source ↗
5mo ago
COMPASS: A Better Way to Evaluate AI Code Generation
⚡ SPARK
COMPASS is the research pillar of Codility's AI strategy: a benchmark that judges AI-generated code on correctness, efficiency, and quality, anchored to 393,150 human submissions across 50 problems — establishing Codility as an authority on evaluating, not just running, AI code.
View source ↗
5mo ago
Most AI Code Benchmarks Miss the Point. COMPASS Doesn’t.
The companion research announcement for COMPASS, arguing existing AI code benchmarks only test whether code works, not whether it scales under load or stays maintainable. Reinforces the same benchmark initiative rather than adding a separate capability.
View source ↗
5mo ago
Detecting AI Cheating: How to Protect Technical Assessment Integrity
Addresses the defensive flip side of AI in hiring: detecting AI-generated code, verifying candidate identity, and protecting assessment integrity. A real capability area that complements the AI-skills push.
View source ↗
8mo ago
AI Copilot: Assess AI Skills During Codility Technical Interviews
⚡ SPARK
AI Copilot brings real-time AI-assisted coding into Codility technical interviews, letting employers assess how candidates work with AI rather than only their unaided coding — the clearest expression of the platform's pivot.
View source ↗

Codility rebuilds technical assessment around the AI-era engineer

◆Recent moves

How we hold ourselves to the SIOP standard for AI-based assessment

AI Readiness Assessment: Does Your Team Have the Skills They Need?

COMPASS: A Better Way to Evaluate AI Code Generation

Most AI Code Benchmarks Miss the Point. COMPASS Doesn’t.

Detecting AI Cheating: How to Protect Technical Assessment Integrity

AI Copilot: Assess AI Skills During Codility Technical Interviews