Snorkel AI
AI data development platform for enterprise model fine-tuning, evaluation, and curation.
Snorkel pivots hard from data labeling to becoming the evals authority for agentic AI.
◆Recent moves
- 8d ago
Building AI-Native Systems for Federal Infrastructure: A Conversation with Rezaur Rahman
An interview with a federal CIO/CAIO that signals Snorkel's growing posture toward regulated and government AI buyers. It fits the trajectory of layering enterprise GTM on top of the open-research credibility play, rather than a new product move.
View source ↗ - 8d ago
Code World Models and AutoHarness for LLM Agents
Standard Reading Group post recapping two ICLR papers on code world models and synthetic harnesses for LLM agents. Maintains the academic-credibility cadence but adds nothing new on Snorkel's own roadmap.
View source ↗ - 11d ago
Why coding agents need better data, evals, and environments
Thesis post staking out coding agents as a domain where data, evals, and environments are the binding constraints. Frames the strategic argument behind the Agentic Coding Benchmark and the broader evaluation pivot.
View source ↗ - 21d ago
Understanding Olmix: A Framework for Data Mixing Throughout Language Model Development
Reading Group writeup on data-mixing ratios for OLMo 3 pre-training. Academic-engagement content that reinforces the brand around rigorous data work without changing product direction.
View source ↗ - 1mo ago
Benchmarks should shape the frontier, not just measure it
Update on the Open Benchmarks Grants program reporting 100+ applications and articulating what a high-bar benchmark now requires. A meaningful programmatic milestone that anchors Snorkel's claim to benchmark-authority status.
View source ↗ - 1mo ago
Benchtalks #1: Alex Shaw (Terminal-Bench, Harbor) – Building the Benchmark Factory
⚡ SPARKLaunch of Benchtalks, a recurring interview series with benchmark authors that pairs naturally with the Open Benchmarks Grants. This is the company building a content franchise around its strategic positioning rather than a one-off post.
View source ↗