Pavlov's List

· RL environment startups. For the RL-pilled.

Status: Draft. Suggestions: @chrisbarber or email. Note: Larger providers like Surge, Handshake, Mercor, Micro1, and Turing also offer RL environments.

Company Domain Team Background Website Active Founders
AfterQuery Code Finance Wharton, Penn, UBC afterquery.com Carlos Georgescu (@CarlosGeorgescu), Spencer Mateega (@spencermateega), Danny Tang
BenchFlow Code Terminal, Red Hat benchflow.ai Xiangyi Li (@xdotli)
Bespoke Labs Enterprise Google DeepMind, UC Berkeley bespokelabs.ai Mahesh Sathiamoorthy (@madiator), Alex Dimakis (@AlexGDimakis)
Calaveras Code Magic, Google calaveras.ai Sophia Wisdom (@cis_female), Alana Xiang
Cua Computer Use Microsoft cua.ai Francesco Bonacci (@francedot)
Collinear Enterprise Hugging Face, Salesforce collinear.ai Nazneen Rajani (@nazneenrajani), Soumyadeep Bakshi (@soumyadeepb_)
dmodel ML Alignment OpenAI, Google Brain, EleutherAI dmodel.ai Anish Tondwalkar (@dlbydq), Daniel Moon (@dmooooon)
Datacurve Code Waterloo datacurve.ai Serena Ge (@serenaa_ge), Charley Lee (@charleyslee)
Deeptune Enterprise Hebbia deeptune.com Tim Lupo (@timlup)
Fleet AI Enterprise Mercor, Anthropic, Essential AI fleetai.com Nicolai Ouporov (@nicolas_ouporov)
General Reasoning Long Horizon Meta, Conjecture, Aleph Alpha gr.inc Ross Taylor (@rosstaylor90), Chengxi Taylor (@ChengxiTaylor), Kip Parker
Halluminate Long Horizon Finance Capital One, Meta, Cornell halluminate.ai Jerry Wu (@Jerr_Wu), Wyatt Marshall (@wgm752)
Habitat Code Computer Use Jane Street, Arrowstreet, Ramp habitat.inc Maxim Enis (@maxim_enis), Max Kan (@maxkan_), Andrew Megalaa (@AndrewMegalaa)
Haladir Code Math Carnegie Mellon, Princeton, UVA haladir.com Jibran Hutchins (@jibranhutch), Quan Huynh (@quanmhuynh), Preston Schmittou (@preston281s), Joseph Tso (@josephtso914)
Hillclimb Math DeepMind, Base hillclimb.com Jun Park (@jparkjmc), Ibrakhim Ustelbay (@agithief)
Idler Code Meta, Microsoft, Cornell idler.ai Ivan Chub (@chubivan), Nalu Concepcion (@naluconcepcion), Tony Goss (@cha0sg0d_)
Matrices Browser Weights & Biases, Adept matrices.ai John Qian (@johnlqian), Leonardo Axel Setyanto (@laxels25)
Mechanize Code Epoch AI mechanize.work Tamay Besiroglu (@tamaybes), Ege Erdil (@EgeErdil2), Matthew Barnett (@MatthewJBar)
Metaphi Enterprise Waymo, Nuro, Netflix metaphi.ai Abhishek Chandwani (@abhi_chandwani), Ishan Gupta (@Ishan345)
Phinity Chip Design NVIDIA, AWS, Stanford phinity.ai Sonya Jin (@sonyashijin), Aadi Nashikkar (@aadi_nash)
Plato Browser Enterprise MultiOn, Anyscale, Georgia Tech plato.so Rob Farlow (@RobFarlow), Pranav Putta (@pranav__putta)
Preference Model ML Capabilities Datology preferencemodel.com Stealth
Proximal Code Prime Intellect, Cursor proximal.ai Justus Mattern (@MatternJustus), Navid Pour (@navidkpr), Calvin Chen (@calvinchen)
Sepal AI Science Turing, Bain, McKinsey sepalai.com Robi Lin (@robi_lin), Kat Hu (@KatQHu1)
Stealth Code Enterprise Scale, Mercor N/A N/A
Theta Enterprise DeepSilicon, Cornell, MultiOn thetasoftware.com Rayan Garg (@RayanGarg), Tanmay Sharma (@tsha444), Gurvir Singh (@_gurvir_)
The LLM Data Company Enterprise Traba llmdata.com Daanish Khazi (@bertgodel), Gavin Bains (@thegavinbains), Joseph Besgen
Trajectory Labs Alignment METR, GovAI, 80000 Hours trajectorylabs.net Peter McIntyre (@pmcntyr), Ben West
Verita AI Design HRT, Jane Street, Mercor verita-ai.com Rithika Kacham (@rithika_ka), Rishi Kalakuntla (@rishiinsf)
Vmax ML Capabilities UCL vmax.ai Matthew Sargent (@matthewjsargent), Augustine Mavor-Parker (@MavorParker)