Hello, world

I’ve spent the last decade and a half working on search, recommendation, and — for the past few years — large language models. Along the way I’ve collected a lot of opinions, a lot of war stories, and a lot of half-finished notes that never made it out of my private scratchpads.

This site is where I’m going to start pushing some of those notes into the open. Expect posts on:

LLM pretraining and post-training at scale — GRPO, DPO, FSDP/DeepSpeed, multinode training on H100/H200 clusters.
Multimodal learning — vision-language models for search, retrieval, and grounded summarization.
Search and ranking — query understanding, relevance, and how LLMs reshape classical IR stacks.
Things I wish I’d known earlier — debugging distributed training, evaluating generative systems, managing ML teams.

If any of this resonates, you can find my publications here, my projects here, or just email me (link in the sidebar).

More soon.