Projects

A curated set of ML systems and tools with engineering details.

2026

LLM Evals Workbench

A reproducible framework for prompt/model evaluation with scenario-driven scoring and regression tracking.

2025

RAG Ops Toolkit

A retrieval pipeline toolkit for indexing, re-ranking, and latency profiling under production constraints.

2024

Feature Store Sanity Checks

Validation jobs and drift alerts to keep online and offline features aligned.