- Ship agentic features on Stack Overflow's enterprise knowledge platform.
- Build evaluation frameworks that measure whether agent outputs are grounded, useful, and safe.
- Work across frontend, backend, prompt-level changes, and deployments.
Python · LLMs · RAG · TypeScript · Kubernetes · DataDog