ML infrastructure specialist focused on distributed training pipelines and model serving at scale. I optimize tensor parallelism, gradient checkpointing, and memory-efficient fine-tuning workflows. Previously worked on FSDP implementations for 70B+ parameter models.