- Architected and scaled Partner API Service enabling broadcast and media partners to ingest live streams and retrieve real-time metadata; built a metadata orchestration layer to normalize, transform, and deliver AI-generated data in partner-specific formats, supporting production-critical workflows, handling 150K+ events/day at peak, and accelerating partner onboarding. - Implemented external integrations within the Publisher Service (Dropbox, OneDrive, Slack, etc.), automating clip delivery and reducing turnaround time for enterprise media workflows. - Designed the architecture for the match-schedule ingestion pipeline (push + pull modes), defining data flows and failure handling to support rapid onboarding at scale, covering 100+ tournaments, 1,000+ teams, and 15K+ players. - Owned P0/P1 production incidents across partner-facing systems, leading debugging, root-cause analysis, and postmortems; introduced structured on-call and RCA practices that reduced recurrence of partner-visible issues. - Led organization-wide logs, alerts, and monitoring migration from Datadog to Last9 with PagerDuty integration, designing PromQL-based indicators and alert routing to improve incident detection and response across revenue-critical services. - Designed and implemented a dual-write migration strategy to propagate collection-level data changes from MongoDB to PostgreSQL using SNS, achieving <100ms propagation latency, and built backfill pipelines to migrate historical data safely. - Stack: NestJS, PostgreSQL, Redis, SNS, SQS, Lambda, MongoDB.