Enterprise ETL infrastructure processing 500K+ daily orders with automated compliance
Following Grubhub's acquisition of Tapingo, I led the complete modernization of the data infrastructure, migrating from legacy Python 2.7 systems to a modern cloud-native platform. The project involved rebuilding over 45 ETL pipelines using over 110 sources from MongoDB, MySQL, Salesforce API, and Zendesk API. This migration supported care, account managers, product, growth, marketing, finance, and logistics teams, including the migration of analytics for all teams. The modernization implemented automated NASDAQ reporting compliance and integrated campus order data that increased total reportable volume by 10%.
Grubhub's acquisition of Tapingo created a critical data integration challenge. Campus dining orders representing 10% of total volume were not included in financial reporting, creating NASDAQ compliance issues and understating the company's true market performance.
Multiple stakeholders (Finance, Product, Care, Campus, Partnerships) relying on inconsistent data
MySQL 5.7 to 8.0 migration with zero downtime requirements for 500K+ daily orders
NASDAQ reporting standards and CCPA data governance across 4 data marts
Peak 500K daily orders during school year, dropping to 50K in summer
I designed and built a standardized, configuration-driven ETL framework that eliminated code duplication across 110 pipelines while ensuring consistent data quality and compliance standards.
Executed a phased migration approach that maintained business continuity while modernizing the entire infrastructure from Python 2.7 to Python 3.9 with PySpark on AWS EMR.
Coordinated with Finance, Product, Care, Campus, and Partnership teams to ensure the new platform met diverse requirements while maintaining data consistency across all business units.
I specialize in modernizing legacy data infrastructure at scale. Let's discuss how I can help transform your data platform to drive measurable business outcomes.