Published in zdjohn | Metrics Dimension Metadata (MDM) Model Discussion | A data modelling proposal to simplify the data warehouse and pipeline process for data analytics/reporting/feature store management. | Jun 15, 2022
Published in Leading in tech | Why becoming a manager? | If you are a manager, I am sure you would have had similar conversations either in your 1:1 or during the interview. If you are considering… | Nov 2, 2021
Developing pyspark with jupyter notebooks brings a number of benefits: An interactive dev environment | Jan 4, 2021
pySpark Development environment setup | https://github.com/zdjohn/spark-setup-workshop | Nov 15, 2020
pySpark 3 Ubuntu 20.04 Installation | Quick notes for the upcoming pySpark 3 series | Nov 15, 2020
Published in zdjohn | Thinking in events — working with aws lambda and serverless architecture | Event sourcing is not a new concept. In the era of serverless architecture, events trigger nearly everything in lambda. | Sep 12, 2018