Second day of Q&A around Data Mesh with IBM’s Technical Group about “ten lessons learned from building a Data Mesh.”

Standing in front of the convention center, next to the statue of Sir Walter Raleigh. On September 15th, 2021, after more than 18 months, I was finally able to give…

Drsti (pronounced drishti) is an effortless data visualization that interfaces easily with Apache Spark

Spark in Action, second edition is a favorite for the Big Bag Theory gang Spark in Action, second edition, has been out for about a month and was running a…

Despite 2020 being a mess so far, and after a very calm period in terms of events, it’s time to get back on stage. July 2020 is going to be…

Apache Spark v3.0.0 hits the road, let’s celebrate! Apache Spark v3.0.0 has been released on June 18th, 2020, just before Spark + AI Summit 2020, which is being held virtually…

In this episode, you will learn about doing a basic ETL (extract, transform, and load) operation using Apache Spark. You will load a basic CSV file with Apache Spark, make…

Starting today, I will host a weekly live show about data. You may join, attend “live,” and ask questions as I go through a data-oriented topic. For now, the topic…

I just wanted to share with you the latest update on Spark in Action, second edition What’s new? Chapter 12, “Transforming your data” Chapter 13, “Transforming entire documents” Appendix K,…