CanadianDataGuy’s No Fluff Newsletter
Subscribe
Sign in
Home
Notes
Archive
About
Optimizing Delta Lake Tables: Liquid Clustering vs. Partitioning with Z-Order
This guide will help you decide on your clustering methodology in Lakehouse
Oct 3
•
CanadianDataGuy
Share this post
Optimizing Delta Lake Tables: Liquid Clustering vs. Partitioning with Z-Order
blogs.canadiandataguy.com
Copy link
Facebook
Email
Note
Other
September 2024
What is Delta Lake? How does it work?
Dive into the inner workings of Delta Lake, from ACID transactions to time travel, and see how it combines the best of data warehouses and data lakes.
Sep 19
•
CanadianDataGuy
7
Share this post
What is Delta Lake? How does it work?
blogs.canadiandataguy.com
Copy link
Facebook
Email
Note
Other
Build Your First Spark Streaming App with Stream-Batch Joins
Start your Spark Streaming journey with this beginner-friendly guide, featuring code for stream-batch joins and Delta table operations.
Sep 4
•
CanadianDataGuy
Share this post
Build Your First Spark Streaming App with Stream-Batch Joins
blogs.canadiandataguy.com
Copy link
Facebook
Email
Note
Other
August 2024
I Spent 5 Hours Reading the Original White Paper on Spark and RDDs: Here’s What I Learned
Understanding the origins of Apache Spark and the power of in-memory data processing.
Aug 31
•
CanadianDataGuy
4
Share this post
I Spent 5 Hours Reading the Original White Paper on Spark and RDDs: Here’s What I Learned
blogs.canadiandataguy.com
Copy link
Facebook
Email
Note
Other
Upgrading Spark Stream with New Checkpoints
Step-by-step instructions on implementing new checkpoints in Spark Stream applications during major upgrades, complete with practical, executable code…
Aug 30
•
CanadianDataGuy
3
Share this post
Upgrading Spark Stream with New Checkpoints
blogs.canadiandataguy.com
Copy link
Facebook
Email
Note
Other
Boost Your Spark Streaming Skills with This Checklist
Dive into this step-by-step checklist for Spark Streaming, covering foundational tips and advanced techniques to enhance your data processing…
Aug 26
•
CanadianDataGuy
1
Share this post
Boost Your Spark Streaming Skills with This Checklist
blogs.canadiandataguy.com
Copy link
Facebook
Email
Note
Other
Kafka Explained: Key Concepts and Architecture Principles
Learn the fundamentals of Kafka, including its architecture, core components, and how it handles real-time data processing with high throughput.
Aug 24
•
CanadianDataGuy
1
Share this post
Kafka Explained: Key Concepts and Architecture Principles
blogs.canadiandataguy.com
Copy link
Facebook
Email
Note
Other
May 2024
Top Strategies to Excel in 2024 Data Interviews
Discover how to prepare for data interviews with in-depth guides on SQL, Python, data modeling, and essential metrics for a successful data career.ccess
May 21
•
CanadianDataGuy
Share this post
Top Strategies to Excel in 2024 Data Interviews
blogs.canadiandataguy.com
Copy link
Facebook
Email
Note
Other
January 2024
Stream Anything on Databricks
Learn how to use Spark Streaming and Databricks Autoloader to efficiently process unsupported file formats, including a step-by-step guide and best…
Jan 4
•
CanadianDataGuy
1
Share this post
Stream Anything on Databricks
blogs.canadiandataguy.com
Copy link
Facebook
Email
Note
Other
Share
Copy link
Facebook
Email
Note
Other
This site requires JavaScript to run correctly. Please
turn on JavaScript
or unblock scripts