Blog | luminousmen

Blog | luminousmen

Home
Notes
[Uncompiled]
Spark Under the Hood
About
Deep Dive into Spark Memory Management
The real reason your Spark cluster is burning money
Dec 23 • luminousmen
[Uncompiled] Praise Your Data Team
“You can catch more flies with honey than with vinegar”
Dec 9 • luminousmen
BigQuery Explained: What Really Happens When You Hit “Run”
What if we removed all the crap and just left SQL and hardware?
Dec 2 • luminousmen

November 2025

What Do You Think?
The 4 Most Underrated Words in Leadership
Nov 25 • luminousmen
Data Warehouse, Data Lake, Data Lakehouse, Data Mesh: What They Are and How They Differ
Collecting All the Pokemons
Nov 11 • luminousmen

October 2025

[Uncompiled] Fivetran bought dbt Labs
Not partnered.
Oct 28 • luminousmen
The Golden Age of dbt
How dbt Became the Backbone of Modern Data Engineering
Oct 21 • luminousmen
Spark Partitions
How partitioning shapes Spark performance, and what to do when it doesn’t
Oct 14 • luminousmen
How Not to Partition Data in S3 (And What to Do Instead)
Learn the pitfalls of partitioning data by date in S3
Oct 7 • luminousmen

September 2025

AI Fluency Expectations
You AI or You Die
Sep 30 • luminousmen
[Uncompiled] AI isn't objective. It never was.
There's no such thing as a neutral language model
Sep 24 • luminousmen
Understanding Lakehouse Compaction
Why Tiny Files Break Big Data
Sep 16 • luminousmen
© 2025 luminousmen · Privacy ∙ Terms ∙ Collection notice
Start your SubstackGet the app
Substack is the home for great culture