Blog | luminousmen
December 2025
Deep Dive into Spark Memory Management
The real reason your Spark cluster is burning money
Dec 23
•
luminousmen
[Uncompiled] Praise Your Data Team
“You can catch more flies with honey than with vinegar”
Dec 9
•
luminousmen
BigQuery Explained: What Really Happens When You Hit “Run”
What if we removed all the crap and just left SQL and hardware?
Dec 2
•
luminousmen
November 2025
What Do You Think?
The 4 Most Underrated Words in Leadership
Nov 25
•
luminousmen
Data Warehouse, Data Lake, Data Lakehouse, Data Mesh: What They Are and How They Differ
Collecting All the Pokemons
Nov 11
•
luminousmen
October 2025
[Uncompiled] Fivetran bought dbt Labs
Not partnered.
Oct 28
•
luminousmen
The Golden Age of dbt
How dbt Became the Backbone of Modern Data Engineering
Oct 21
•
luminousmen
Spark Partitions
How partitioning shapes Spark performance, and what to do when it doesn’t
Oct 14
•
luminousmen
How Not to Partition Data in S3 (And What to Do Instead)
Learn the pitfalls of partitioning data by date in S3
Oct 7
•
luminousmen
September 2025
AI Fluency Expectations
You AI or You Die
Sep 30
•
luminousmen
[Uncompiled] AI isn't objective. It never was.
There's no such thing as a neutral language model
Sep 24
•
luminousmen
Understanding Lakehouse Compaction
Why Tiny Files Break Big Data
Sep 16
•
luminousmen