This repo contains resources related to several big data technologies like hadoop, spark, sql, hive , kafka etc
learningjournal playlist - youtube
file format - when and what to use
Row-oriented vs column oriented formats
https://www.bi4all.pt/en/news/en-blog/apache-spark-best-practices/ -- includes calculating number of executors
http://timepasstechies.com/row-oriented-column-oriented-file-formats-hadoop/
https://blog.matthewrathbone.com/
https://blog.clairvoyantsoft.com
spark performance tuning- Expedia medium blog
StreamBench-spark & kafka streaming --> Blog for spark and kafka streaming
8 non-obvious features in Spark SQL that are worth knowing
Dimension fact model, star schema vs snowflake, lambda vs kappa architecture
Big Data Stack interview collection
BigDataProgrammers Interview Q&A
Scala interview questions -basic
Scala interview questions - intermediate
Scala interview questions - advanced
Python Interview questions-interviewbit