BParesh89/bigdata-resources


bigdata-resources

This repo collects resources on several big data technologies, including Hadoop, Spark, SQL, Hive, and Kafka.

hadoop tutorials

learningjournal playlist - youtube

Hadoop - tutorialspoint

youtube channels

Data Savvy

Tech with Viresh

DataMaking

LearningJournal

TrendyTech

LearnToSpark

Kal Wehner - kafka

itversity

AI Engineering

Talent Origin

file formats and their comparison

file formats in hadoop

file format - when and what to use

Row-oriented vs column oriented formats

Spark Performance optimization

apache spark best practices

https://www.bi4all.pt/en/news/en-blog/apache-spark-best-practices/ (includes how to calculate the number of executors)

Blogs

http://timepasstechies.com/row-oriented-column-oriented-file-formats-hadoop/

https://blog.matthewrathbone.com/

https://blog.clairvoyantsoft.com

Ashkrit blog

Spark on docker

Byte Size

Supergloo

SparkByExamples

madhukarphatak

MyItLearnings

Spark performance tuning - Expedia Medium blog

StreamBench - blog on Spark and Kafka streaming

Kafka-poc-project-udemy

8 non-obvious features in Spark SQL that are worth knowing

DevOps

Kubernetes & Docker

Data Modelling

Dimension fact model, star schema vs snowflake, lambda vs kappa architecture

Data Modelling

Interview resources

Hadoop interview questions

Andrew Kretz guide

Big Data

Big Data Stack interview collection

Another collection

Detailed Interview collection

Practical question with code

BigDataInterview.com

BigDataProgrammers Interview Q&A

Kafka Interview questions

Kafka interview questions 2

Scala interview questions

Scala interview questions - basic

Scala interview questions - intermediate

Scala interview questions - advanced

Python Interview questions-interviewbit

Python Interview questions-Edureka

Exercise - practice

Six spark exercises

Itversity Spark Exercises
