Looks like the .git folder is really big:
$ du -sh data8assets/.git
1.1G data8assets/.git
This is probably because git keeps every version of every file in its history, so big datasets that were added and later removed still take up space.
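A minimal sketch of that effect, in a throwaway repo rather than data8assets itself. The temp directory, the `dataset.bin` file name, and the 5 MB size are placeholders for illustration only:

```shell
# Throwaway repo standing in for data8assets (hypothetical, created under /tmp).
repo=$(mktemp -d)
cd "$repo"
git init -q .
git config user.email "staff@example.com"
git config user.name "Staff"

# Add a 5 MB incompressible file as a stand-in for a big dataset, then remove it.
head -c 5242880 /dev/urandom > dataset.bin
git add dataset.bin
git commit -q -m "Add dataset"
git rm -q dataset.bin
git commit -q -m "Remove dataset"

# The working tree no longer contains the file, but .git still stores its blob.
ls -A
du -sk .git
```

The file is gone from the checkout, yet `du` should still report roughly 5 MB under `.git`, because the first commit keeps the blob reachable.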
Students had trouble downloading their work because tar archived the .git folder too. For now I've told them to run tar on the data8assets/materials folder instead (~150 MB).
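Roughly what that workaround looks like, sketched against a stand-in directory tree. The temp directory, file names, and archive names are made up for the example, and the `--exclude` variant is an alternative I'm assuming would also work, not what students were told to run:

```shell
# Build a small stand-in for the data8assets layout (hypothetical contents).
workdir=$(mktemp -d)
mkdir -p "$workdir/data8assets/.git" "$workdir/data8assets/materials"
echo "history" > "$workdir/data8assets/.git/objects.pack"
echo "notebook" > "$workdir/data8assets/materials/lab01.ipynb"

# Workaround from above: archive only the materials folder, so .git is never included.
tar -czf "$workdir/materials.tar.gz" -C "$workdir/data8assets" materials

# Alternative: archive the whole checkout but skip any entry named .git.
tar --exclude='.git' -czf "$workdir/all.tar.gz" -C "$workdir" data8assets

# List both archives to confirm what was captured.
tar -tzf "$workdir/materials.tar.gz"
tar -tzf "$workdir/all.tar.gz"
```

Neither listing should show a `.git` entry.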
Since git clone downloads the full history, next semester we'll have 500 students each downloading a >1 GB repo, which seems wasteful.
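For comparison, here is a small sketch of how much history a default clone pulls versus a shallow one (`git clone --depth 1`). This is only an illustration on a throwaway repo with made-up contents, not something we currently do:

```shell
# Throwaway source repo with three commits (hypothetical stand-in for the real repo).
src=$(mktemp -d)
dst=$(mktemp -d)
git init -q "$src"
git -C "$src" config user.email "staff@example.com"
git -C "$src" config user.name "Staff"
for i in 1 2 3; do
  echo "version $i" > "$src/notes.txt"
  git -C "$src" add notes.txt
  git -C "$src" commit -q -m "Revision $i"
done

# A default clone fetches every commit in the history.
git clone -q "file://$src" "$dst/full"

# A shallow clone fetches only the most recent commit.
# (file:// is used because --depth is ignored for plain local-path clones.)
git clone -q --depth 1 "file://$src" "$dst/shallow"

# Count the commits reachable in each clone.
git -C "$dst/full" rev-list --count HEAD
git -C "$dst/shallow" rev-list --count HEAD
```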
We can avoid this by making a new repo for each semester (materials_sp17, materials_fa17, etc.), or by squashing the history of this repo. I prefer the first option since it avoids rewriting git history, which makes git pull more complicated for the staff.
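A rough sketch of the first option, assuming the new repo is seeded with just the current contents of the materials folder. The temp directory, the sample notebook, and the commit message are placeholders; only the materials_sp17 name comes from the proposal above:

```shell
# Stand-in for the existing checkout (hypothetical contents).
work=$(mktemp -d)
mkdir -p "$work/data8assets/materials"
echo "lab" > "$work/data8assets/materials/lab01.ipynb"

# Copy only the current materials (no .git) into a new per-semester directory.
mkdir "$work/materials_sp17"
cp -R "$work/data8assets/materials/." "$work/materials_sp17/"

# Start a fresh history containing a single import commit.
cd "$work/materials_sp17"
git init -q .
git config user.email "staff@example.com"
git config user.name "Staff"
git add -A
git commit -q -m "Initial import of sp17 materials"

# The new repo has one commit and none of the old dataset history.
git rev-list --count HEAD
```

The old repo is left untouched, so nobody on staff has to deal with rewritten history when they pull.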