Visual Representation Learning: Autoencoding Gaussian Splats

This repository explores Autoencoding Gaussian Splats for 2D image representation. Gaussian splatting has emerged as a powerful technique for modeling images and 3D scenes using parameterized Gaussians. In this project, we apply autoencoders to learn compact representations of images modeled as 2D Gaussian splats.

Project Goals

Construct a dataset of trained Gaussian splats using CIFAR-10.
Design and train an autoencoder to encode and reconstruct Gaussian splat representations.
Compare Gaussian splat autoencoding with traditional pixel-based autoencoding.

Methodology

We use gsplat to fit Gaussians to CIFAR-10 images, storing parameters like position, scale, rotation, opacity, and color.
Different autoencoder architectures (deep, convolutional, ResNet-based) are explored for encoding Gaussian splats.
Experiments analyze compression efficiency, reconstruction quality, and feature disentanglement.

Project Structure

.
├── configs/              # Configuration files for experiments
├── constants/            # Constants and transformation utilities
├── data/                 # Dataset scripts and preprocessed data
├── images/               # Visualization results (e.g., loss curves, comparisons)
├── logs/                 # Logs from different model training runs
├── models/               # Implementations of autoencoders and trainers
├── references/           # Reference implementations and utilities
├── report/               # LaTeX report files and compiled PDF
├── results/              # Experimental results and analysis
├── slurm/                # SLURM batch scripts for job scheduling
├── style/                # Custom matplotlib styles
├── submodules/           # External repositories (e.g., ResNet-18 autoencoder)
├── tests/                # Jupyter notebooks for experimenting with different setups
├── utils/                # Utility functions for data processing and visualization
├── example.ipynb         # Provided example from mentors
├── LICENSE               # License file
├── README.md             # Project documentation
└── requirements.txt      # Python dependencies

Getting Started

Clone the repository:

git clone https://github.com/mokot/visual-representation-learning.git
cd visual-representation-learning

Install dependencies:
```
pip install -r requirements.txt
```
Run experiments: Refer to tests/ folder for Jupyter notebooks with different experimental setups.

References

gsplat: Open-source library for Gaussian splatting (github.com)
CIFAR-10 Dataset: Standard benchmark dataset for image representation.

License

This project is licensed under the MIT License.

Authors

Both authors contributed equally to the conceptualization, research, and implementation of this project.

Rok Mokotar (LMU Munich) – Rok.Mokotar@campus.lmu.de
Federico Bernardo Harjes Ruiloba (LMU Munich) – f.harjes@campus.lmu.de

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Visual Representation Learning: Autoencoding Gaussian Splats

Project Goals

Methodology

Project Structure

Getting Started

References

License

Authors

About

Uh oh!

Releases

Packages

Uh oh!

Contributors 2

Uh oh!

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 170 Commits
configs		configs
constants		constants
data		data
images		images
models		models
references		references
report		report
results/basic_test		results/basic_test
slurm		slurm
style		style
submodules		submodules
tests		tests
utils		utils
.gitignore		.gitignore
.gitmodules		.gitmodules
LICENSE		LICENSE
README.md		README.md
example.ipynb		example.ipynb
requirements.txt		requirements.txt

License

mokot/visual-representation-learning

Folders and files

Latest commit

History

Repository files navigation

Visual Representation Learning: Autoencoding Gaussian Splats

Project Goals

Methodology

Project Structure

Getting Started

References

License

Authors

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors 2

Uh oh!

Languages

Packages