cocogold

Marigold-based, text-grounded CoCo Segmentation

This is just the dataset preparation code for the cocogold project. The training code lives here.

Usage

Installation

Install from the GitHub repository:

$ pip install git+https://github.com/pcuenca/cocogold.git

How to use

Download a copy of the COCO 2017 dataset:

uv pip install huggingface-hub

huggingface-cli download --local-dir coco-2017 pcuenq/coco-2017-mirror

cd coco-2017
unzip annotations_trainval2017.zip
rm annotations_trainval2017.zip

mkdir images
unzip val2017.zip -d images
rm val2017.zip
ln -s val2017 images/val

unzip train2017.zip -d images
rm train2017.zip
ln -s train2017 images/train

Then you can build a PyTorch dataset like this:

from cocogold.dataset import CocoGoldIterableDataset
ds = CocoGoldIterableDataset("coco-2017", split="val", return_type="pt")

By default, the dataset will randomly crop 512x512 squares that include a segmentation mask for one of the categories. Each time you iterate through the dataset you'll get different results.

Name		Name	Last commit message	Last commit date
Latest commit History 27 Commits
cocogold		cocogold
nbs		nbs
post		post
.gitignore		.gitignore
LICENSE		LICENSE
MANIFEST.in		MANIFEST.in
README.md		README.md
pyproject.toml		pyproject.toml
settings.ini		settings.ini
setup.py		setup.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

cocogold

Usage

Installation

How to use

About

Uh oh!

Releases

Packages

Contributors 5

Uh oh!

Languages

License

pcuenca/cocogold

Folders and files

Latest commit

History

Repository files navigation

cocogold

Usage

Installation

How to use

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Contributors 5

Uh oh!

Languages

Packages