AAAI 2025 paper: MORE: Molecule Pretraining with Multi-Level Pretext Task
We used the following packages under Python 3.7:
- pytorch 1.13.1
- torch-cluster 1.6.1
- torch-geometric 2.6.0
- torch-scatter 2.1.1
- torch-sparse 0.6.17
- rdkit 2022.9.5
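To double-check the environment, the optional snippet below (a sketch, not part of the repository) prints the installed versions so you can compare them against the list above.

```python
# Optional environment check: print installed versions and compare with the list above.
import torch
import torch_cluster
import torch_geometric
import torch_scatter
import torch_sparse
from rdkit import rdBase

print("pytorch        ", torch.__version__)
print("torch-cluster  ", torch_cluster.__version__)
print("torch-geometric", torch_geometric.__version__)
print("torch-scatter  ", torch_scatter.__version__)
print("torch-sparse   ", torch_sparse.__version__)
print("rdkit          ", rdBase.rdkitVersion)
```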
The pre-training and downstream datasets used in our experiments follow the paper Strategies for Pre-training Graph Neural Networks. You can download the biology and chemistry datasets from their repository.
- To run the code successfully, place the downloaded pre-training datasets in /dataset_conf and /dataset_info.
  (If you use the 3D-level pretext task, use /dataset_conf.)
  (If you do not use the 3D-level pretext task, use /dataset_info.)
- To run the code successfully, place the downloaded fine-tuning datasets in /dataset.
For pre-training we use Pretrain/dataset_conf/zinc_2m_MD and Pretrain/dataset_info/zinc_2m_MD (preprocessed data from the zinc_standard_agent dataset; you can get it here). A minimal layout check is sketched below.
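The following sanity check is not part of the repository; it assumes the repository root as the working directory and that the fine-tuning data sits under Finetune/dataset, so adjust the paths if your layout differs.

```python
# Minimal layout check (assumptions: repo root as working directory,
# fine-tuning data under Finetune/dataset).
import os

expected = [
    "Pretrain/dataset_conf/zinc_2m_MD",  # pre-training with the 3D-level pretext task
    "Pretrain/dataset_info/zinc_2m_MD",  # pre-training without the 3D-level pretext task
    "Finetune/dataset",                  # downstream (fine-tuning) datasets
]
for path in expected:
    print(f"{path}: {'found' if os.path.isdir(path) else 'MISSING'}")
```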
Please run pretraining.py for pre-training.
The pre-trained models we use follow the training procedures of Strategies for Pre-training Graph Neural Networks and GraphMAE.
Please run finetune.py for downstream adaptations.
We provide a pretrained MORE checkpoint at Finetune/pre-train/MORE.pth.
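To inspect the released checkpoint before fine-tuning, a quick sketch is shown below; the internal layout of MORE.pth (raw state_dict vs. a wrapped dictionary) is an assumption, so defer to how finetune.py loads it.

```python
# Illustrative checkpoint inspection; the internal layout of MORE.pth is an
# assumption, so rely on finetune.py for the actual loading logic.
import torch

state = torch.load("Finetune/pre-train/MORE.pth", map_location="cpu")
print(type(state))
if isinstance(state, dict):
    for key in list(state)[:10]:  # peek at the first few entries / parameter names
        print(key)
```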
- Pretraining settings
| Hyperparameter | Value |
|---|---|
| batch size | 256 |
| epochs | 100 |
| learning rate | 0.001 |
| dropout rate | 0.2 |
| decay for graph-level decoder | 0.001 |
| decay for 3D-level decoder | 0.001 |
| mask rate | 0.25 |
| | 4.5 |
| | 5.0 |
| | 1.0 |
| | 0.04 |
- Fine-tuning settings
| Hyperparameter | Value |
|---|---|
| batch size | 32 |
| epochs | 50 |
| learning rate | 0.001 |
| dropout rate | 0.5 |
| decay | 0.0 |
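For reference only, the settings above are restated as Python dictionaries below; the key names are illustrative assumptions and do not necessarily match the argument names of pretraining.py or finetune.py, and only the named hyperparameters are included.

```python
# Hypothetical defaults mirroring the tables above; key names are illustrative,
# not the actual argument names used by pretraining.py / finetune.py.
PRETRAIN_DEFAULTS = {
    "batch_size": 256,
    "epochs": 100,
    "learning_rate": 1e-3,
    "dropout_rate": 0.2,
    "decay_graph_decoder": 1e-3,  # decay for the graph-level decoder
    "decay_3d_decoder": 1e-3,     # decay for the 3D-level decoder
    "mask_rate": 0.25,
}

FINETUNE_DEFAULTS = {
    "batch_size": 32,
    "epochs": 50,
    "learning_rate": 1e-3,
    "dropout_rate": 0.5,
    "decay": 0.0,
}
```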
For pre-training, see Pretrain/example.ipynb.
For fine-tuning, see Finetune/example.ipynb.
