dyf #3

1324fgg · 2025-04-27T02:01:41Z

No description provided.

Jupyter notebook for arvix_data_loading_pipeline

Add an explanation of how to extract the normal embedding from the saved graph, and mention a problem that may occur when processing the mag dataset.

…00 nodes). The sampled mag graph has more nodes than the arxiv dataset but is less connected.
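One way to make the "more nodes, but less connected" comparison concrete is to look at average degree and edge density. The sketch below is only an illustration: the helper names `average_degree` and `density` are hypothetical, the citation edges are treated as undirected with no duplicates, and the two toy graphs are made up rather than taken from the arxiv or mag data.

```python
def average_degree(num_nodes, edges):
    """Mean number of edge endpoints per node, treating edges as undirected."""
    if num_nodes == 0:
        return 0.0
    return 2 * len(edges) / num_nodes


def density(num_nodes, edges):
    """Fraction of all possible undirected node pairs that are linked."""
    if num_nodes < 2:
        return 0.0
    return len(edges) / (num_nodes * (num_nodes - 1) / 2)


# Toy graphs only; these are NOT the arxiv or mag statistics.
small_dense = (4, [(0, 1), (0, 2), (1, 2), (2, 3)])
large_sparse = (6, [(0, 1), (2, 3), (4, 5)])

for name, (n, e) in [("small_dense", small_dense), ("large_sparse", large_sparse)]:
    print(name, "avg degree:", average_degree(n, e), "density:", round(density(n, e), 3))
```

A graph can have more nodes and still score lower on both measures, which is the situation described for the sampled mag graph.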

This is the sampled mag graph dataset. It contains only paper nodes and paper-citation edges. "feat" is the 128-dimensional embedding of the title and abstract provided by the dataset, "_ID" is the node ID, and "y" is the class label. Its structure is:
Node data: dict_keys(['year', 'feat', '_ID', '_TYPE', 'y'])
Edge data: dict_keys(['reltype', '_ID', '_TYPE'])

Here is an image describing the arxiv dataset, the mag dataset, and the sampled mag dataset.

This is a GraphSAGE model trained on the sampled mag dataset. I split the 2w (20,000) data points into training, validation, and test sets, separated by the year of the paper (e.g., 2013, 2015). The accuracy (47% after 100 training epochs) is higher than the 31.53% reported in the arXiv paper 2005.00687v7. Maybe my sampled dataset is better connected, which makes the node labels easier to predict. This is the first version. Training is much faster than I expected and takes only minutes. I will finish the other parts later.
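For reference, a year-based split like the one described above could look roughly like this. It is a minimal sketch, not the notebook's code: the function name `split_by_year`, the parameter names, and the exact rule (years up to 2013 for training, up to 2015 for validation, later years for testing) are assumptions, since the comment only says the split is separated by year.

```python
def split_by_year(years, train_until=2013, val_until=2015):
    """Return (train, val, test) lists of node indices, split by paper year.

    Assumed rule (not confirmed by the PR):
      year <= train_until              -> train
      train_until < year <= val_until  -> validation
      year > val_until                 -> test
    """
    train, val, test = [], [], []
    for idx, year in enumerate(years):
        if year <= train_until:
            train.append(idx)
        elif year <= val_until:
            val.append(idx)
        else:
            test.append(idx)
    return train, val, test


# Toy years for six hypothetical paper nodes.
toy_years = [2010, 2013, 2014, 2015, 2016, 2018]
train_idx, val_idx, test_idx = split_by_year(toy_years)
print(train_idx, val_idx, test_idx)
```

Splitting by time rather than at random keeps later papers out of the training set, so the test accuracy reflects prediction on newer papers.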

recommend to change the name for better understanding

Upload two notebooks, for MLP training and result evaluation.

This Jupyter file contains all the data processing we need, including sampling and combination. It also contains GraphSAGE training on the combined graph.

jethrocsau and others added 30 commits April 21, 2025 18:53

Update README.md

cca4934

reorganized documents from forked repo

a5900b3

format ref file & set framework

8a5dc53

update load data utils

b7edd39

Add files via upload

ee01335

updated data utils

710c1b7

Merge branch 'main' of https://github.com/jethrocsau/GNN-language-emb…

f588e49

…edded

updated data_utils

aff5af8

updated modules

f79cdf5

creating superclass GraphAlign_e5

2bb6179

include data preparation func

eb50c0e

uncommented gcn

82da1fd

expanded start + dataset

f7b4259

update save paths

3b8e8c6

generate and save graph alignment

0e424d1

Update generate_embedding.py

049c9a3

add load_graph

176d6f0

Update generate_embedding.py

558b901

Update data_utils.py

fc6d77d

Here is the description for arxiv dataset and sampled mag dataset(300…

c062773

…00 nodes). The sampled mag graph has more nodes than the arxiv dataset but is less connected.

Add files via upload

fc65a46

updated mag nodeidx2papers

5f873c7

Merge branch 'main' of https://github.com/jethrocsau/GNN-language-emb…

ea7d3cb

…edded

Image of 200000 subgraph of mag dataset

e8cdf89

Merge branch 'main' of https://github.com/jethrocsau/GNN-language-emb…

8cebab3

…edded

updated for graphalign embeddings

2d0fcbc

updated

7880bf8

map_graph.py

09e68d5

jethrocsau and others added 25 commits April 29, 2025 12:21

update

839fe03

Merge branch 'main' of https://github.com/jethrocsau/GNN-language-emb…

4544169

…edded

Create README.md

b5cc815

Update README.md

b746443

Update README.md

21eba53

Update README.md

9787d19

Merge branch 'main' of https://github.com/jethrocsau/GNN-language-emb…

847dddf

…edded

graph train save to dir

6955e46

debug fix

8e252a7

added batchify

1edcd86

debug

0381490

updated for pca normalized & 3-layer GAT

49d09f6

debug and set argparse

b3cc8ae

modified relu and drop out

e4b22ae

update utils

4fd4109

debug

0923f06

typo fix

0728d97

adding epoch values

8f5d317

Add files via upload

6393608

organizing repo

9ea0904

organizing

fce8fb9

update readme

cbf306d

Add files via upload

97441b3

Delete dyf_graphsage_second_version.ipynb

52fad33

Add files via upload

ae445f5
