Skip to content

Latest commit

 

History

History
38 lines (31 loc) · 1.9 KB

README.md

File metadata and controls

38 lines (31 loc) · 1.9 KB

ggc

This is a Python implementation for Geometric Graph Construction (GGC) from data features, as described in our paper:

Yifan Qian, Paul Expert, Pietro Panzarasa, and Mauricio Barahona (2021), Geometric graphs from data to aid classification tasks with graph convolutional networks, Patterns.

Installation

python setup.py install

Run the demo

cd ggc
jupyter notebook demo.ipynb

Data

Data sets can be found here. Each data set is a single csv-like file. Each row represents a sample. The first column is sample id, the last column is sample label and all the columns in the middle are features. Detailed descriptions of origins of data sets are described in the SI Appendix in our paper. For the splits of training/validation/test sets, the first N1 rows consist of the traning set, the next N2 rows consist of the validation set, and the rest of N3 rows corresponds to the test set. The exact numbers of N1(~5% of samples), N2(~10% of samples) and N3(~85% of samples) are described in the SI Appendix in our paper. The samples in the training set are evenly distributed across classes. Data can be loaded by using Kipf's code here.

Cite

Please cite our paper if you use this code in your own work:

@article{qian2021geometric,
  title={Geometric graphs from data to aid classification tasks with graph convolutional networks},
  author={Qian, Yifan and Expert, Paul and Panzarasa, Pietro and Barahona, Mauricio},
  journal={Patterns},
  volume={2},
  number={4},
  pages={100237},
  year={2021},
  oi={10.1016/j.patter.2021.100237},
  publisher={Elsevier}
}