A graph database can store any kind of data using a few simple concepts:
1. Nodes – graph data records
2. Relationships – connect nodes
3. Properties – named data values
1. nodes are: papers
2. relationships: Paper A reference Paper B. From fields `references`, we can add these relationships
3. Properties: Felds except `references` can all be taken as properties (name and value pairs)
4. We can use Neo4j Python Driver (https://neo4j.com/developer/python/) to builld the graph from data
For our data, the schema https://www.openacademic.ai/oag/
| **Field Name** | **Field Type** | **Description** | **Example** |
| ————– | ————— | ——————————– | ———————————————————— |
| id | string | MAG or AMiner ID | 53e9ab9eb7602d970354a97e |
| title | string | paper title | Data mining: concepts and techniques |
| authors.name | string | author name | Jiawei Han |
| author.org | string | author affiliation | department of computer science university of illinois at urbana champaign |
| venue | string | paper venue | Inteligencia Artificial, Revista Iberoamericana de Inteligencia Artificial |
| year | int | published year | 2000 |
| keywords | list of strings | keywords | [“data mining”, “structured data”, “world wide web”, “social network”, “relational data”] |
| fos | list of strings | fields of study | [“relational database”, “data model”, “social network”] |
| n_citation | int | number of citation | 29790 |
| references | list of strings | citing papers’ ID | [“53e99ef4b7602d97027c2346”, “53e9aa23b7602d970338fb5e”, “53e99cf5b7602d97025aac75”] |
| page_stat | string | start of page | 11 |
| page_end | string | end of page | 18 |
| doc_type | string | paper type: journal, book title… | book |
| lang | string | detected language | en |
| publisher | string | publisher | Elsevier |
| volume | string | volume | 10 |
| issue | string | issue | 29 |
| issn | string | issn | 0020-7136 |
| isbn | string | isbn | 1-55860-489-8 |
| doi | string | doi | 10.4114/ia.v10i29.873 |
| pdf | string | pdf URL | //static.aminer.org/upload/pdf/1254/ 370/239/53e9ab9eb7602d970354a97e.pdf |
| url | list | external links | [“http://dx.doi.org/10.4114/ia.v10i29.873”, “http://polar.lsi.uned.es/revista/index.php/ia/ article/view/479”] |
| abstract | string | abstract | Our ability to generate… |