Skip to content
This repository has been archived by the owner on Mar 2, 2022. It is now read-only.

Prep network data for visualization #5

Open
vordemann opened this issue Oct 9, 2020 · 0 comments
Open

Prep network data for visualization #5

vordemann opened this issue Oct 9, 2020 · 0 comments
Assignees
Milestone

Comments

@vordemann
Copy link
Member

vordemann commented Oct 9, 2020

Goal of this issue is to have a clustered first dataset for visualization of the networks. To achieve that one must do the following steps.

  1. This prep step reduces the amount of data but is not necessary. This step depends on the rollback functionality in Visualize hierarchy of our keywords + rollback function to remove branches #2. Use it to remove obvious useless categories and then rollback the changes into our wikidata. After that update the mapping in keyword_wikidata with https://github.com/icebreaker-science/network/blob/master/postgresql/scripts/wikidata.sql
  2. Starting point is the table keyword_wikidata where you can find a mapping of the networks keywords to the wikidata with its parents over 10 iterations. This is a 1:n mapping, since a keyowrd may appear multiple times in the wikidata.
  3. Write a script that inputs predefined categories and then iterates over all the mappings in keyword_wikidata. It then checks if it finds the category names in the parents list of each keyword. If it matches then it will add this keyword category relation to a new table called keyword_categories. A keyword in the network can be part of several categories. If a keyword in the network does not match with any of the provided categories, add it to the artificial category "Others". This is a temporary fix and will be addressed in a later issue.

The category list may change in the future but is for now:
[biomolecule | chemical substance | metal | process | analytical method | biochemical relation | property]

@vordemann vordemann added this to the milestone 3 milestone Oct 9, 2020
@vordemann vordemann changed the title Prep network data for visualization and keyword linking to devices Prep network data for visualization Oct 24, 2020
@dajenet dajenet self-assigned this Nov 10, 2020
Sign up for free to subscribe to this conversation on GitHub. Already have an account? Sign in.
Projects
None yet
Development

No branches or pull requests

2 participants