Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Define the benchmark for spectral clustering 2. #79

Open
1 of 3 tasks
ypriverol opened this issue Apr 4, 2019 · 4 comments
Open
1 of 3 tasks

Define the benchmark for spectral clustering 2. #79

ypriverol opened this issue Apr 4, 2019 · 4 comments
Assignees
Labels
enhancement New feature or request
Milestone

Comments

@ypriverol
Copy link
Member

ypriverol commented Apr 4, 2019

We have to design the spectral clustering benchmark, the following preconditions are needed for the test:

  • A single machine, with local space for stable tests.
  • Different sizes of datasets.
  • Set of well-defined metrics for performance (time, cpu/memory consumption, ) and clustering quality.
@ypriverol ypriverol added the enhancement New feature or request label Apr 4, 2019
@ypriverol
Copy link
Member Author

I have to check my local Machine and we have:

Memory: 15.6GB
Processor: Core i7 CPU 870 2.93GHz x 8
Disk: 500G

@jgriss let me know (with a +1) if you think this is fine and I tick the first precondition in the issue.

@ypriverol
Copy link
Member Author

We should define datasets of the following size:

Performance datasets: 100k, 1M, 2M, 5M, 10M, 100M.
Quality Datasets: Synthetic Peptide dataset.

@jgriss
Copy link
Member

jgriss commented Apr 5, 2019

Suggestions:

  • Melanoma dataset
  • Kuester peptides
  • PRIDE Human
  • Gigy
  • TCGA Colon Cancer

@ypriverol
Copy link
Member Author

Create a user for @jgriss in the machine for testing.

@ypriverol ypriverol added this to the 0.2 milestone Apr 12, 2019
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement New feature or request
Projects
None yet
Development

No branches or pull requests

2 participants