-
Notifications
You must be signed in to change notification settings - Fork 752
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Hifiadapterfilt #7398
base: master
Are you sure you want to change the base?
Hifiadapterfilt #7398
Conversation
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Looks good! Just some minor comments; I understand that at the moment there is only one pacbio test dataset, that unfortunately doesn't have any adapters to filter. Would it be easy/possible to upload another pacbio minimal test dataset where adapters are getting filtered out (and probably also have fewer to non empty output files)? Also, even though the conda environment and the containers describe version 3.0.0, in the versions.yml, 3.0.1 is written out. Can you investigate why the mismatch?
Co-authored-by: Evangelos Karatzas <[email protected]>
Co-authored-by: Evangelos Karatzas <[email protected]>
[ | ||
|
||
] |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
The test produces no snaps at all - maybe this can be changed?
{ assert snapshot(process.out[0][1].findAll { | ||
file(it).name == "test.filt.fastq.gz" && | ||
file(it).name == "test.contaminant.blastout" && | ||
file(it).name == "test.blocklist" }).match()}, | ||
{ assert path(process.out.stats[0].get(1)).text.contains('Number of adapter contaminated ccs reads: 0 (0% of total)') }, |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
These files seem to not be found in the snapshot?
""" | ||
input[0] = [ | ||
[ id:'test' ], | ||
file(params.test_data['homo_sapiens']['pacbio']['hifi'], checkIfExists: true) |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
can you change this to the newer path description? :)
file(params.modules_testdata_base_path + 'genomics/sarscov2/illumina/vcf/test.vcf.gz', checkIfExists: true)
when: | ||
task.ext.when == null || task.ext.when | ||
|
||
script: |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
script: | |
script: |
def args = task.ext.args ?: '' | ||
def prefix = task.ext.prefix ?: "${meta.id}" | ||
// The tool AUTOMATICALLY detects fastq files from the input folder, hence an explicit call of "fastq" is not needed. | ||
|
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
This PR was created to introduce hifiadapterfilt as a new module. It's purpose is to remove adapters that are specific to pacbio hifi reads.
PR checklist
Closes #7390
versions.yml
file.label
nf-core modules test <MODULE> --profile docker
nf-core modules test <MODULE> --profile singularity
nf-core modules test <MODULE> --profile conda
nf-core subworkflows test <SUBWORKFLOW> --profile docker
nf-core subworkflows test <SUBWORKFLOW> --profile singularity
nf-core subworkflows test <SUBWORKFLOW> --profile conda