Very slow validation for single-end fastq samplesheets #90
Comments
That's weird, I'll have to investigate further when I have some time. Could you in the meantime please try this:
Thanks so much 😄 I have updated the code in that repo to include validation with The results are the same, except that it seems like validation of the JSON schema is done twice now (I'm assuming once for
I was running this with Nextflow version 23.04.1 build 5866
I wonder if there's something unexpected happening here with type coercion of file objects, where a lot of data is being moved around unexpectedly or something... Could be related to #92
bump -- having the same problem with SE input. I appreciate @apetkau's description... I wouldn't have done nearly as nice a job with it. Glad to know that it will eventually get on to the workflow. For what it is worth, I am using
Thanks so much @awgymer and the nf-validation team for the fix. It works great 😄
Hello. I have been working on how to make use of the fromSamplesheet function to validate a samplesheet using the assets/schema_input.json file and create a channel of input data. However, I have been encountering a large difference in the time it takes to validate a samplesheet containing paired-end fastq files versus one containing single-end fastq files when using nf-validation version 0.3.1.
I have written up a method to reproduce the issue at https://github.com/apetkau/from-samplesheet-test-nf, but in brief you can run:
You can replace samplesheet.pe.30.csv with the files listed in the table below to run the other cases.
That is, the time to validate the samplesheet against the schema_input.json file appears roughly constant as the number of samples increases for paired-end samplesheets, but for single-end samplesheets going from 30 to 60 samples increases the validation time by a factor of 15.
I am wondering if someone could help me to sort out this issue?
Thanks so much. And thanks for the amazing software. It's helped me out in my work 😄
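For anyone wanting to try other sample counts, here is a minimal sketch of how single-end and paired-end samplesheets of a given size could be generated. It is not taken from the linked repository: the column names (sample, fastq_1, fastq_2), the fastq file names, and the helper make_samplesheet are all assumptions for illustration and would need to match your own schema_input.json.

```python
import csv
import io


def make_samplesheet(n_samples: int, paired: bool) -> str:
    """Return CSV text for a samplesheet with n_samples data rows.

    Hypothetical helper: the header (sample, fastq_1, fastq_2) and the
    fastq file names are assumed, not taken from the original repository.
    """
    buf = io.StringIO()
    writer = csv.writer(buf, lineterminator="\n")
    writer.writerow(["sample", "fastq_1", "fastq_2"])
    for i in range(1, n_samples + 1):
        name = f"sample{i}"
        read_1 = f"{name}_R1.fastq.gz"
        # Single-end rows leave the second read column empty.
        read_2 = f"{name}_R2.fastq.gz" if paired else ""
        writer.writerow([name, read_1, read_2])
    return buf.getvalue()
```

Writing the result of make_samplesheet(60, paired=False) to a file gives a 60-sample single-end case to compare against the paired-end equivalent. The generated paths point at files that do not exist, so a schema that checks for file existence would need real or placeholder fastq files at those locations.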