Skip to content

Factors affecting performance of Jplag being run on more than 2,000 submissions #1900

Answered by tsaglam
FaizAlam asked this question in Q&A
Discussion options

You must be logged in to vote

Disabling clusters helps with performance, also, you can adjust --shown-comparisons if the report generation takes too long. Avoid additional features like --normalize or --match-merging, as they can increase the runtime significantly. For two submissions with 30 LOC, the comparison should be a few milliseconds. Finally, you can also increase the min token match with -t, but this also adjusts the matching sensitivity. I would only do that if the submissions are larger and the results are still good afterward.

At its core, two factors affect the performance of JPlag: The number of submissions (exponential factor due to pairwise comparison) and the size of the submissions (especially affect…

Replies: 1 comment 1 reply

Comment options

You must be logged in to vote
1 reply
@tsaglam
Comment options

Answer selected by tsaglam
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Category
Q&A
Labels
question A question, so neither a bug nor a enhancement proposal.
2 participants