Skip to content
Snippets Groups Projects
Commit fe259ddf authored by Studer Gabriel's avatar Studer Gabriel
Browse files

chain mapping: reduce likelihood of grouping dissimilar sequences together

In principle one can have the following alignment:

XXXXXXXXXXA--------
----------AYYYYYYYY

It has a 100% sequence identity! The previously implemented logic of gap
thresholds was also not very helpfil to filter out these cases as it
operated on fraction of gaps between first and last aligned column in
the alignment. That's 0.0 and thus perfect.

This commit simplifies this logic and simply checks for a sequence identity
threshold and a minimum number of aligned columns when grouping sequences
together. This should make grouping these cases together very unlikely.
parent 4c30c576
No related branches found
No related tags found
No related merge requests found
Loading
0% Loading or .
You are about to add 0 people to the discussion. Proceed with caution.
Finish editing this message first!
Please register or to comment