Fix: classify in-frame complex variants (MNP + INDEL) as INFRAME_COMPLEX_VARIANT#1181
Conversation
…th MODERATE impact
|
@tristanpwdennis @jonbrenas please check and merge |
|
Hi @vinitjain2005, can you explain your reasoning? |
|
Hi @jonbrenas, thanks for asking — happy to walk through it. Looking at the surrounding code, the branch just above uses For the effect label, I went with I also want to be transparent: the TODO branch may have been left intentionally if there's ambiguity in how the net frame status is guaranteed for this case. If that's a concern, I'm happy to dig into the logic further or add a test case to validate the behavior. Let me know what would be most helpful! Github: @vinitjain2005 |
|
Hi @vinitjain2005, I think there are 2 different things to take into consideration here:
All that to say that addressing this 'TODO' requires a bit of biological thinking and is not going to have any impact, so I wouldn't put a lot of effort (as I obviously haven't given that it is still 'TODO') in it. |
|
Thanks for the detailed explanation @jonbrenas — that's really helpful context. I understand now that the branch isn't reachable with the current SNP-only usage, and that properly resolving the TODO would need genuine biological reasoning across multiple sites rather than just a label change. I'll leave it as-is. I'll focus my attention on issues that affect active code paths instead. If you have any suggestions for areas of the codebase that would benefit from contribution — especially anything relevant to variant handling or data quality — I'd really appreciate the pointer! |
Replace TODO with INFRAME_COMPLEX_VARIANT/MODERATE placeholder. Code path unreachable in production (SNP-only), proper classification requires frame-shift analysis per PR malariagen#1181 discussion.
Summary
This PR replaces the placeholder effect for in-frame complex variants (MNP + INDEL) in
_get_within_cds_effectwith a proper classification.Changes
Replaced:
effect="TODO in-frame complex variation (MNP + INDEL)", impact="UNKNOWN"
With:
effect="INFRAME_COMPLEX_VARIANT", impact="MODERATE"
Why this change?
Impact
In-frame complex variants are now correctly labeled with a MODERATE impact instead of UNKNOWN, improving downstream analysis and interpretability.
Testing
Closes #1180
Github: @vinitjain2005