Split a list of in-memory Strelka SBS VCF into SBS, DBS, and variants involving > 2 consecutive bases
SBSs are single base substitutions, e.g. C>T, A<G,.... DBSs are double base substitutions, e.g. CC>TT, AT>GG, ... Variants involving > 2 consecutive bases are rare, so this function just records them. These would be variants such ATG>CCT, AGAT>TCTA, ...
SplitListOfStrelkaSBSVCFs( list.of.vcfs, suppress.discarded.variants.warnings = TRUE )
list.of.vcfs |
A list of in-memory data frames containing Strelka SBS VCF file contents. |
suppress.discarded.variants.warnings |
Logical. Whether to suppress warning messages showing information about the discarded variants. Default is TRUE. |
A list of elements as follows:
SBS.vcfs
: List of data.frames of pure SBS mutations – no DBS or
3+BS mutations.
DBS.vcfs
: List of data.frames of pure DBS mutations – no SBS or
3+BS mutations.
discarded.variants
: Non-NULL only if there are variants
that were excluded from the analysis. See the added extra column
discarded.reason
for more details.
Please choose more modern alternatives, such as Google Chrome or Mozilla Firefox.