Returns quantification of copy number profile and events including tandem duplication and Chromothripisis etc. Only copy number data from autosome is used here. Some of the quantification methods are rough, you use at your risk. You should do some extra work to check the result scores.

scoring(object, TD_size_cutoff = c(1000, 1e+05, 2e+06), TD_cn_cutoff = Inf)

Arguments

object

a object of CopyNumber.

TD_size_cutoff

a length-3 numeric vector used to specify the start, midpoint, end segment size for determining tandem duplication size range, midpoint is used to split TD into short TD and long TD. Default is 1Kb to 100Kb for short TD, 100Kb to 2Mb for long TD.

TD_cn_cutoff

a number defining the maximum copy number of TD, default is Inf, i.e. no cutoff.

Value

a data.table with following scores:

  • cnaBurden: CNA burden representing the altered genomic fraction as previously reported.

  • cnaLoad: CNA load representing the quantity of copy number alteration.

  • MACN: mean altered copy number (MACN) reflecting the property of altered copy number segments, calculated as $$MACN = \frac{\sum_{i} CN_i}{N_{cnv}}$$ where \(CN_i\) is the copy number of altered segment \(i\), \(N_{cnv}\) is the number of CNV.

  • weightedMACN: same as MACN but weighted with segment length. $$MACN_{weighted} = \frac{\sum_{i} (CN_i \times L_{i})}{ \sum_{i} L_{i} }$$ where \(L_{i}\) is the length of altered copy number segment \(i\).

  • Ploidy: ploidy, the formula is same as weightedMACN but using all copy number segments instead of altered copy number segments.

  • TDP_pnas: tandem duplication phenotype score from https://www.pnas.org/content/113/17/E2373, the threshold k in reference is omitted. $$TDP = - \frac{\sum_{chr} |TD_{obs}-TD_{exp}|}{TD_{total}}$$ where \(TD_{total}\) is the number of TD, \(TD_{obs}\) and \(TD_exp\) are observed number of TD and expected number of TD for each chromosome.

  • TDP: tandem duplication score used defined by our group work, TD represents segment with copy number greater than 2. $$TD = \frac{TD_{total}}{\sum_{chr} |TD_{obs}-TD_{exp}|+1}$$

  • sTDP: TDP score for short TD.

  • lTDP: TDP score for long TD.

  • TDP_size : TDP region size (Mb).

  • sTDP_size: sTDP region size (Mb).

  • lTDP_size: lTDP region size(Mb).

  • Chromoth_state: chromothripsis state score, according to reference doi: 10.1016/j.cell.2013.02.023 , chromothripsis frequently leads to massive loss of segments on the affected chromosome with segmental losses being interspersed with regions displaying normal (disomic) copy-number (e.g., copy-number states oscillating between copy-number = 1 and copy-number = 2), form tens to hundreds of locally clustered DNA rearrangements. Most of methods use both SV and CNV to infer chromothripsis, here we roughly quantify it with $$\sum_{chr}{N_{OsCN}^2}$$ where \(N_{OsCN}\) is the number of oscillating copy number pattern "2-1-2" for each chromosome.

Examples

# Load copy number object
load(system.file("extdata", "toy_copynumber.RData",
  package = "sigminer", mustWork = TRUE
))

d <- scoring(cn)
d

d2 <- scoring(cn, TD_cn_cutoff = 4L)
d2