File: detect_suspicious_sample_by_hierarchical_clustering_2comp.Rd

package info (click to toggle)
r-bioc-demixt 1.14.0%2Bdfsg-1
  • links: PTS, VCS
  • area: main
  • in suites: bookworm
  • size: 4,300 kB
  • sloc: ansic: 1,591; cpp: 1,209; makefile: 5; sh: 4
file content (56 lines) | stat: -rw-r--r-- 1,817 bytes parent folder | download | duplicates (2)
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
% Generated by roxygen2: do not edit by hand
% Please edit documentation in R/DeMixT_Preprocessing.R
\name{detect_suspicious_sample_by_hierarchical_clustering_2comp}
\alias{detect_suspicious_sample_by_hierarchical_clustering_2comp}
\alias{plot_sd}
\alias{plot_dim}
\title{detect_suspicious_sample_by_hierarchical_clustering_2comp}
\usage{
detect_suspicious_sample_by_hierarchical_clustering_2comp(
  count.matrix,
  normal.id,
  tumor.id
)

plot_sd(count.matrix, normal.id, tumor.id)

plot_dim(
  count.matrix,
  labels,
  legend.position = "bottomleft",
  legend.cex = 1.2
)
}
\arguments{
\item{count.matrix}{A matrix of raw expression count with \eqn{G} by \eqn{(My + M1)}, where \eqn{G} is the number
of genes, \eqn{My} is the number of mixed samples and \eqn{M1} is the number of normal samples. Row names are genes
 column names are sample ids.}

\item{normal.id}{A vector of normal sample ids}

\item{tumor.id}{A vector of tumor sample ids}

\item{legend.position}{Position of legend in the plot. Default is bottomleft.}

\item{legend.cex}{Character expansion factor relative to current par("cex"). Default = 1.2}
}
\value{
list object




}
\description{
Detect suspicious samples by a hierarchical clustering

This function is designed to evaluate the separation of tumor samples and normal samples in a PCA space. 
If some normal samples appear in the tumor-sample dominated cluster, these normal samples are likely to 
be tumor samples and they are supposed to be filtered out before downstream analysis. But for those tumor 
samples appearing in the normal-sample dominated cluster, we do not remove them since they may be the ones 
with low tumor purity.

Plot the standard deviation of log2 raw expression

Plot the distribution of tumor and normal samples in a 2D PCA space based on their expressions
}