File: read10X.Rd

package info (click to toggle)
r-bioc-edger 3.40.2%2Bdfsg-1
  • links: PTS, VCS
  • area: main
  • in suites: bookworm
  • size: 1,484 kB
  • sloc: cpp: 1,425; ansic: 1,109; sh: 21; makefile: 5
file content (57 lines) | stat: -rw-r--r-- 2,600 bytes parent folder | download | duplicates (3)
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
\name{read10X}
\alias{read10X}

\title{Read 10X Genomics Files}

\description{Reads 10X Genomics files containing single-cell RNA-seq UMI counts in Matrix Market format.}

\usage{
read10X(mtx = NULL, genes = NULL, barcodes = NULL, path = ".", DGEList = TRUE)
}

\arguments{
\item{mtx}{name of \code{mtx} file containing counts in Matrix Exchange Format. Defaults to \code{matrix.mtx} or \code{matrix.mtx.gz}.}
\item{genes}{name of file containing gene IDs and names. Defaults to \code{features.tsv} or \code{genes.tsv} or gzipped versions of the same.}
\item{barcodes}{optional name of file containing barcodes. Defaults to \code{"barcodes.tsv"} or \code{barcodes.tsv.gz}.}
\item{path}{character string giving the directory containing the files.
Defaults to the current working directory.}
\item{DGEList}{logical. If \code{TRUE}, a \code{DGEList} will be returned, otherwise an unclassed list is returned.}
}

\details{
This function reads output files created by the 10X Genomics Cellranger pipeline, see
\url{https://support.10xgenomics.com/single-cell-gene-expression/software/pipelines/latest/output/matrices}.
The UMI counts are assembled into an integer matrix in R with accompanying gene IDs and gene symbols.
The results are returned as either a \code{DGEList} or an ordinary list.

The files \code{mtx}, \code{genes} and \code{barcodes} can be provided in either gzipped or unzipped versions.

This function creates an ordinary matrix of counts.
To read the counts instead into a sparse matrix format, the \code{read10xResults} function in the scater package is an alternative.
}

\value{
Either a \code{\link[edgeR:DGEList-class]{DGEList}} object (if \code{DGEList=TRUE}) or an ordinary list with the following components:
\item{counts}{matrix of counts.}
\item{genes}{data.frame counting gene symbols.}
\item{samples}{data.frame containing information about each cell. This will be omitted if \code{barcodes=NULL} and \code{DGEList=FALSE}.}
The only difference between the \code{DGEList} or list formats is that the \code{DGEList} adds some extra columns to the \code{samples} data.frame.
}

\author{Gordon Smyth}

\examples{
\dontrun{
GEO <- "ftp://ftp.ncbi.nlm.nih.gov/geo/samples/GSM2510nnn/GSM2510617/suppl/"
GEOmtx <- paste0(GEO,"GSM2510617_P7-matrix.mtx.gz")
GEOgenes <- paste0(GEO,"GSM2510617_P7-genes.tsv.gz")
download.file(GEOmtx,"matrix.mtx.gz")
download.file(GEOgenes,"genes.tsv.gz")
y <- read10X("matrix.mtx.gz", "genes.tsv.gz")
}}

\seealso{
\code{read10xResults} in the scater package.
}

\concept{Reading data files}