File: page_rank.Rd

package info (click to toggle)
r-cran-igraph 1.0.1-1%2Bdeb9u1
  • links: PTS, VCS
  • area: main
  • in suites: stretch
  • size: 18,232 kB
  • sloc: ansic: 173,538; cpp: 19,365; fortran: 4,550; yacc: 1,164; tcl: 931; lex: 484; makefile: 149; sh: 9
file content (142 lines) | stat: -rw-r--r-- 6,063 bytes parent folder | download | duplicates (2)
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
% Generated by roxygen2 (4.1.1): do not edit by hand
% Please edit documentation in R/centrality.R
\name{page_rank}
\alias{page.rank}
\alias{page.rank.old}
\alias{page_rank}
\alias{page_rank_old}
\title{The Page Rank algorithm}
\usage{
page_rank(graph, algo = c("prpack", "arpack", "power"), vids = V(graph),
  directed = TRUE, damping = 0.85, personalized = NULL, weights = NULL,
  options = NULL)

page_rank_old(graph, vids = V(graph), directed = TRUE, niter = 1000,
  eps = 0.001, damping = 0.85, old = FALSE)
}
\arguments{
\item{graph}{The graph object.}

\item{algo}{Character scalar, which implementation to use to carry out the
calculation. The default is \code{"prpack"}, which uses the PRPACK library
(https://github.com/dgleich/prpack). This is a new implementation in igraph
version 0.7, and the suggested one, as it is the most stable and the fastest
for all but small graphs.  \code{"arpack"} uses the ARPACK library, the
default implementation from igraph version 0.5 until version 0.7.
\code{power} uses a simple implementation of the power method, this was the
default in igraph before version 0.5 and is the same as calling
\code{page_rank_old}.}

\item{vids}{The vertices of interest.}

\item{directed}{Logical, if true directed paths will be considered for
directed graphs. It is ignored for undirected graphs.}

\item{damping}{The damping factor (\sQuote{d} in the original paper).}

\item{personalized}{Optional vector giving a probability distribution to
calculate personalized PageRank. For personalized PageRank, the probability
of jumping to a node when abandoning the random walk is not uniform, but it
is given by this vector. The vector should contains an entry for each vertex
and it will be rescaled to sum up to one.}

\item{weights}{A numerical vector or \code{NULL}. This argument can be used
to give edge weights for calculating the weighted PageRank of vertices. If
this is \code{NULL} and the graph has a \code{weight} edge attribute then
that is used. If \code{weights} is a numerical vector then it used, even if
the graph has a \code{weights} edge attribute. If this is \code{NA}, then no
edge weights are used (even if the graph has a \code{weight} edge attribute.}

\item{options}{Either a named list, to override some ARPACK options. See
\code{\link{arpack}} for details; or a named list to override the default
options for the power method (if \code{algo="power"}).  The default options
for the power method are \code{niter=1000} and \code{eps=0.001}. This
argument is ignored if the PRPACK implementation is used.}

\item{niter}{The maximum number of iterations to perform.}

\item{eps}{The algorithm will consider the calculation as complete if the
difference of PageRank values between iterations change less than this value
for every node.}

\item{old}{A logical scalar, whether the old style (pre igraph 0.5)
normalization to use. See details below.}
}
\value{
For \code{page_rank} a named list with entries: \item{vector}{A
numeric vector with the PageRank scores.} \item{value}{The eigenvalue
corresponding to the eigenvector with the page rank scores. It should be
always exactly one.} \item{options}{Some information about the underlying
ARPACK calculation. See \code{\link{arpack}} for details. This entry is
\code{NULL} if not the ARPACK implementation was used.}

For \code{page_rank_old} a numeric vector of Page Rank scores.
}
\description{
Calculates the Google PageRank for the specified vertices.
}
\details{
For the explanation of the PageRank algorithm, see the following webpage:
\url{http://infolab.stanford.edu/~backrub/google.html}, or the following
reference:

Sergey Brin and Larry Page: The Anatomy of a Large-Scale Hypertextual Web
Search Engine. Proceedings of the 7th World-Wide Web Conference, Brisbane,
Australia, April 1998.

igraph 0.5 (and later) contains two PageRank calculation implementations.
The \code{page_rank} function uses ARPACK to perform the calculation, see
also \code{\link{arpack}}.

The \code{page_rank_old} function performs a simple power method, this is
the implementation that was available under the name \code{page_rank} in pre
0.5 igraph versions. Note that \code{page_rank_old} has an argument called
\code{old}. If this argument is \code{FALSE} (the default), then the proper
PageRank algorithm is used, i.e. \eqn{(1-d)/n} is added to the weighted
PageRank of vertices to calculate the next iteration. If this argument is
\code{TRUE} then \eqn{(1-d)} is added, just like in the PageRank paper;
\eqn{d} is the damping factor, and \eqn{n} is the total number of vertices.
A further difference is that the old implementation does not renormalize the
page rank vector after each iteration.  Note that the \code{old=FALSE}
method is not stable, is does not necessarily converge to a fixed point. It
should be avoided for new code, it is only included for compatibility with
old igraph versions.

Please note that the PageRank of a given vertex depends on the PageRank of
all other vertices, so even if you want to calculate the PageRank for only
some of the vertices, all of them must be calculated. Requesting the
PageRank for only some of the vertices does not result in any performance
increase at all.

Since the calculation is an iterative process, the algorithm is stopped
after a given count of iterations or if the PageRank value differences
between iterations are less than a predefined value.
}
\examples{
g <- sample_gnp(20, 5/20, directed=TRUE)
page_rank(g)$vector

g2 <- make_star(10)
page_rank(g2)$vector

# Personalized PageRank
g3 <- make_ring(10)
page_rank(g3)$vector
reset <- seq(vcount(g3))
page_rank(g3, personalized=reset)$vector
}
\author{
Tamas Nepusz \email{ntamas@gmail.com} and Gabor Csardi
\email{csardi.gabor@gmail.com}
}
\references{
Sergey Brin and Larry Page: The Anatomy of a Large-Scale
Hypertextual Web Search Engine. Proceedings of the 7th World-Wide Web
Conference, Brisbane, Australia, April 1998.
}
\seealso{
Other centrality scores: \code{\link{closeness}},
\code{\link{betweenness}}, \code{\link{degree}}
}
\keyword{graphs}