Google's PageRank Algorithm

Complete mathematical guide to Google's fundamental link analysis algorithm

Henrik Bondtofte
20 min read
Mathematical guide
Google PageRank algorithm illustration
PR(A) = (1-d) + d × Σ(PR(Ti)/C(Ti))

Table of Contents

What is PageRank?

PageRank is the name of the link analysis algorithm invented by Sergey Brin and Larry (Lawrence) Page at Stanford University, and simultaneously the formula that was the foundation for establishing Google.

Many people mistakenly believe that PageRank is no longer part of Google's algorithms. It's important to understand that PageRank is one of Google's cornerstones and an absolutely essential part of Google.

In its simplicity, the formula aims to assign numerical values to hyperlinks to measure a page's relative importance. In other words, PageRank is a popularity algorithm based on citation analysis.

Origin of the Name

It was Larry himself who named the formula, hence the name PageRank. It's not called PageRank because the value covers a website as a whole. PageRank accumulates on all pages of a website that Google has access to, not at the domain level.

Common Misconception

PageRank is applied at the page level, which means there's no such thing as, for example, a PageRank-4 domain. It would be called a website with a homepage that has a PageRank of 4 – it's not the domain itself.

The History of PageRank

1996

The Invention

Larry Page and Sergey Brin develop BackRub (later Google) at Stanford University

First iteration of the PageRank algorithm based on citation analysis

1998

Patent Filing

Stanford University files patent for the PageRank algorithm

Patent US6285999B1 describes the mathematical algorithm in detail

1998-2000

Google Foundation

Google Inc. is founded with PageRank as the core algorithm

PageRank becomes the primary ranking factor for search results

2000-2013

Toolbar PageRank

Google Toolbar displays public PageRank values (0-10 scale)

Webmasters can see their pages' PageRank scores in Google Toolbar

2013

Toolbar Discontinued

Google stops updating public PageRank values

PageRank becomes internal Google tool, no longer publicly accessible

2016+

Modern Implementation

PageRank integrated with hundreds of other ranking signals

Still fundamental to Google, but combined with machine learning

The Original PageRank Formula

Basic PageRank Formula

PR(A) = (1-d) + d × Σ(PR(Ti)/C(Ti))

PR(A) = PageRank of page A

d = Damping factor (typically 0.85)

PR(Ti) = PageRank of page Ti that links to A

C(Ti) = Number of outbound links from page Ti

Simplified Version

The formula in its original form distributes weight equally among the links found on a page, regardless of whether they are internal or external links.

Value = PageRank + (PageRank from sources × 0.85) / number of links

Practical Example

If you have, for example, 10 outbound links on a page, each link transfers 10% of the PageRank value you have the ability to transfer.

PageRank ÷ 10 links = 10% per link

Mathematical Analysis

Matrix Representation

PageRank can be calculated using matrix algebra, where the web is represented as a transition matrix M:

M = d × H + (1-d)/N × J

Where:
H = Link matrix (hij = 1/L(j) if j links to i, else 0)
J = Matrix of all 1s 
N = Number of pages
d = Damping factor

Eigenvector Calculation

The PageRank vector is the dominant eigenvector of transition matrix M:

π = M × π

Where π is the PageRank vector and M is the transition matrix

Convergence Criteria

The algorithm converges when the difference between iterations is sufficiently small:

||π(k+1) - π(k)|| < ε

Where ε is typically 10⁻⁶ for high precision

Damping Factor (0.85)

What is the damping factor?

The damping factor (d = 0.85) represents the probability that a user continues clicking on links, rather than starting a new search. In other words, there's an 85% chance of following a link and a 15% chance of jumping to a random page.

With damping factor (d = 0.85)

  • ✓ Prevents ranking manipulation
  • ✓ Handles dangling nodes (pages without outgoing links)
  • ✓ Ensures algorithm convergence
  • ✓ Models realistic user behavior

Without damping factor (d = 1.0)

  • ✗ Rank sinks (pages that collect all PageRank)
  • ✗ Algorithm doesn't always converge
  • ✗ More vulnerable to manipulation
  • ✗ Unrealistic user model

Mathematical significance of damping factor

15%
Random teleport
(1-d) component
85%
Link following
d × link component
~100
Iterations
to convergence

Iterative Calculation Process

PageRank is calculated iteratively, where each iteration improves the estimate of all pages' PageRank values:

Iterative Algorithm

1. Initialize: PR⁰(i) = 1/N for all pages i
2. For k = 0, 1, 2, ... until convergence:
   PR^(k+1)(i) = (1-d)/N + d × Σ(PR^k(j)/L(j))
   where j links to i
3. Stop when ||PR^(k+1) - PR^k|| < ε
Iteration 0
Initial values
PR(i) = 1/N
Iteration 1-99
Convergence process
Gradual stabilization
Iteration 100+
Convergence
Stable values

Matrix-based Calculation

Adjacency Matrix Example

For a simple 4-page network, we can represent the link structure as a matrix:

     A    B    C    D
A  [ 0   1/2  1/2   0 ]
B  [1/3   0   1/3  1/3]  
C  [1/2   0    0   1/2]
D  [ 0    1    0    0 ]

Matrix H (link transition matrix)

Google Matrix Construction

G = d × H + (1-d)/N × J

Where J is matrix of 1/N values:
     A     B     C     D
A  [0.25  0.25  0.25  0.25]
B  [0.25  0.25  0.25  0.25]
C  [0.25  0.25  0.25  0.25] 
D  [0.25  0.25  0.25  0.25]

Final Google Matrix (d=0.85)

       A      B      C      D
A  [0.0375  0.4625  0.4625  0.0375]
B  [0.3208  0.0375  0.3208  0.3208]
C  [0.4625  0.0375  0.0375  0.4625]
D  [0.0375  0.8875  0.0375  0.0375]

Network Visualization

Interactive PageRank Network

APR: 2.5BPR: 1.8CPR: 3.2DPR: 1.9

Visualization Explanation

  • Node size: Represents PageRank value
  • Arrows: Show link direction
  • Animation: Simulates PageRank flow
  • Colors: Blue = normal, Purple = active iteration

Practical Calculation Examples

Interactive PageRank Calculator

Calculation:

PR(A) = (1-d) + d × (PR(B)/L(B))

PR(A) = (1-0.85) + 0.85 × (5/10)

PR(A) = 0.150 + 0.425

PR(A) = 0.575

Example 1: Simple calculation

Page A receives a link from page B with PageRank 5.0

Page B has 10 outgoing links

PR(A) = 0.15 + 0.85 × (5.0/10) = 0.15 + 0.425 = 0.575

Example 2: Multiple links

Page A receives links from page B (PR=3.0, 5 links) and C (PR=2.0, 2 links)

PR(A) = 0.15 + 0.85 × (3.0/5 + 2.0/2) = 0.15 + 0.85 × 1.6 = 1.51

Modern vs. Original PageRank

Original PageRank (1998-2010)

  • • Primary ranking factor
  • • Publicly accessible (Toolbar)
  • • Simple link-based algorithm
  • • Vulnerable to manipulation
  • • Monthly updates

Modern PageRank (2010+)

  • • One of hundreds of factors
  • • Internal Google tool
  • • Integrated with machine learning
  • • Spam-resistant improvements
  • • Real-time updates

Modern Improvements

Personalized PageRank:

Adjusted based on user interests and search history

Topical PageRank:

Weighting based on topic relevance and context

TrustRank integration:

Combined with trust signals for spam protection

Temporal factors:

Time-based weighting of links and freshness signals

Limitations and Challenges

Manipulation and spam

  • • Link farms and PBN networks
  • • Artificial link exchanges
  • • Purchased links for manipulation
  • • Comment spam and forum spam

Technical challenges

  • • Computational complexity for billions of pages
  • • Dangling nodes (pages without outgoing links)
  • • Spider traps and infinite loops
  • • Scaling to real-time updates

Conceptual limitations

  • • Focus only on link popularity, not content
  • • Bias toward older, established websites
  • • Ignores user intentions and context
  • • Static model vs. dynamic web

PageRank's Significance Today

Still Fundamental

PageRank is still a core part of Google's algorithm, although it now works in combination with hundreds of other ranking factors. The fundamental idea of link-based authority remains central to how Google evaluates web pages.

~200
Ranking factors
PageRank is one of many
25+
Years active
Since 1998
Core
Status
Still fundamental

Practical implications for SEO

Links are still important:

Quality links from authoritative pages still have high value

Focus on quality:

Get links from relevant, trustworthy sources

Internal links:

Distribute PageRank strategically on your own site

Holistic approach:

Combine link building with content and technical SEO

Want to master modern link building?

Now that you understand the mathematical background of PageRank, you can learn to apply this knowledge practically in modern SEO and link building strategies.

Read Link Building Book