File: wal.html

package info (click to toggle)
pgadmin3 1.4.3-2
links: PTS
area: main
in suites: etch, etch-m68k
size: 29,796 kB
ctags: 10,758
sloc: cpp: 55,356; sh: 6,164; ansic: 1,520; makefile: 576; sql: 482; xml: 100; perl: 18
file content (101 lines) | stat: -rw-r--r-- 6,093 bytes
<html>
<head>
<meta http-equiv="Content-Type" content="text/html; charset=ISO-8859-1">
<title>Chapter26.Reliability and the Write-Ahead Log</title>
<link rel="stylesheet" href="stylesheet.css" type="text/css">
<link rev="made" href="pgsql-docs@postgresql.org">
<meta name="generator" content="DocBook XSL Stylesheets V1.70.0">
<link rel="start" href="index.html" title="PostgreSQL 8.1.4 Documentation">
<link rel="up" href="admin.html" title="PartIII.Server Administration">
<link rel="prev" href="disk-full.html" title="25.2.Disk Full Failure">
<link rel="next" href="wal-intro.html" title="26.2.Write-Ahead Logging (WAL)">
<link rel="copyright" href="ln-legalnotice.html" title="Legal Notice">
</head>
<body bgcolor="white" text="black" link="#0000FF" vlink="#840084" alink="#0000FF"><div class="chapter" lang="en" id="wal">
<div class="titlepage"><div><div><h2 class="title">
<a name="wal"></a>Chapter26.Reliability and the Write-Ahead Log</h2></div></div></div>
<div class="toc">
<p><b>Table of Contents</b></p>
<dl>
<dt><span class="sect1"><a href="wal.html#wal-reliability">26.1. Reliability</a></span></dt>
<dt><span class="sect1"><a href="wal-intro.html">26.2. Write-Ahead Logging (<acronym class="acronym">WAL</acronym>)</a></span></dt>
<dt><span class="sect1"><a href="wal-configuration.html">26.3. <acronym class="acronym">WAL</acronym> Configuration</a></span></dt>
<dt><span class="sect1"><a href="wal-internals.html">26.4. WAL Internals</a></span></dt>
</dl>
</div>
<p>  This chapter explain how the Write-Ahead Log is used to obtain
  efficient, reliable operation.
 </p>
<div class="sect1" lang="en">
<div class="titlepage"><div><div><h2 class="title" style="clear: both">
<a name="wal-reliability"></a>26.1.Reliability</h2></div></div></div>
<p>   Reliability is an important property of any serious database
   system, and <span class="productname">PostgreSQL</span> does everything possible to
   guarantee reliable operation. One aspect of reliable operation is
   that all data recorded by a committed transaction should be stored
   in a nonvolatile area that is safe from power loss, operating
   system failure, and hardware failure (except failure of the
   nonvolatile area itself, of course).  Successfully writing the data
   to the computer's permanent storage (disk drive or equivalent)
   ordinarily meets this requirement.  In fact, even if a computer is
   fatally damaged, if the disk drives survive they can be moved to
   another computer with similar hardware and all committed
   transactions will remain intact.
  </p>
<p>   While forcing data periodically to the disk platters might seem like
   a simple operation, it is not. Because disk drives are dramatically
   slower than main memory and CPUs, several layers of caching exist
   between the computer's main memory and the disk platters.
   First, there is the operating system's buffer cache, which caches
   frequently requested disk blocks and combines disk writes. Fortunately,
   all operating systems give applications a way to force writes from
   the buffer cache to disk, and <span class="productname">PostgreSQL</span> uses those
   features.  (See the <a href="runtime-config-wal.html#guc-wal-sync-method">wal_sync_method</a> parameter
   to adjust how this is done.)
  </p>
<p>   Next, there may be a cache in the disk drive controller; this is
   particularly common on <acronym class="acronym">RAID</acronym> controller cards. Some of
   these caches are <em class="firstterm">write-through</em>, meaning writes are passed
   along to the drive as soon as they arrive. Others are
   <em class="firstterm">write-back</em>, meaning data is passed on to the drive at
   some later time. Such caches can be a reliability hazard because the
   memory in the disk controller cache is volatile, and will lose its
   contents in a power failure.  Better controller cards have
   <em class="firstterm">battery-backed</em> caches, meaning the card has a battery that
   maintains power to the cache in case of system power loss.  After power
   is restored the data will be written to the disk drives.
  </p>
<p>   And finally, most disk drives have caches. Some are write-through
   while some are write-back, and the
   same concerns about data loss exist for write-back drive caches as
   exist for disk controller caches.  Consumer-grade IDE drives are
   particularly likely to contain write-back caches that will not
   survive a power failure.
  </p>
<p>   When the operating system sends a write request to the disk hardware,
   there is little it can do to make sure the data has arrived at a truly
   non-volatile storage area. Rather, it is the
   administrator's responsibility to be sure that all storage components
   ensure data integrity.  Avoid disk controllers that have non-battery-backed
   write caches.  At the drive level, disable write-back caching if the
   drive cannot guarantee the data will be written before shutdown.
  </p>
<p>   Another risk of data loss is posed by the disk platter write
   operations themselves. Disk platters are divided into sectors,
   commonly 512 bytes each.  Every physical read or write operation
   processes a whole sector.
   When a write request arrives at the drive, it might be for 512 bytes,
   1024 bytes, or 8192 bytes, and the process of writing could fail due
   to power loss at any time, meaning some of the 512-byte sectors were
   written, and others were not.  To guard against such failures,
   <span class="productname">PostgreSQL</span> periodically writes full page images to
   permanent storage <span class="emphasis"><em>before</em></span> modifying the actual page on
   disk. By doing this, during crash recovery <span class="productname">PostgreSQL</span> can
   restore partially-written pages.  If you have a battery-backed disk
   controller or file-system software (e.g., Reiser4) that prevents partial
   page writes,  you can turn off this page imaging by using the 
   <a href="runtime-config-wal.html#guc-full-page-writes">full_page_writes</a> parameter.
  </p>
</div>
</div></body>
</html>