<html>
<head>
<meta http-equiv="Content-Type" content="text/html; charset=ISO-8859-1">
<title>34.6. Rules versus Triggers</title>
<link rel="stylesheet" href="stylesheet.css" type="text/css">
<link rev="made" href="pgsql-docs@postgresql.org">
<meta name="generator" content="DocBook XSL Stylesheets V1.70.0">
<link rel="start" href="index.html" title="PostgreSQL 8.1.4 Documentation">
<link rel="up" href="rules.html" title="Chapter 34. The Rule System">
<link rel="prev" href="rules-status.html" title="34.5. Rules and Command Status">
<link rel="next" href="xplang.html" title="Chapter 35. Procedural Languages">
<link rel="copyright" href="ln-legalnotice.html" title="Legal Notice">
</head>
<body bgcolor="white" text="black" link="#0000FF" vlink="#840084" alink="#0000FF"><div class="sect1" lang="en">
<div class="titlepage"><div><div><h2 class="title" style="clear: both">
<a name="rules-triggers"></a>34.6. Rules versus Triggers</h2></div></div></div>
<a name="id719460"></a><a name="id719476"></a><p>    Many things that can be done using triggers can also be
    implemented using the <span class="productname">PostgreSQL</span>
    rule system.  Some kinds of constraints, especially foreign keys,
    cannot be implemented by rules. It is possible
    to place a qualified rule that rewrites a command to <code class="literal">NOTHING</code>
    if the value of a column does not appear in another table.
    But then the data is silently thrown away, which is
    not a good idea. If checks for valid values are required,
    and an invalid value should produce an error message,
    this must be done by a trigger.</p>
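<p>    As a minimal sketch of the two approaches, assume the
    <code class="literal">computer</code> and <code class="literal">software</code>
    tables from the example later in this section. The rule, function, and
    trigger names are hypothetical, and the two definitions are alternatives
    rather than something to install together. The qualified rule discards an
    offending row silently, while the trigger rejects it with an error:

</p>
<pre class="programlisting">-- Alternative 1 (rule): silently drop inserts that name an unknown host.
CREATE RULE software_ins_check AS ON INSERT TO software
    WHERE NOT EXISTS (SELECT 1 FROM computer
                       WHERE computer.hostname = NEW.hostname)
    DO INSTEAD NOTHING;

-- Alternative 2 (trigger): report the problem instead of hiding it.
CREATE FUNCTION software_host_check() RETURNS trigger AS $$
BEGIN
    IF NOT EXISTS (SELECT 1 FROM computer
                    WHERE computer.hostname = NEW.hostname) THEN
        RAISE EXCEPTION 'unknown hostname: %', NEW.hostname;
    END IF;
    RETURN NEW;  -- accept the row unchanged
END;
$$ LANGUAGE plpgsql;

CREATE TRIGGER software_host_check
    BEFORE INSERT OR UPDATE ON software
    FOR EACH ROW EXECUTE PROCEDURE software_host_check();</pre>
<p>    This sketch checks only the referencing side. A complete foreign key
    also has to react to changes in <code class="literal">computer</code>.</p>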
<p>    On the other hand, a trigger that is fired on
    <code class="command">INSERT</code> on a view can do the same as a rule: put
    the data somewhere else and suppress the insert in the view. But
    it cannot do the same thing on <code class="command">UPDATE</code> or
    <code class="command">DELETE</code>, because there is no real data in the
    view relation that could be scanned, and thus the trigger would
    never get called. Only a rule will help.</p>
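<p>    A sketch of such a rule, assuming a hypothetical view
    <code class="literal">computer_bim</code> over the
    <code class="literal">computer</code> table used in the example later in
    this section (the view and rule names are made up for illustration):

</p>
<pre class="programlisting">-- Hypothetical view showing only the computers of one manufacturer.
CREATE VIEW computer_bim AS
    SELECT hostname FROM computer WHERE manufacturer = 'bim';

-- Redirect deletes on the view to the underlying table.
CREATE RULE computer_bim_del AS ON DELETE TO computer_bim
    DO INSTEAD
    DELETE FROM computer WHERE hostname = OLD.hostname;</pre>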
<p>    For the things that can be implemented by both, which is best
    depends on the usage of the database.
    A trigger is fired once for each affected row. A rule manipulates
    the query or generates an additional query. So if many
    rows are affected in one statement, a rule issuing one extra
    command is likely to be faster than a trigger that is
    called for every single row and must execute its operations
    many times.  However, the trigger approach is conceptually far
    simpler than the rule approach, and is easier for novices to get right.</p>
<p>    Here we show an example of how the choice of rules versus triggers
    plays out in one situation.  There are two tables:

</p>
<pre class="programlisting">CREATE TABLE computer (
    hostname        text,    -- indexed
    manufacturer    text     -- indexed
);

CREATE TABLE software (
    software        text,    -- indexed
    hostname        text     -- indexed
);</pre>
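<p>    The comments above only note which columns are indexed. One possible set
    of definitions is sketched here. The names
    <code class="literal">comp_hostidx</code>, <code class="literal">comp_manufidx</code>,
    and <code class="literal">soft_hostidx</code> are the ones that appear in the
    plans in this section, <code class="literal">soft_softidx</code> is a made-up
    name, and declaring the <code class="structfield">hostname</code> indexes
    unique follows the description in the text:

</p>
<pre class="programlisting">CREATE UNIQUE INDEX comp_hostidx  ON computer (hostname);
CREATE INDEX        comp_manufidx ON computer (manufacturer);
CREATE UNIQUE INDEX soft_hostidx  ON software (hostname);
CREATE INDEX        soft_softidx  ON software (software);  -- hypothetical name</pre>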
<p>

    Both tables have many thousands of rows and the indexes on
    <code class="structfield">hostname</code> are unique.  The rule or trigger should
    implement a constraint that deletes rows from <code class="literal">software</code>
    that reference a deleted computer.  The trigger would use this command:

</p>
<pre class="programlisting">DELETE FROM software WHERE hostname = $1;</pre>
<p>

    Since the trigger is called for each individual row deleted from
    <code class="literal">computer</code>, it can prepare and save the plan for this
    command and pass the <code class="structfield">hostname</code> value in the
    parameter.  The rule would be written as

</p>
<pre class="programlisting">CREATE RULE computer_del AS ON DELETE TO computer
    DO DELETE FROM software WHERE hostname = OLD.hostname;</pre>
<p>
   </p>
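<p>    For comparison, the trigger side could be sketched as follows. The text
    does not say which language the trigger is written in, so
    <span class="productname">PL/pgSQL</span> is an assumption here, and the
    function and trigger names are hypothetical:

</p>
<pre class="programlisting">CREATE FUNCTION computer_del_software() RETURNS trigger AS $$
BEGIN
    -- PL/pgSQL prepares this statement on first use in a session and can
    -- reuse the saved plan; OLD.hostname plays the role of parameter $1.
    DELETE FROM software WHERE hostname = OLD.hostname;
    RETURN NULL;  -- the return value of an AFTER row trigger is ignored
END;
$$ LANGUAGE plpgsql;

CREATE TRIGGER computer_del_software
    AFTER DELETE ON computer
    FOR EACH ROW EXECUTE PROCEDURE computer_del_software();</pre>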
<p>    Now we look at different types of deletes. In the case of

</p>
<pre class="programlisting">DELETE FROM computer WHERE hostname = 'mypc.local.net';</pre>
<p>

    the table <code class="literal">computer</code> is scanned by index (fast), and the
    command issued by the trigger would also use an index scan (also fast).
    The extra command from the rule would be

</p>
<pre class="programlisting">DELETE FROM software WHERE computer.hostname = 'mypc.local.net'
                       AND software.hostname = computer.hostname;</pre>
<p>

    Since appropriate indexes are set up, the planner
    will create a plan of

</p>
<pre class="literallayout">Nestloop
  -&gt;  Index Scan using comp_hostidx on computer
  -&gt;  Index Scan using soft_hostidx on software</pre>
<p>

    So there would not be much difference in speed between
    the trigger and the rule implementation.
   </p>
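<p>    To see which plans the planner actually picks on a given installation,
    <code class="command">EXPLAIN</code> can be run on the original command.
    With the rule <code class="literal">computer_del</code> in place, it is
    expected to show a plan for each command produced by the rule system. Plans
    depend on table sizes and statistics, so the output may differ from the
    listings in this section:

</p>
<pre class="programlisting">-- Refresh planner statistics first so the estimates reflect the data.
ANALYZE computer;
ANALYZE software;

-- Shows the plans without executing the delete.
EXPLAIN DELETE FROM computer WHERE hostname = 'mypc.local.net';</pre>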
<p>    With the next delete we want to get rid of all the 2000 computers
    where the <code class="structfield">hostname</code> starts with
    <code class="literal">old</code>. There are two possible commands to do that. One
    is

</p>
<pre class="programlisting">DELETE FROM computer WHERE hostname &gt;= 'old'
                       AND hostname &lt;  'ole';</pre>
<p>

    The command added by the rule will be

</p>
<pre class="programlisting">DELETE FROM software WHERE computer.hostname &gt;= 'old' AND computer.hostname &lt; 'ole'
                       AND software.hostname = computer.hostname;</pre>
<p>

    with the plan

</p>
<pre class="literallayout">Hash Join
  -&gt;  Seq Scan on software
  -&gt;  Hash
    -&gt;  Index Scan using comp_hostidx on computer</pre>
<p>

    The other possible command is

</p>
<pre class="programlisting">DELETE FROM computer WHERE hostname ~ '^old';</pre>
<p>

    which results in the following execution plan for the command
    added by the rule:

</p>
<pre class="literallayout">Nestloop
  -&gt;  Index Scan using comp_hostidx on computer
  -&gt;  Index Scan using soft_hostidx on software</pre>
<p>

    This shows that the planner does not realize that the
    qualification for <code class="structfield">hostname</code> in
    <code class="literal">computer</code> could also be used for an index scan on
    <code class="literal">software</code> when there are multiple qualification
    expressions combined with <code class="literal">AND</code>, which is what it does
    in the regular-expression version of the command. The trigger will
    get invoked once for each of the 2000 old computers that have to be
    deleted, and that will result in one index scan over
    <code class="literal">computer</code> and 2000 index scans over
    <code class="literal">software</code>. The rule implementation will do it with two
    commands that use indexes.  And it depends on the overall size of
    the table <code class="literal">software</code> whether the rule will still be faster in the
    sequential scan situation. 2000 command executions from the trigger over the SPI
    manager take some time, even if all the index blocks will soon be in the cache.</p>
<p>    The last command we look at is

</p>
<pre class="programlisting">DELETE FROM computer WHERE manufacturer = 'bim';</pre>
<p>

    Again, this could result in many rows being deleted from
    <code class="literal">computer</code>. So the trigger will again run many commands
    through the executor.  The command generated by the rule will be

</p>
<pre class="programlisting">DELETE FROM software WHERE computer.manufacturer = 'bim'
                       AND software.hostname = computer.hostname;</pre>
<p>

    The plan for that command will again be the nested loop over two
    index scans, only using a different index on <code class="literal">computer</code>:

</p>
<pre class="literallayout">Nestloop
  -&gt;  Index Scan using comp_manufidx on computer
  -&gt;  Index Scan using soft_hostidx on software</pre>
<p>

    In any of these cases, the extra commands from the rule system
    will be more or less independent of the number of affected rows
    in a command.</p>
<p>    In summary, rules will only be significantly slower than
    triggers if their actions result in large and badly qualified
    joins, a situation where the planner fails.</p>
</div></body>
</html>