<!DOCTYPE HTML PUBLIC "-//W3C//DTD HTML 4.01 Transitional//EN">
<HTML
><HEAD
><TITLE
>Introduction</TITLE
><META
NAME="GENERATOR"
CONTENT="Modular DocBook HTML Stylesheet Version 1.7"><LINK
REL="HOME"
TITLE="libpcapnav Manual"
HREF="index.html"><LINK
REL="PREVIOUS"
TITLE="libpcapnav Manual"
HREF="index.html"><LINK
REL="NEXT"
TITLE="Using libpcapnav"
HREF="c61.html"><LINK
REL="STYLESHEET"
TYPE="text/css"
HREF="stylesheet.css"></HEAD
><BODY
CLASS="CHAPTER"
BGCOLOR="#FFFFFF"
TEXT="#000000"
LINK="#0000FF"
VLINK="#840084"
ALINK="#0000FF"
><DIV
CLASS="NAVHEADER"
><TABLE
SUMMARY="Header navigation table"
WIDTH="100%"
BORDER="0"
CELLPADDING="0"
CELLSPACING="0"
><TR
><TH
COLSPAN="3"
ALIGN="center"
><CODE
CLASS="FUNCTION"
>libpcapnav</CODE
> Manual</TH
></TR
><TR
><TD
WIDTH="10%"
ALIGN="left"
VALIGN="bottom"
><A
HREF="index.html"
ACCESSKEY="P"
>Prev</A
></TD
><TD
WIDTH="80%"
ALIGN="center"
VALIGN="bottom"
></TD
><TD
WIDTH="10%"
ALIGN="right"
VALIGN="bottom"
><A
HREF="c61.html"
ACCESSKEY="N"
>Next</A
></TD
></TR
></TABLE
><HR
ALIGN="LEFT"
WIDTH="100%"></DIV
><DIV
CLASS="CHAPTER"
><H1
><A
NAME="AEN17"
></A
>Introduction</H1
><DIV
CLASS="TOC"
><DL
><DT
><B
>Table of Contents</B
></DT
><DT
><A
HREF="c17.html#AEN21"
>What is <CODE
CLASS="FUNCTION"
>libpcapnav</CODE
>?</A
></DT
><DT
><A
HREF="c17.html#AEN30"
>How does it work?</A
></DT
></DL
></DIV
><P
>    Welcome! You're looking at the manual for <CODE
CLASS="FUNCTION"
>libpcapnav</CODE
>. Thanks for reading this.
    </P
><BR
CLEAR="all"><DIV
CLASS="SECT1"
><H1
CLASS="SECT1"
><A
NAME="AEN21"
>What is <CODE
CLASS="FUNCTION"
>libpcapnav</CODE
>?</A
></H1
><P
>        <CODE
CLASS="FUNCTION"
>libpcapnav</CODE
> is a <CODE
CLASS="FUNCTION"
>libpcap</CODE
> wrapper library that allows navigation to
        arbitrary locations in a <CODE
CLASS="FUNCTION"
>tcpdump</CODE
> trace file between reads.
        The API is intentionally similar to that of the pcap library.
        You can navigate in trace files both in time and in space: you
        can jump to the packet located approximately two-thirds of the
        way through the trace, or you can jump as closely as possible
        to a packet with a given timestamp, and then read packets from
        there. In addition, the API provides convenience functions for
        manipulating timeval structures.
      </P
><P
>        Like <CODE
CLASS="FUNCTION"
>libpcap</CODE
>, this library manages its state through an opaque
        handle struct. For trace file navigation and reading packets,
        this handle is enough. If you need to apply BPF filters or
        write packets to disk, you can access the familiar pcap
        handle that is used internally.
      </P
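><P
>        Navigating in space and in time both come down to choosing a
        byte offset to aim for. The sketch below illustrates that idea
        only. The helpers tv_to_usec, offset_for_fraction and
        fraction_for_time are hypothetical names made up for this
        example; they are not functions of the library, and the
        arithmetic is an assumption about how such a target could be
        computed.
      </P
>

```c
#include <assert.h>
#include <stdint.h>
#include <sys/time.h>

/* Hypothetical helpers for illustration; these are not libpcapnav functions. */

/* Convert a timeval to a single count of microseconds. */
static int64_t tv_to_usec(struct timeval tv)
{
    return (int64_t)tv.tv_sec * 1000000 + (int64_t)tv.tv_usec;
}

/*
 * Navigation in space: byte offset to aim for when jumping to a fraction
 * (0.0 .. 1.0) of the packet data. data_start is the offset of the first
 * packet record, i.e. the size of the file header.
 */
static int64_t offset_for_fraction(int64_t data_start, int64_t file_size,
                                   double fraction)
{
    assert(data_start >= 0 && file_size >= data_start);

    if (fraction < 0.0) fraction = 0.0;
    if (fraction > 1.0) fraction = 1.0;
    return data_start + (int64_t)((double)(file_size - data_start) * fraction);
}

/*
 * Navigation in time: position of target within [first, last], expressed
 * as a fraction 0.0 .. 1.0. Targets outside the trace are clamped.
 */
static double fraction_for_time(struct timeval first, struct timeval last,
                                struct timeval target)
{
    int64_t span = tv_to_usec(last) - tv_to_usec(first);
    int64_t pos  = tv_to_usec(target) - tv_to_usec(first);

    if (span <= 0 || pos <= 0) return 0.0;
    if (pos >= span) return 1.0;
    return (double)pos / (double)span;
}
```

<P
>        For a 1024-byte trace whose classic 24-byte file header is
        followed by packet records, offset_for_fraction(24, 1024, 0.5)
        yields 524. Such an offset will usually fall in the middle of
        a packet, which is why the resynchronization described in the
        next section is needed.
      </P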
></DIV
><BR
CLEAR="all"><DIV
CLASS="SECT1"
><H1
CLASS="SECT1"
><A
NAME="AEN30"
>How does it work?</A
></H1
><P
>        At the core of <CODE
CLASS="FUNCTION"
>libpcapnav</CODE
> is the ability to resynchronize to
        the sequence of packets contained in a <CODE
CLASS="FUNCTION"
>tcpdump</CODE
> trace file
        at an arbitrary position of the file position indicator. The
        algorithm is based on Vern Paxson's method from the <CODE
CLASS="FUNCTION"
>tcpslice</CODE
>
        tool, which works roughly as follows: the point at which the
        file position indicator is to be synchronized with the packet
        sequence is deliberately undershot a little, because it is much
        easier to scan forwards to the desired location once the packet
        sequence has been detected. The file is scanned from that initial
        offset in single-byte steps, at each step assuming a <CODE
CLASS="FUNCTION"
>libpcap</CODE
>
        packet header is present and sanity-checking the values read.
        Several checks test this potential header for sane timestamps,
        capture lengths, etc. If the header appears valid, the next
        packet header is examined in the same fashion, at the offset
        implied by the header just checked. If a sequence
        of three packets seems valid, the algorithm considers the
        file position indicator to be synchronized with the packet flow
        and scans as closely as possible to the desired location.
        If the synchronization point is supposed to be a packet with
        a given timestamp, some interpolation is done and the process
        is repeated until the packet closest to the desired timestamp
        has been found.
      </P
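><P
>        The scan described above can be sketched over an in-memory
        buffer as follows. This is a simplified illustration, not the
        tcpslice or libpcapnav code: the names rec_hdr, limits,
        plausible and resync are hypothetical, the limits used by the
        sanity checks are assumptions, headers are read in host byte
        order only, and the classic 16-byte record header (seconds,
        microseconds, capture length, original length) is assumed.
      </P
>

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>
#include <string.h>

#define CHAIN_LEN 3   /* headers that must look valid in a row */

/* Classic on-disk record header: four 32-bit fields, 16 bytes in total. */
struct rec_hdr {
    uint32_t sec;     /* timestamp, seconds */
    uint32_t usec;    /* timestamp, microseconds */
    uint32_t caplen;  /* bytes of packet data stored in the file */
    uint32_t len;     /* original packet length on the wire */
};

/* Plausibility limits; illustrative assumptions, not libpcapnav's values. */
struct limits {
    uint32_t min_sec, max_sec;  /* accepted timestamp window */
    uint32_t snaplen;           /* largest accepted capture length */
};

/* Sanity-check one potential header. */
static int plausible(const struct rec_hdr *h, const struct limits *lim)
{
    return h->sec >= lim->min_sec && h->sec <= lim->max_sec &&
           h->usec < 1000000 &&
           h->caplen > 0 && h->caplen <= lim->snaplen &&
           h->caplen <= h->len;
}

/*
 * Scan buf in single-byte steps from start. Return the first offset at
 * which CHAIN_LEN consecutive headers look plausible, or -1 if there is
 * no such offset. Byte-swapped traces are not handled here.
 */
static long resync(const unsigned char *buf, size_t size, size_t start,
                   const struct limits *lim)
{
    assert(buf != NULL && lim != NULL);

    for (size_t off = start; off + sizeof(struct rec_hdr) <= size; off++) {
        size_t pos = off;
        int chain = 0;

        while (chain < CHAIN_LEN && pos + sizeof(struct rec_hdr) <= size) {
            struct rec_hdr h;

            memcpy(&h, buf + pos, sizeof h);
            if (!plausible(&h, lim))
                break;
            chain++;
            /* The header just checked tells us where the next one starts. */
            pos += sizeof h + h.caplen;
        }
        if (chain == CHAIN_LEN)
            return (long)off;
    }
    return -1;
}
```

<P
>        A caller would pass a start offset slightly before the
        desired location, then read forwards packet by packet from
        the returned offset. Tighter limits make false matches inside
        packet payloads less likely, but cannot rule them out.
      </P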
><P
>        <CODE
CLASS="FUNCTION"
>libpcapnav</CODE
>'s algorithm contains a few modifications that are explained
	in gory detail in the Netdude
	<A
HREF="http://netdude.sourceforge.net/doco/netdude-freenix2004/index.html"
TARGET="_top"
>Freenix paper</A
>,
	and briefly listed here:
      </P
><P
></P
><UL
><LI
><P
>            <CODE
CLASS="FUNCTION"
>libpcapnav</CODE
> doesn't use Vern's state-machine approach to determine
            definitive header matches. I've done a lot of my
            testing with a trace that was captured while NFS-copying
            another trace file, and which therefore contains lots of
            "bogus" headers to make things fun. This data causes a
            number of nasty problems, such as large snaplens in the
            captured data (where a single packet may contain many
            smaller packets) or payload packets whose caplen happens
            to make the next read land directly on the next valid
            header. Much of this should be caught by the checks for
            invalid timestamps, but this is not 100% reliable.
          </P
><P
>            To rectify this, pcapnav uses a different approach: once
            a header is found that does not immediately appear to be
            invalid, the chain of packets that it starts is followed, up
            to a maximum number of packets or until we're out of buffer
            space.
          </P
><P
>            For this, the buffer already containing data loaded from disk
            is used as much as possible, but when this buffer doesn't
            suffice, more data is loaded from disk. The hope is that
            most attempts will point to invalid headers anyway, so that
            this additional load never happens unless we have good
            reason to believe we've actually found a good header. The
            difference between <CODE
CLASS="CONSTANT"
>PCAPNAV_PERHAPS</CODE
> and
            <CODE
CLASS="CONSTANT"
>PCAPNAV_DEFINITELY</CODE
> (explained in detail
            later in this document)
            is then based on the length of the chain found.
          </P
><P
>            While checking headers, the best valid header (i.e. the one
            with the longest chain) is remembered, as well as the offset
            in the trace of the packet that succeeds it,
            so that the successor isn't mistaken for a "new" good header.
          </P
><P
>            The fun part, without doubt, is header clashes. A clash in
            this new system occurs when two headers have the same,
            maximum, chain length and the same level of reliability
            of the chain lengths (e.g. the chain search could have been
            stopped because we were out of buffer space or because we
            hit the limit of packets we check; the latter is
            considered more reliable).
          </P
><P
>            If we hit a clash, we simply forget the old best match and
            keep looking after the clash packet. If we cannot find any
            better headers afterwards, we return a clash, otherwise the
            best match found afterwards.
	  </P
></LI
><LI
><P
>            I've seen traces with rather strange final packet headers,
            containing invalid caplen/len field values and packet data.
            To make sure we don't miss the last few correct packet
            headers, I've added some padding space and thus start
            looking for the last packet in the trace a bit earlier
            in the file. As the last-packet timestamp and offset are
            buffered in the pcapnav_t handle anyway, this performance
            hit is probably negligible.
	  </P
></LI
><LI
><P
>            To find the last packet in a trace, we now go back much
            further from the end of the trace, then find a packet more
            reliably by using the chain approach described above,
            and then use pcap to iterate to the last valid packet.
            This is slower, but safer.
          </P
></LI
><LI
><P
>            A buffer abstraction was introduced to help reduce the
            number of local variables and parameters to functions.
            See <TT
CLASS="FILENAME"
>pcapnav_buf.h</TT
>.
          </P
></LI
><LI
><P
>            The original tcpslice version used the <CODE
CLASS="CONSTANT"
>PACKET_HDR_LEN</CODE
> macro,
            yielding the size of a struct pcap_pkthdr, even when the
            trace file at hand actually uses the extended, larger
            patched headers. pcapnav instead uses the header size
            that matches the trace file.
	  </P
></LI
></UL
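><P
>        The rules for remembering the best header and for handling
        clashes can be sketched roughly as follows. This is one
        reading of the prose above, not the library's actual code:
        the types candidate, stop_reason and outcome and the functions
        rank_cmp and pick_best are hypothetical, candidates are assumed
        to arrive in order of increasing offset, and "better" after a
        clash is assumed to mean strictly better than the clashing
        headers. Skipping a whole run of successors, rather than only
        the first one, is also an assumption.
      </P
>

```c
#include <assert.h>
#include <stddef.h>

/* Why a chain search stopped; a higher value counts as more reliable. */
enum stop_reason { STOP_BUFFER_END = 0, STOP_PACKET_LIMIT = 1 };

/* One plausible header found while scanning (hypothetical type). */
struct candidate {
    long offset;              /* where the header starts */
    long next_offset;         /* where its successor starts */
    int chain_len;            /* number of packets in its chain */
    enum stop_reason reason;  /* why the chain search ended */
};

enum outcome { RESULT_NONE, RESULT_MATCH, RESULT_CLASH };

/* Longer chains win; for equal lengths the more reliable stop reason wins. */
static int rank_cmp(const struct candidate *a, const struct candidate *b)
{
    if (a->chain_len != b->chain_len)
        return a->chain_len > b->chain_len ? 1 : -1;
    return (int)a->reason - (int)b->reason;
}

/*
 * Walk the candidates in offset order. On RESULT_MATCH, *match_offset is
 * set to the offset of the best header; otherwise it is left untouched.
 */
static enum outcome pick_best(const struct candidate *c, size_t n,
                              long *match_offset)
{
    struct candidate top = { 0, 0, 0, STOP_BUFFER_END };
    int have_top = 0, clash = 0;
    long successor = -1;

    assert(match_offset != NULL);

    for (size_t i = 0; i < n; i++) {
        /* The successor of a remembered header is not a "new" header. */
        if (have_top && c[i].offset == successor) {
            successor = c[i].next_offset;
            continue;
        }

        int cmp = have_top ? rank_cmp(&c[i], &top) : 1;
        if (cmp > 0) {            /* strictly better: new best match */
            top = c[i];
            have_top = 1;
            clash = 0;
            successor = c[i].next_offset;
        } else if (cmp == 0) {    /* equal rank: forget the old best match */
            clash = 1;
            successor = c[i].next_offset;
        }
    }

    if (!have_top)
        return RESULT_NONE;
    if (clash)
        return RESULT_CLASH;
    *match_offset = top.offset;
    return RESULT_MATCH;
}
```

<P
>        Two headers at offsets 100 and 120 that both start a chain of
        five packets ended by the packet limit produce RESULT_CLASH.
        If a third header with a chain of six follows, the result is
        a match on that third header instead.
      </P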
></DIV
></DIV
><DIV
CLASS="NAVFOOTER"
><HR
ALIGN="LEFT"
WIDTH="100%"><TABLE
SUMMARY="Footer navigation table"
WIDTH="100%"
BORDER="0"
CELLPADDING="0"
CELLSPACING="0"
><TR
><TD
WIDTH="33%"
ALIGN="left"
VALIGN="top"
><A
HREF="index.html"
ACCESSKEY="P"
>Prev</A
></TD
><TD
WIDTH="34%"
ALIGN="center"
VALIGN="top"
><A
HREF="index.html"
ACCESSKEY="H"
>Home</A
></TD
><TD
WIDTH="33%"
ALIGN="right"
VALIGN="top"
><A
HREF="c61.html"
ACCESSKEY="N"
>Next</A
></TD
></TR
><TR
><TD
WIDTH="33%"
ALIGN="left"
VALIGN="top"
><CODE
CLASS="FUNCTION"
>libpcapnav</CODE
> Manual</TD
><TD
WIDTH="34%"
ALIGN="center"
VALIGN="top"
>&nbsp;</TD
><TD
WIDTH="33%"
ALIGN="right"
VALIGN="top"
>Using <CODE
CLASS="FUNCTION"
>libpcapnav</CODE
></TD
></TR
></TABLE
></DIV
></BODY
></HTML
>