This file contains an explanation of the crawl variables. $CRAWL is assumed
to be a $CRAWLER_OBJECT returned by crawl_new().

---------------------------------------------------------------------------
Crawl data structures
---------------------------------------------------------------------------

%$CRAWL->{config}
    - Configuration values (see below); key=config key, value=value of key

&$CRAWL->{crawl}
    - Subfunction which just calls LW2::crawl($CRAWL)

&$CRAWL->{reset}
    - Subfunction which resets all the values in $CRAWL

%$CRAWL->{track}
    - All the URLs seen/requested; key=URL, value=HTTP response code, or
      '?' if not actually requested

%$CRAWL->{request}
    - Libwhisker request hash used during crawling

%$CRAWL->{response}
    - Libwhisker response hash used during crawling

$CRAWL->{depth}
    - Default max depth set by crawl_new()

$CRAWL->{start}
    - Default start URL set by crawl_new()

@$CRAWL->{errors}
    - All errors encountered during crawling

@$CRAWL->{urls}
    - Temporary array used internally by crawl()

%$CRAWL->{server_tags}
    - Server banners encountered while crawling; key=banner, value=# times
      seen

%$CRAWL->{referrers}
    - Keeps track of who refers to what URL; key=target URL, value=anon
      array of all URLs that point to it

%$CRAWL->{offsites}
    - All URLs that point to other hosts; key=URL, value=# times seen

%$CRAWL->{non_http}
    - All non-http/https URLs found; key=URL, value=# times seen

%$CRAWL->{cookies}
    - All cookies encountered during crawling; key=cookie string, value=#
      times seen

%$CRAWL->{forms}
    - URLs which were the target of