--------------------------------------------------------------------
0.5.4 summary from 0.5.3 (fabien@juliana-multimedia.com)
--------------------------------------------------------------------
Fixes :
o Clarifications in the INSTALL manual.
o Added /usr/share/fonts/freefont to the default path for truetype fonts,
so that Redhat users can use the provided freefont rpm, which installs
FreeType fonts in this directory.
Changes / Additions :
o Better reverse dns algorithm, thanks to Lucas Nussbaum (lucas@lucas-nussbaum.net).
We now store the negative results in the database too.
Increased TTL for resolved cache entries from 3 days to 30 days.
o Added greatly improved engines.list, thanks to Enric Naval (enric@griho.net)
o W3C log format fix, forwarded by Klaus Reimer:
# ------------------------------------------------------------------------
Patch: webalizer-w3c.phu
For: Webalizer 2.01.10 + webalizer-w3c patch
From: Pierre Hubaut <phubaut@hotmail.com>
Desc: Support tab separator when handling W3C Extended Log Format (ELF).
Version: 1
This patch, in addition to the patch written by Klaus Reimer <k@ailis.de>
providing support for W3C ELF files, adds support of tab fields separator
which is encouraged by W3C.
Potentially known bug: It is unclear whether multiple tabs (or spaces, or a
mixture of whitespace characters) are allowed as a field separator; see
http://www.w3.org/TR/WD-logfile.html. The question is pending at W3C.
# ------------------------------------------------------------------------
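The separator handling described in the patch note can be illustrated with a short Python sketch. This is not the patch's C code: the function name split_w3c_fields and the sample entries are hypothetical, and treating any run of tabs or spaces as one separator is only one possible answer to the open question mentioned above.

```python
import re

def split_w3c_fields(line):
    """Split one W3C Extended Log Format entry on runs of tabs or spaces."""
    return re.split(r"[ \t]+", line.strip())

# Hypothetical entries: the same record, space-separated and tab-separated.
print(split_w3c_fields("2003-01-01 12:00:00 GET /index.html 200"))
print(split_w3c_fields("2003-01-01\t12:00:00\tGET\t/index.html\t200"))
```

Both calls produce the same five fields, so a parser built this way does not care which separator the server used.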
--------------------------------------------------------------------
0.5.3 summary from 0.5.2 (fabien@juliana-multimedia.com)
--------------------------------------------------------------------
Fixes :
o An off-by-one bug in a rarely executed test in dot_output.c.
In some circumstances, this could lead to an attempt to read
memory outside the address space allocated to The WebDruid,
leading to a segmentation fault.
o Do not crash when the engines list file is malformed because an
engine has no querytag field.
o Fixed a place in output.c where the HTML header was not written.
--------------------------------------------------------------------
0.5.2 summary from 0.5.1 (fabien@juliana-multimedia.com)
--------------------------------------------------------------------
Fixes :
o Text alignment in countries pie chart.
o Command line options given before the -c switch could be ignored if the
specified config file set the same option to a different value.
o ./configure fixes, added freetype and pthread detection. Reordered checks
for iconv and gettext. This allows us to --enable-static with ./configure.
Changes / Additions :
o DNS support is back. It now uses the generic logs management API.
o Increased MAXURLH & MAXREFH to 512.
o More work on XML support.
--------------------------------------------------------------------
0.5.1 summary from 0.5.0 (fabien@juliana-multimedia.com)
--------------------------------------------------------------------
Fixes :
o Various i18n fixes, French translation updates, German translation updates
(it seems it was broken before).
o Various fixes in dot_output.c, hashtab.h, preserve.c to be able
to compile on non C99 compliant compilers (thanks to Klaus Reimer
for raising the issue).
o Fixed Berkeley DB autodetection...seems somebody had a monkey playing
with this part of the code, it was totally broken. We were linking
to -ldb3 in all cases, which was plain wrong (thanks to Klaus Reimer
again).
o Don't link against -lnsl. It's useless.
o ./configure script update, config.h support,
Add detection macros for gettext, iconv stuff.
Changes / Additions :
o De-uglyfication Part I : white background for html pages & charts.
o Completed internationalisation. We use gd's ability to print text
using TrueType fonts. Requires gd >= 1.8.4.
o Path graphs & Users flow, if enabled, now default to use
the FreeSerif font.
o Added config file parameter to set the location of the dot utility.
Added config file parameter to set the font face used by gd2.
o Removed border around paths graphs & users flow drawings.
o Sample configuration file updates.
o With all above changes, compiling under cygwin now works.
--------------------------------------------------------------------
0.5.0 summary from webalizer 2.01-10-18 (fabien@juliana-multimedia.com)
--------------------------------------------------------------------
Fixes :
o Fixed memory leaks when parsing options from both
/etc/webalizer.conf and from the file given
at command line -- Valgrind helped me to do this.
o Increased MAXSRCHH to 128
Changes / Additions :
o Added path graphs support -- this is something like the
Pathalizer does. See README for more information.
o Added php* to page_type - page_type is now {php*, htm*, cgi}
o WebDruid now generates UTF-8 web pages. This allows for
better i18n output. It also solves the issue of non-ASCII
characters being displayed incorrectly when they appear in
users' requests.
o WebDruid now supports the CLF log format (used by IIS), thanks
to Klaus Reimer <k@ailis.de>
o Completely rewritten search engines support - ~ 2% overall
speed improvement with a list of 90 search engines.
~ 25% speed improvement with the same 'small' (10 items)
list that we had before.
o WebDruid now comes with an external text file ('engines.list')
which contains a preloaded list of about 90 search engines.
o WebDruid now displays search phrases grouped per engine,
as well as the URL(s) where the engine sent the user
('Target Url')
o Updated French Translation (only parts found in HTML output)
o Parser code rearranged
o WebDruid now supports a basic kind of versioning for its
incremental data file. Once the data format has changed, an
incremental data file in the old format will be ignored, and
a warning will be displayed.
o Work in progress : XML support
o Work in progress : new XHTML report
Removed :
o DNS lookup feature is no longer supported. It is still in the code,
but it is untested (read: obviously completely buggy).
It will reappear as soon as somebody gives me some log files
to test with.
o Squid proxy logs, as well as FTP logs, are not supported anymore.
This is because I want to keep the code as simple as possible,
avoiding if()'s everywhere. -- And yes, the WebDruid target is
web, and only web...
o HistoryName parameter removed : The Webdruid will now use
webalizer.hist by default.
o IncrementalName parameter removed : incremental.i will now be used,
'i' being the incremental data version number.
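The versioned file naming can be sketched in Python as follows. This is an illustration only, not WebDruid's C code: the function name pick_incremental, the version number 2, and the reading of 'incremental.i' as 'incremental.<number>' are all assumptions.

```python
DATA_VERSION = 2  # hypothetical current data format version

def pick_incremental(names, version=DATA_VERSION):
    """Return (usable_name_or_None, ignored_old_names) from a directory listing."""
    usable, ignored = None, []
    for name in names:
        prefix, _, suffix = name.partition(".")
        if prefix != "incremental" or not suffix.isdigit():
            continue  # not an incremental data file
        if int(suffix) == version:
            usable = name
        else:
            ignored.append(name)  # other format version: skip it
    return usable, ignored

usable, ignored = pick_incremental(["incremental.1", "incremental.2", "webalizer.hist"])
for name in ignored:
    print("Warning: ignoring %s (old data format)" % name)
```

Putting the version in the file name means an old file is never parsed with the wrong layout; it is simply skipped with a warning.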
--------------------------------------------------------------------
Webdruid Fork begins here
--------------------------------------------------------------------
--------------------------------------------------------------------
2.01-10-18 summary from 2.01-10 (Debian team)
--------------------------------------------------------------------
Fixes
o Webalizer now recognizes Opera browsers
o Increased MAXAGENT in webalizer.h
o Increase URL size in webalizer.h; hopefully this results in less
"oversized request field" warnings
Changes/Additions:
o Switched to GNU gettext for internationalization (Cyril Bouthors
and Johan Poirier)
o IPv6 port by Arkadiusz Mikiewicz <misiek@pld.ORG.PL>
for PLD GNU/Linux
--------------------------------------------------------------------
2.01-xx changes from 1.30-04 (brad@mrunix.net)
--------------------------------------------------------------------
Fixes:
o Fix possible obscure buffer overflow bug in DNS resolver code
o Added additional extended character fixes
o Let code accept partial content response codes along with 200's
o Added code to catch blank hostnames (yes, they have been found!)
Will convert them into 'Unknown'
o Security fix for cross-site scripting vulnerability found by
Flavio Veloso (www.magnux.com).
o Fixed a TOTAL_RC off by one error, which would prevent the last
response code from being saved when using incremental mode.
o Fixed possible segfault condition in MangleAgent code on
some malformed user agent names.
o Fixed DNS to prevent hangs on blank and malformed hostnames.
o Fixed problem calculating visits. Changed timestamps to use
seconds since epoch (1/1/1970) which results in more accurate
analysis. Also changed normal out of sequence code to handle
up to 1 hour of 'slop' in the timestamps. This changed the
semantics of the VisitTimeout and -m configuration options, as
the values are now specified in number of seconds.
o Fixed problem where hostnames were not forced to lowercase when using DNS lookups.
o Fixed problem with incremental datafile which could cause a read
error under certain circumstances (removes control characters).
Also changed code to now abort on a read error.
o Fixed problem with hash table node creation where objects that
were exactly the maximum length would wind up leaving a garbage
byte at the end of the memory space allocated. This was causing
some very infrequent and widely different problems.
o Fixed problem where country graph could be produced incorrectly
if using a non-english language and the country name overlapped
the pie chart.
o Found and fixed a possible 32-bit wrap-around problem when
using incremental mode on large sites. The problem would cause
the KBytes data on large groups to become inaccurate.
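The 32-bit wrap-around fix in the last item can be illustrated with a small Python sketch. The counter types actually used in the program are not shown in this file, so the unsigned 32-bit counter below, and the names add_u32 and add_wide, are assumptions made for illustration.

```python
UINT32_MASK = 2**32 - 1  # largest value a 32-bit unsigned counter can hold

def add_u32(total, amount):
    """Add as a 32-bit unsigned counter would: the result wraps modulo 2**32."""
    return (total + amount) & UINT32_MASK

def add_wide(total, amount):
    """Add without a width limit (Python integers do not overflow)."""
    return total + amount

total = 4000000000
print(add_u32(total, 500000000))   # wrapped: far smaller than the true sum
print(add_wide(total, 500000000))  # true sum
```

Once a total passes 2**32 the narrow counter silently restarts near zero, which matches the kind of inaccurate totals described above.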
Changes/Additions:
o Modified configure to allow specification of the default config
directory. If not given, will use /etc (/etc/webalizer.conf).
o Added DailyGraph and DailyStats configuration options to enable
or disable the Daily usage graph and stats table from output.
o Improved visit calculation logic to reduce 'false' counts generated
by external image referrals.
o Added reverse DNS lookup capability. This adds the command
line switches -D and -N, and configuration keywords "DNSCache"
and "DNSChildren". See the DNS.README for additional info.
Based in part on code submitted by Henning P. Schmiedehausen
(hps@tanstaafl.de).
o Added ability to dump Sites, URL's, Referrers, User Agents,
Usernames and Search Strings to tab delimited files, suitable
for import into most database and spreadsheet programs. The
location of this file may be specified using the "DumpPath"
configuration keyword, allowing the data to be kept someplace
outside the web servers document tree. The configuration
keywords "DumpSites", "DumpURLs", "DumpReferrers", "DumpAgents",
"DumpUsers" and "DumpSearchStr" have been added to control the
file dumps. Column headers can be included in the file with
the "DumpHeader" keyword. Dump filename extensions may be
specified using the "DumpExtension" keyword (default is .tab).
o Added username analysis, based on usernames found in the log,
and only available if username information is present in the
log (ie: http authentication or wu-ftpd xferlog). The keywords
'GroupUser', 'HideUser', 'IgnoreUser', 'IncludeUser', 'AllUsers',
and 'TopUsers' have been added to the configuration file code.
This change also modified the format of the incremental data file.
o Added the ability to display ALL sites, URL's, Referrers,
User Agents and Search Strings on a separate HTML page from
the normal statistics page. This adds the configuration
keywords 'AllSites', 'AllURLs', 'AllReferrers', 'AllAgents'
and 'AllSearchStr', which can have either a "yes" or "no"
value (default is "no"). Will add a "View All..." link to
the bottom of the appropriate "Top" table if enabled.
o Added support for squid proxy logs, thanks to code submitted
by Steinar H. Gunderson (sgunderson@bigfoot.com). To use
squid logs, specify a LogType of 'squid' in the configuration
file. This also changed the behaviour of the '-F' command
line switch, which now requires a second argument of either
'clf', 'ftp' or 'squid'.
o Completely modified the way the various TOP tables are handled
and sorted, which now allows extremely large top tables without
any performance degradation. Previously, tables greater than
a few hundred elements produced a noticeable performance penalty
during processing.
o Added the ability to group domains automatically and to hide
individual host names from the report, using the 'GroupDomains'
and 'HideAllSites' configuration keywords (-g and -X command
line options). Domain Grouping is configurable as to the level
of grouping (second level domain, third, etc...). HideAllSites
forces only grouped site records to be displayed if any. Based
on ideas/code by Michael Klemme (mklemme@gmx.de). This changes
the behaviour of the '-g' switch, which previously was used to
force the use of GMT time for reports.
o Added user configurable search engine specification, used for
search string analysis. This adds the 'SearchEngine' keyword
in configuration files. Based on idea/code by Alexey Kizilov.
o Changed code to use the latest version of GD which supports PNG
images instead of GIF images. Also included changes in configure
script to ensure the presence of the libpng and libz libraries.
o Added ability to override log file to STDIN by use of '-' on
the command line.
o Added gzipped logfile support. The program will automatically
detect logfiles with a '.gz' extension and uncompress on the
fly. Uses gz file support of zlib, since it's required for
our gd/png stuff anyway. Please note that using gzipped logs
will incur a small performance penalty.
o Minor changes to search string code to increase accuracy. This
also removes a previous condition that would occasionally cause
search strings to incorrectly be counted twice or to be counted
as different search strings when only differing by a space.
o Minor changes to URL parse code to allow additional characters.
Also changed unescape code to properly handle extended chars.
o Major changes to hash table node format for reduced memory usage.
Instead of fixed size strings, the new format will dynamically
allocate string memory and use pointers to existing table data
under certain circumstances. The memory savings are significant
and will be greatly noticed with large sites. Because of these
changes, the formatting of the incremental data file had to be
changed, therefore it is incompatible with previous versions.
o Major code reorganization and cleanup. This was to facilitate
future development and make things more manageable.
o Usual documentation updates for new features/functions.
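The log input options listed above ('-' for STDIN and automatic handling of '.gz' files) can be sketched in Python. The program itself uses zlib's gz file support from C; the function name open_log and the UTF-8 decoding with replacement are assumptions made for this illustration.

```python
import gzip
import sys

def open_log(path):
    """Return a text-mode file object for a log path, '-' (stdin), or a .gz file."""
    if path == "-":
        return sys.stdin  # read the log from standard input
    if path.endswith(".gz"):
        # Decompress on the fly while reading.
        return gzip.open(path, "rt", encoding="utf-8", errors="replace")
    return open(path, "r", encoding="utf-8", errors="replace")
```

The caller reads lines from the returned object the same way in all three cases, so the parser does not need to know where the data came from.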
--------------------------------------------------------------------
1.30-xx changes from 1.22-06 (brad@mrunix.net)
--------------------------------------------------------------------
Fixes:
o Fixed minor bug that would allow incorrect site totals for the
first day of the month under certain conditions.
Changes/Additions:
o Added Top Entry and Exit Page tables. Added configuration file
keywords TopEntry (-e command line) and TopExit (-E command line)
to specify the number of entries to display for each table. The
default for both is 10. See README for additional information.
o Added 'Group' labels. Allows display of a specified label for
grouped entries (in 'Top' tables). Based on patch submitted
by Oliver Graf (ograf@rhein-zeitung.de). See sample.conf for
examples.
o Added 'Visits' totals. The length of time that constitutes a
'visit' can be set using the VisitTimeout configuration keyword
(-m command line option). The value must be given in HHMMSS
format; leading zeros may be omitted. Default is 30 minutes (3000).
o Added 'Pages' totals, based on user specified extensions. Changes
made to generated graphs as well. Configuration keyword PageType
(and command line -P switch) allows specification of extensions
to use (defaults to 'htm*' and 'cgi'). Also called "pageviews".
o Added Search String analysis. Keyword 'TopSearch' defines how
many of the top search strings to display. Default is 20. Can
be disabled by using zero (0).
o Added native support for ftp logs (xferlog ala wu-ftpd). Added
'LogType' configuration file keyword (-F command line option)
to specify log type. Values can be either 'web' or 'ftp', with
the default of 'web'.
o Changed graphs to handle pages and visits totals. Also added
color coded legends, which can be disabled using the GraphLegend
configuration keyword (-L command line option). Default is to
display them.
o Added background lines to graphs. Default is 2 lines, and can
be set to any number using the GraphLines configuration keyword
(-l command line option). Can use anywhere from none (0) to
twenty lines. They will be drawn in all but the country graph.
o Added CountryGraph configuration file keyword (-Y command line
option) to enable/disable display of country usage pie chart.
o Added FoldSeqErr keyword (-f command line option). Normally,
the program will ignore log records that are out of sequence
(chronological order). This option lets them be folded into
the analysis anyway, as if they were the same date/time as the
last good record. Apache users can safely ignore :)
o Added additional 'Top' tables for SITES and URL's, sorted by
KBytes instead of hits. Two new configuration file keywords,
TopKSites and TopKURLs, can be used to specify the number of
entries for each (zero to disable). Default for both is 10.
o Added additional calculations for max/avg files, pages, visits
and KBytes in monthly statistics.
o Updated generated HTML code to fully comply with the HTML 4.0
Transitional spec. DOCTYPE header reflects this change as well.
o Changed code to use 4 digit years in filenames. Purely for the
Y2K phobes who couldn't deal with only two digits (even though
it was _purely_ for humans, the program couldn't care less).
Unfortunately, this means that you will have to rename previous
month files to the new format. Not a big deal if you plan on
re-running all your logs to take advantage of the new features.
o Major changes to both history file and incremental file formats
to handle additional totals (pages/visits data). As a result,
this version is INCOMPATIBLE with previous versions. See the
file README.FIRST for important information on upgrading.
o Language files and documentation updated for new functions.
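The VisitTimeout value format and the idea of a 'visit' described above can be sketched in Python. This is an illustration under assumptions, not the program's C logic: the function names are hypothetical, and a visit is taken here to start whenever the gap since the same host's previous request exceeds the timeout.

```python
def hhmmss_to_seconds(value):
    """Convert an HHMMSS value (leading zeros optional) to seconds."""
    digits = str(value).zfill(6)
    if len(digits) != 6 or not digits.isdigit():
        raise ValueError("expected up to six digits in HHMMSS form: %r" % (value,))
    hours, minutes, seconds = int(digits[0:2]), int(digits[2:4]), int(digits[4:6])
    return hours * 3600 + minutes * 60 + seconds

def count_visits(timestamps, timeout_seconds):
    """Count visits for one host from request times (in seconds, in log order)."""
    visits = 0
    last = None
    for ts in timestamps:
        if last is None or ts - last > timeout_seconds:
            visits += 1  # first request, or too long since the previous one
        last = ts
    return visits

timeout = hhmmss_to_seconds(3000)                   # 00:30:00
print(timeout)                                      # 1800
print(count_visits([0, 600, 3000, 3100], timeout))  # 2
```

The sketch does not check that the minutes and seconds parts are below 60; a stricter parser would reject such values.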
--------------------------------------------------------------------
1.22-xx changes from 1.20-11 (brad@mrunix.net)
--------------------------------------------------------------------
Fixes:
o Fixed bug in country total generation. Caused country table
to show bogus entries if logs contain hostnames that were not
fully qualified (ie: don't have the domain name/TLD portion).
o Changed/fixed incremental data I/O routines to better detect and
handle error conditions. This involved some minor incremental
data file format changes as well. Fixes problem large sites were
having where random tables were getting munged.
o Fixed record parse code to better detect and strip query portion
from URL's and Referrer strings.
o Fixed segfault condition when more than MAX_CTRY entries were
specified for the "Top Countries" table.
Changes/Additions:
o Added code to detect negative byte transfer sizes in logs (another
netscape server kludge :) Could cause KByte xfer sizes to become
corrupt.
o Several small changes (mostly ifdef/endif's) to make code compile
clean 'out-of-the-box' across more platforms (ala SunOS). Also
added a GNU autoconf 'configure' script which helps a bit as well.
o Added Include* keywords. Allows forcing the inclusion of specified
log records. Takes precedence over counterpart Ignore* keywords.
o Added HTMLPre, HTMLBody, HTMLEnd and HTMLExtension keywords, and
changed behaviour of HTMLHead keyword. Previous versions need
only change the 'HTMLHead' keyword in existing files to 'HTMLBody'
to upgrade. Thanks to Colin Viebrock <cmv@privateworld.com> for
the idea and code examples.
o Changed mangle agent code to support Opera and other browsers.
Also updated response codes to IETF HTTP/1.1 Rev 6 draft.
Thanks to Yves Lafon <ylafon@w3.org> for these.
o Added HistoryName and IncrementalName keywords, which allow the
specification of the history and incremental data filenames.
o Added UseHTTPS keyword, which allows using 'https://' instead
of 'http://' for links to URLS in the 'Top URL's' table. Also
added check for URL's that already have the protocol specified
(such as on virtual web and proxy servers), and to use unmodified
if found (will only force to lowercase for matching).
o Added code to ignore out-of-sequence log records.
o Added code to force hostnames to lowercase (was causing country skew).
o Disabled display of blank (zero hit) days at start of daily stat table.
o Added records per second calculation to timing totals.
o ALT= tags now use translated strings instead of forcing english.
o Updated documentation for new functions/features.
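The precedence of Include* over Ignore* keywords noted above can be sketched in Python. The function name keep_record and the use of shell-style wildcard matching are assumptions for illustration; the program's own matching rules are described in its README, not here.

```python
from fnmatch import fnmatchcase

def keep_record(value, include_patterns, ignore_patterns):
    """Return True if a log record with this field value should be counted."""
    if any(fnmatchcase(value, p) for p in include_patterns):
        return True   # an Include* match wins over any Ignore* match
    if any(fnmatchcase(value, p) for p in ignore_patterns):
        return False  # ignored, and not rescued by an include
    return True       # no rule applies: keep the record

# Hypothetical patterns: ignore all images except the logo.
print(keep_record("/images/logo.gif", ["/images/logo*"], ["/images/*"]))
print(keep_record("/images/banner.gif", ["/images/logo*"], ["/images/*"]))
```

Checking the include list first is what lets a narrow include carve an exception out of a broad ignore rule.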
--------------------------------------------------------------------
1.20-xx changes from 1.12-10 (brad@mrunix.net)
--------------------------------------------------------------------
Fixes:
o Modified record parse routine to not touch stuff between quotes
("). Was causing problems parsing some malformed request fields.
o Fixed memory leak in MangleAgent code, and relocated to eliminate
unnecessary processing (causing segfault on some machines).
Changes/Additions:
o Changed transfer totals on host/url structures to support large
groupings (such as *.gif) on heavily hit servers. Hopefully, this
should cure the 32bit overflow problem large sites were having.
o Changed daily transfer totals to support transfers greater than
roughly 4.2 gigabytes a day.
o Added some missing HTML tags and altered the way totals are
calculated on the 'Top' tables (to correct for grouped records).
o Added incremental run capability (-p command line option or
"Incremental" configuration file keyword).
--------------------------------------------------------------------
1.1x-xx changes from 1.00-05 (brad@mrunix.net)
--------------------------------------------------------------------
Fixes:
o Re-wrote the Group* logic, fixing a bug that allowed hiding of
objects when they shouldn't be.
o Fixed broken IgnoreReferrer code.
o Modified config parse code to handle extended characters.
o Misc. minor bug fixes/changes. Added a missing fclose.
o Cleaned up generated HTML.
o Fixed duplicate warnings on large referrer fields.
o Fixed country table bug adding grouped records to totals.
Changes/Additions:
o Added GroupSite, GroupReferrer and GroupAgent keywords to round
out the Group* configuration options.
o Added GroupShading and GroupHighlight keywords to allow selective
highlight and shading on grouped rows in table.
o Removed the '-L' command line option. Groupings can now only
be specified from a configuration file. Language files changed
to reflect change.
o Added '-V' command line option (identical to '-v') for version.
o Added additional language support. Language files will be marked
/* New for 1.1 */ where changes have been made.
o Various rewrites to streamline the code, accommodate the new
group options and make things easier down the road when I implement
incremental (partial log) processing.
o Usual README and CHANGES documentation updates.
--------------------------------------------------------------------
1.00-xx changes from 0.99-06 (brad@mrunix.net)
--------------------------------------------------------------------
Fixes:
o Modify record parser so that spaces in usernames (auth field)
don't cause record to be skipped (w/'Bad Record' message).
o Included various error conditions that were being ignored in
the timing statistics ('bad records' value) totals.
Changes/Additions:
o Added GMTTime (-g) option to force display of timestamps in
GMT (UTC) time instead of local timezone.
o Added GroupURL (-L) option for grouping of URL's as if they
were a single object. See README for details.
o Language support in the form of a language specific header
file containing all strings used by The Webalizer. English
file is used by default unless changed. Support for other
languages will be distributed as I receive them.
--------------------------------------------------------------------
0.99-xx changes from 0.98-16 (brad@mrunix.net)
--------------------------------------------------------------------
0.99 is mostly a bug-fix release, with a few added extra goodies.
Fixes:
o Fixed monthly total transfer size (silent) overflow problem.
o Fixed the numerous fprintf format errors. Only seemed to wreak havoc
on non-intel machines though.
o Fixed core dump condition on certain machines when using stdin for
input.
o Fixed floating point code that caused divide by zero errors on some
platforms (most noticeably on SCO OpenServer).
o Netscape server kludges: Added code to deal with Netscape log header
record gracefully. Also added workaround for timestamp error where
Netscape sometimes makes a day have 0-24 hours instead of 0-23. The
Webalizer will now treat anything greater than 23 as 0.
o Resized some fixed field sizes to gain memory usage improvements.
Changes/Additions:
o Ignore* config keywords added. This allows you to completely ignore
certain log records based on site name, URL, user agent or referrer.
* Use will cause inaccurate statistics results. See documentation.
o ReallyQuiet config keyword (-Q command line option) added. Causes
The Webalizer to suppress _all_ messages. Useful for cron jobs.
o Removed the "Sites" total at the bottom of the summary by month.
The total for sites is a useless number and produces a misleadingly
high value which detracts from the accuracy of the other totals.
o Updated README and CHANGES