[ Top | Up |
Prev | Next |
Map | Index ]
 Analog CE 6.0:
Upgrading from earlier versions
 Analog CE 6.0:
Upgrading from earlier versions
This section lists those commands which existed in older versions of analog,
but which have been changed or abolished in this version. It also lists reasons
why the same input might now produce different output. The new features in this
version are listed in the section What's new in
this version?.
If you are upgrading from a previous version of analog, you should keep your
old analog.cfg, and your old anlghead.h if you are
compiling your own copy of analog rather than using one of the precompiled
versions.
  - Bug fixes in XML schema allow XML output under Analog 6.0  
  
- If using XML OUTPUT a new XMLDTD command allows you to specify a public http/https URL to the public XML DTD file.
  - The DNSTIMEOUT command is
  now off by default. On platforms where it's available, you have to specify
  it explicitly if you want it.
  - The computer-readable output style now forces
  English language output. The idea is to have a consistent output style, and
  let the post-processor do any necessary localisation. But this may break some
  existing applications.
  - The default output style is now XHTML, not HTML. Also the command-line
      argument "-a" selects XHTML, not HTML.
  
- If you are using a style sheet for HTML output, it probably won't work
      well with XHTML output, because it replaces the default style
      sheet. So you will probably need to write a new one for XHTML.
  
- The default for PNGIMAGES
      is now ON, so bar charts are made from png's not gif's by
      default. Some users may need to copy the png versions of the bar charts
      into their images directory.
  - More browsers are recognised, which will also reduce
      the Mozilla numbers.
  - In the Operating System Report, Macintosh is no longer split into PPC
      and 68k. (68k is now so rare that it's not worth it. Unfortunately,
      it's not possible to distinguish OS 9 from OS X, because the browsers
      don't all report it.)
  
- The language files have changed, so don't try and use the old ones.
  
- Some dead search engines have been removed from the default
      configuration file, and some new ones added.
  
- Kilobytes, megabytes etc. are now listed to two not three decimal places
      by default; but you can configure the number of decimal places with the
      new BYTESDP command.
  - There will be small changes to the browser and operating system reports
      thanks to better parsing of recent Netscape and Mozilla browsers.
  
- In version 5.23 and earlier, earlier UNCOMPRESS commands took
      precedence over later ones. This has been corrected in 5.24.
  - As a side-effect of the security fix in version 5.22, the
      JAPANESE-JIS character set no longer works from the form
      interface.
  - The language files have all changed, so don't try and use the old ones.
  
- The rules for generating "organisations" from numerical
      addresses have changed, which will change the Organisation Report.
  
- Filenames given on the command line, as opposed to in configuration
      commands, are now relative to the current directory, not the analog
      directory. This will probably only affect Unix users.
  - The order of the lines in the report descriptions files has changed, so
      don't try to use the old versions.
  - LANGUAGE JAPANESE now selects the JIS character set instead
      of EUC. (Four Japanese character sets can be chosen explicitly through
      the LANGUAGE command.)
  - The language files have all changed, so don't try and use the old ones.
  
- Organisations in Belgium are now considered to be at depth 3, which will
      change the Organisation Report.
  
- There are some new images in the images/ directory, which you
      will have to copy into your IMAGEDIR.
  - LOCALCHARTDIR and DESCFILE can no longer be
      specified from the form interface, for
      security reasons.
  - Previously, when an output
      INCLUDE or EXCLUDE command was in operation,
      the excluded items were still included in the "not listed"
      line at the bottom of the report. This is no longer the case.
      Also, percentages of the requests and bytes are now calculated
      ignoring the excluded items, which will make those numbers
      larger, make their wedges in the pie charts larger, and change what is
      included by floors specified in terms of
      percentages.
- The Operating System Report, Browser Report & Browser Summary are now
    sorted by page requests instead of raw requests by default.
  - There are some new files (with names beginning with sq) in
      the images/ directory. These will need to be copied into your
      IMAGEDIR.
  
- *.asp is no longer included in the
      default definition of "pages" (which
      it was from 4.11 to 4.90beta1) because there are too many other equally
      well qualified candidates. But you can easily re-enable it in
      analog.cfg.
  
- The format of the "busiest time" and floor/sortby lines has
      changed in the computer-readable output.
  
- The default DNSGOODHOURS and
      DNSBADHOURS have been increased. In particular, the
      default DNSGOODHOURS is now as near infinity as makes no
      difference!
  - Some languages have still not been translated for version 5 yet:
      see the list of available languages in the
      What's new? section.
  
- All files are now looked for in sensible directories, specified at
      compile time, if no other directory is specified. On platforms where
      these directories are not known at compile time, analog formerly looked
      in the current working directory, but now looks in (its guess at) the
      directory of the analog binary. However, filenames specified in command
      line arguments are within the current working directory since version
      5.2.
  
- The cache file now includes data on the first-request time of each
      item. But if you read a cache file from an older version of analog, this
      data will not have been recorded, and so the last-request time will be
      used instead. Analog will warn you about this.
  
- If you specify a CACHEFILE command but no LOGFILE
      command, analog won't read the default (compile-time) logfile. This is
      much more intuitive behaviour, but some users may have been relying on
      the old behaviour. The actual rule is given in the documentation on
      Cache files.
  
- Some browsers will be rediagnosed as robots in the Operating System
      Report. This will mainly reduce the "OS unknown" total, but
      may also reduce other categories.
  
- US English now uses the 12-hour clock by default. If you
      want to continue to use the 24-hour clock, use the language file
      us24.lng instead. (Either use a
      LANGFILE command, or
      rename us24.lng to us.lng). Conversely, you can
      get British English with the 12-hour clock by using uk12.lng.
  
- Because of the twelve new reports, if you use a
      REPORTORDER command,
      you should include the corresponding new
      letters: 1Q76wLlMRjyY.
  
- Computer readable output now has an extra
      line reporting the busiest time period for the time reports.
  
- The date codes in the
      OUTFILE and CACHEOUTFILE commands now always
      produce dates in English.
  
- The REFLINKINCLUDE command now only controls links in the
      Referrer Report. Use REDIRREFLINKINCLUDE and
      FAILREFLINKINCLUDE for the Redirected and Failed Referrer
      Reports.
  
- When doing a negative floor, items are
      no longer included if they have 0 of the criterion in question, even if
      there aren't enough items otherwise. For example FLOOR -25p
      will list fewer than 25 items if there aren't 25 items with requests for
      pages, even if there are other items with 0 requests for pages.
  
- When a logfile line contains bytes but no filename, analog previously
      ignored the bytes. It now counts them for other items on the line, but
      doesn't put them in the File Size Report or General Summary (to avoid
      double-counting).
  
- There are no longer any HTML language files containing HTML entities. So,
      for example, the HTML output will always contain a literal é
      instead of the code é. This should make no
      difference to the reader, but please do tell me about any problems.
  
- The source files have moved to the src/ directory. And there
      are new source files in subdirectories of the src/ directory.
      Also the  header files have different variables in them. This means that
      automatic build scripts will have to be rewritten.
  - Regular expressions in an INCLUDE or EXCLUDE
      command must now occur on a line on their own, not within a
      comma-separated list.
  
- The search terms reported in the Search Word and Search Query Reports
      are no longer converted to lower case if you are using a multibyte
      character set.
  
- Unprintable characters in the output are now replaced by '?', except for
      multibyte character sets.
  - *INCLUDE "" and *EXCLUDE "" (see
      documentation) now apply to items
      which were present but corrupt. This may have the effect of including or
      excluding some new lines.
  
- There has been a tiny change in computer-readable
      output style. Previously if a time was blank, it took up only one
      column. Now it takes up as many columns as if it had been present.
  - There is better parsing of extended format and WebSTAR format logs,
      which may cause differences in some cases.
  
- All referrers now count as "pages" irrespective of
      any PAGEINCLUDE and PAGEEXCLUDE commands.
      Consequently, for example,
      "REFLINKINCLUDE pages" is now the
      same as "REFLINKINCLUDE *". You can recreate
      the previous behaviour with
      "REFLINKINCLUDE *.html,*.htm,*/".
  
- The default REPORTORDER has changed.
  
- New anchor names are used internally to the output page.
  - There may be slight differences in the results in this version owing to
      stripping anchors off filenames.
  - Some of the default paths have changed in anlghead.h.
  - It is now recommended that you don't run analog as a CGI program, or put
      it in the directory with your CGI programs, for
      security reasons.
  
- Each browser in the Browser Summary is now sorted by major version
      number then minor version number. So SUBBROW */* will now
      only show the major versions. To get all the minor versions, you need
      SUBBROW */*.*
  
- PAGEWIDTH has been replaced by
      HTMLPAGEWIDTH and
      ASCIIPAGEWIDTH.
  
- PRINTVARS has been renamed
      SETTINGS.
  - The form interface has been completely rewritten, and old versions of
      anlgform.html will not work with this version.
  
- The Browser Summary now diagnoses MSIE, Opera and WebTV browsers
      better. This will cause differences in output from previous versions.
  
- With RAWBYTES OFF, bytes are now listed as, for example,
      47.68 Mbytes instead of 48,832 kbytes. This should be less confusing.
  
- The DNS file has a new time encoding. It's only a
      few hours different, so I haven't made any special provision for it. The
      effect is that the DNSGOODHOURS and
      DNSBADHOURS may be a few hours out
      for existing entries (but not for new ones).
  - There is a new set of graphics in the images directory, which
      you will have to move to your web directory.
  
- In the Mac version, if a configuration file is dragged onto the analog
      icon, it is used instead of, not as well as, the default configuration
      file.
  - In the computer-readable output style, the
      line L7, the time the last seven days begins after, has been
      replaced by E7, the time the last seven days ends. This is
      for consistency with the other output styles.
  
- Also in the computer-readable output, there is a new line reporting the
      floor and the SORTBY for the report. In 3.11 and earlier,
      this didn't exist, and in 3.2 it only reported the floor, not the
      SORTBY.
  
- %R (Mac-style filename) has been abolished in the
      LOGFORMAT. Just use plain %r instead.
  
- It is no longer allowed to set the CACHEOUTFILE to be the
      same as a previous cache file.
  
- The definition of the common log format and related formats changed
      between 3.11 & 3.2, and again between 3.2 & 3.3. This could
      cause differences in output, but they are likely to be only very minor.
  - Lines without a particular item now work properly with
      INCLUDE and EXCLUDE commands. For example, if
      you do an INCLUDE to look at only certain lines, then lines
      without that type of item at all will not now be included, whereas
      previously they would have been. This can make the results lower than in
      these earlier versions.
  
- I have reluctantly removed support for NetPresenz logs. This hasn't
      worked for some time, and I have already been advising NetPresenz users
      not to use newer versions of analog because they could get wrong
      results. Unfortunately, fixing it would require a complete rewrite of
      the entire parsing code, which isn't going to happen any time soon. So
      my advice remains the same: continue to use version 2.11 or (even
      better) pre-process your logfiles into a form which analog can handle
      safely.
  
- The English domains file has changed name from domains.tab
      to ukdom.tab.
- If using the form interface on Windows, it is now necessary to put the
analog executable at \analog\analog.exe instead of
\Program Files\analog\analog.exe
  - LOGFORMAT MICROSOFT has been replaced by
      LOGFORMAT MICROSOFT-NA and LOGFORMAT MICROSOFT-INT;
      and similarly for LOGFORMAT NETPRESENZ.
  - It is possible that there may be small discrepancies between the results
      from previous versions and the results from this version because the
      parsing code has changed, but any such differences should be minor.
      However...
  
- If you used to use REFEXCLUDE or BROWEXCLUDE, you
      most likely now want
      REFREPEXCLUDE or
      BROWREPEXCLUDE
      instead, or you will exclude lots of lines that were previously included.
  
- It is possible that this version may not automatically parse a logfile
      that previous versions could parse, because it checks more carefully
      that the logfile is in the format claimed. If so, you will have to use a
      LOGFORMAT command.
  
- Approximate host counting has been abolished, unless there's a
      significant demand for it.
  
- Count of number of new hosts in last seven days abolished. It was too
      confusing because it depended on which old logfiles you analysed.
  
- The Error Report has been abolished (together with the configuration
      commands ERROR, ERRLOG and ERRMINOCCS).
      See the What's new? page.
  
- The BROWLOG and REFLOG commands have also been
      abolished: just use LOGFILE
      instead.
  
- The HASHSIZE commands have been abolished: analog now chooses
      the size of the hash tables itself.
  
- The MINREQS and similar options have been replaced by the
      FLOOR commands.
  
- Only one * is now allowed on the left-hand side of aliases,
      to avoid ambiguities.
  
- Automatic detection of log type is now on a per-file rather than a
      per-line basis.
  
- ISPAGE is now called
      PAGEINCLUDE.
  
- WITHARGS and REFWITHARGS are now called
      ARGSINCLUDE and
      REFARGSINCLUDE.
  
- MONTHLYBACK is now called MONTHBACK.
  
- FULLHOSTS is now just called HOST.
  
- The old LOGOURL command is now called LOGO.
      LOGOURL now has a new function, namely where to link the logo
      to.
  
- The UNIT commands have been abolished. They weren't very
      useful, and they didn't make much sense with the different ways of
      displaying the time report bar charts. The unit is now always chosen
      automatically.
  
- DIRLEVEL has been abolished, because the
      SUBDIR command is more general.
      Use SUBDIR */* or whatever instead.
  
- Comments aren't allowed in the domains file.
      I don't think this should cause a problem.
  
- GRAPHICAL is abolished. Instead, use lower case letters with
      the GRAPH commands.
  
- NUMLOOKUP has been replaced by
      DNS, and DNSFRESHHOURS
      by the commands DNSGOODHOURS and
      DNSBADHOURS.
  
- DNS cache files from previous versions are not compatible with this
      version.
  
- You can't use PAGES in the columns or SORTBY or
      FLOOR for the Request Report. Use REQINCLUDE pages
      instead.
  
- - as a synonym for none has been abolished in some
      places, e.g., HOSTURL.
  
- The following command line arguments have been abolished from earlier
      versions, many of the letters getting new meanings: 7,
      l, n, p, s, u,
      v, w. (-v has moved to
      -settings.) Others have been changed since version 1.2 as
      well.
  - Filenames for logfiles etc. should now be given DOS-style, with
      backslashes, rather than Unix-style with forward slashes.
  - There is no longer an automatic progress report. Use the
     PROGRESSFREQ command
      instead.
  - Use *INCLUDE and
      *EXCLUDE instead of *ONLY and
      *IGNORE.
  
- The syntax of the *FLOOR commands has changed.
  
- Use *SORTBY REQUESTS or BYTES instead of
      *SORTBY BYREQUESTS or BYBYTES.
Go to the Analog CE home page.
Stephen Turner
19 December 2004
[ Top | Up |
Prev | Next |
Map | Index ]