Web Statistics How-to

Use the Web Log Analysis Form to generate custom web statistics for any web pages that live on the Library's main web server (e.g. anything with a URL which begins www.lib.uchicago.edu or www1.lib.uchicago.edu)

Improvements/Changes Implemented July 2008 (old stats reports can be re-run to take advantage of some of these changes)
1) improved filtering of hits coming from web crawlers, bots, and other automated sources so that the numbers more accurately reflect real activity
2) greater control over the report output
3) referrer data and query data is now logged and reported (this data is only available from July 2007 forward)


Default report |Other report options | FAQ | Analog 6.0: How the web works

Default report:

Section Notes
General Summary The meaningful line is the Successful requests for pages which shows hits to your web pages (html, php, php3, or shtml) and is not bloated by requests for favicons, banner graphics, etc.
Directory Report Page hits broken down by directories/subdirectories. This is a summary report (see Request report for detail at the page level).
Request Report/Limit Page hits for individual pages in your node. Default is set to pages with >= 20 hits. Other options: 1 or 100 hits.
Search Query Report Top 30 search queries that resulted in clicks to your pages.
Referring Site Report Web sites that users were on when they clicked to your node/page. This is a summary report (see the Referrer report for detail at the page level).
Referrer Report/Limit Individual Web pages that users are on when they click to your node/page. Limited to pages with more than 20 hits. Other limit options: 1 or 100 hits.
Domain Report/Limit Number and percentage of hits by the highest level of the domain name (.edu, .net, .com, etc). Includes details of hits from uchicago.edu and lib.uchicago.edu domains. Default is limited to domains with 20 or more hits. Other limit options: 1 or 100.

Other report options (change the radio buttons to include these reports)

Section Notes
Monthly Total hits summarized by month
Weekly Total hits summarized by week
Daily Summary Total hits summarized by day of the week (i.e. are there more users on Mondays or Saturdays)
Daily Total hits for each individual day of the report period (This gets really long if you looking at several months of data).
Hourly Total hits summarized by hour of the day (i.e. are there more users at 9am or 5pm)
Organizations/Limit List of domains that visiting computers are coming from. NOTE: the numeric ones are IP addresses that cannot be resolved to a domain name
Hosts Visits from specific computers. NOTE: you may want to exclude your own.
Browser summary List of the browsers being used by your visitors.

FAQ

Q: Under the "General Summary" what counts as a "successful request?"
The 'Successful Requests' number includes requests for all types of content that make up a web page, including embedded banner images, icons, etc., This does not give a very meaningful number.

Q: Under the "General Summary" what counts as a page?
We have defined a page to include all the file types that we consider standard web pages (html, shtml, php3) so the 'Successful requests for pages' number is the total you should be looking at. All the detailed reports are showing successful page hits. We explicitly exclude image files as banner images, icons, etc bloat the statistics and create duplicate hits on each page.

Q: Why are my numbers so much lower than last year?
In an effort to make these statistics more accurate, we have eliminated the hits that are easy to identify as orignating from bots and other processess that do not reflect human activity. Statistics for 2008 will set new benchmarks. You may go back and re-run statistics for previous years using this new configuration if you want earlier comparisons.