endico%mozilla.org
|
c9b38c329e
|
Weed out cruft. Two years of data contained 5 million URLs, most of which were various bonsai and htdig queries. This is insane. Crop off everything in the URL after a question mark. Crop off "index.html". Crop off trailing slashes, except for the root of the web site, which is a lone slash. Added a shell script to preprocess the log data, since that code was reused from another web analyzer script.
|
2001-08-01 03:52:22 +00:00 |
|
dmose%mozilla.org
|
2db9bdbbbe
|
updated license boilerplate
|
1999-11-01 23:33:56 +00:00 |
|
terry%netscape.com
|
343dba4848
|
Don't choke on really big URLs.
|
1999-04-20 16:19:44 +00:00 |
|
terry%netscape.com
|
00932f72b0
|
Was ignoring "/" -- the main web page!
|
1999-02-12 19:08:06 +00:00 |
|
terry%netscape.com
|
f25091afa7
|
Wasn't clearing out hash table after flushing its contents to the db.
|
1999-02-12 15:02:18 +00:00 |
|
terry%netscape.com
|
26ef0a94a2
|
Generate web page statistics.
|
1999-02-12 14:45:59 +00:00 |
|
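The URL cropping rules described in commit c9b38c329e can be sketched roughly as follows. This is an illustrative Python sketch, not the code from that commit: the function name `normalize_url` is hypothetical, and it assumes that "index.html" is only cropped when it is the final path component and that URLs are logged as paths starting with a slash.

```python
def normalize_url(url: str) -> str:
    """Collapse a logged request path into a canonical form (hypothetical helper)."""
    # Crop off everything after a question mark (the query string).
    path = url.split("?", 1)[0]

    # Crop off a trailing "index.html" (assumption: only as the last path component).
    if path.endswith("/index.html"):
        path = path[: -len("index.html")]

    # Crop off trailing slashes, but keep the site root as a lone slash.
    path = path.rstrip("/")
    return path or "/"


if __name__ == "__main__":
    for raw in ["/bonsai/cvsquery.cgi?module=all", "/docs/index.html", "/docs/", "/"]:
        print(raw, "->", normalize_url(raw))
```

Falling back to "/" when nothing is left also covers the case noted in commit 00932f72b0, where the main web page was being ignored.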