705 Commits

Author SHA1 Message Date
Sebastian Hahn
399a9a76dc Add metrics-lib submodule 2013-08-22 16:45:04 +02:00
Sebastian Hahn
12cdab447a Fix build errors on Debian systems
This ports commit 3e60ccdaaba598cabb7281d45f9a415299b8e3e3 from
metrics-lib over, originally written by Jens-Michael Hoffmann
<jmho@jmho.de>, with minor tweaks by Karsten Loesing.

The local lib directory is not used anymore and the respective references
were removed. The Java dependencies are now specified in the build.xml
and taken from their installed locations.

In addition to git, openjdk-6-jdk, and ant, the following Java packages
have to be installed:
 - libcommons-codec-java
 - libcommons-compress-java

Slightly tweaked by Sebastian Hahn to remove references to unused junit.
2013-08-22 16:13:31 +02:00
Karsten Loesing
4ce2bb867b Warn if assignments.log contains duplicate fingerprints. 2013-07-17 12:28:11 +02:00
Karsten Loesing
b65f0a1aa2 Keep ntor-onion-key lines in sanitized bridge descriptors.
Ian says in #8028 that doing so seems safe to him, because it's just a
random group element.
2013-02-01 05:07:19 +01:00
Karsten Loesing
abbdce6858 Stop rate-limiting torperf downloads in Java.
Instead, change cronjob to */6, which is possible now that metrics-db has
five separate modules.
2012-12-01 07:05:34 +01:00
Karsten Loesing
6d870f1cf9 Keep geoip6-db-digest lines in sanitized bridge descriptors. 2012-11-07 16:44:24 -05:00
Karsten Loesing
45e91ade9c Fix typo in build file. 2012-11-07 13:07:09 -05:00
Karsten Loesing
3064d679d7 Fix log message. 2012-11-07 13:03:59 -05:00
Karsten Loesing
a03dad001c Take out now unused config options. 2012-11-07 12:51:11 -05:00
Karsten Loesing
2caf3f1b11 Tweak shell scripts. 2012-11-07 12:38:07 -05:00
Karsten Loesing
8e5a7a7d9e Make five modules executable on their own.
This will prevent one failing module from breaking the other four.  It
will also make it easier to change the timing of execution.
2012-11-07 12:04:02 -05:00
Karsten Loesing
e6e430baff Actually parse TorperfOutputDirectory config line. 2012-11-03 14:12:30 -04:00
Karsten Loesing
a06510c7f6 Update .gitignore. 2012-11-03 14:12:21 -04:00
Karsten Loesing
40a8efabeb Move default output directories to out/. 2012-11-03 11:20:44 -04:00
Karsten Loesing
efa2b13d3f Move default input directories to in/. 2012-11-03 10:47:29 -04:00
Karsten Loesing
3235ecfc4c Raise the threshold for warning about missing descriptors.
Apparently, it happens quite often that 0.1% of extra-info descriptors
are missing that are referenced from a vote.  Raising the threshold to
0.5% to reduce frequent warnings.  Anything between 0.1% and 0.5% still
does not indicate a problem.
2012-11-03 06:23:49 -04:00
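The threshold check described in the commit above can be sketched as follows. This is only an illustration, written in Python rather than the project's Java: the function name `should_warn_missing`, its parameters, and the strict `>` comparison are assumptions. Only the 0.1% and 0.5% figures come from the commit message.

```python
def should_warn_missing(missing: int, referenced: int,
                        threshold: float = 0.005) -> bool:
    """Return True if the share of missing descriptors exceeds the threshold.

    Hypothetical helper: the default of 0.005 corresponds to the 0.5%
    mentioned in the commit message; the comparison operator is assumed.
    """
    if referenced <= 0:
        # Nothing was referenced, so nothing can be missing.
        return False
    return missing / referenced > threshold


if __name__ == "__main__":
    # 3 of 1000 referenced descriptors missing, i.e. 0.3%.
    print(should_warn_missing(3, 1000))         # checked against 0.5%
    print(should_warn_missing(3, 1000, 0.001))  # checked against the old 0.1%
```

With 0.3% of descriptors missing, the old 0.1% threshold would have triggered a warning, while the raised 0.5% threshold stays silent.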
Karsten Loesing
bee74460bb Make check for missing descriptors much more efficient. 2012-10-31 12:28:01 -04:00
Karsten Loesing
9fac7fdda3 Create stats/ directory before writing torperf temp file. 2012-10-27 18:26:24 -04:00
Karsten Loesing
b75f680950 Download votes of all known authorities.
We currently look at the downloaded consensus to decide which votes to
download.  This approach fails when we're missing a consensus.  Instead,
download votes published by all known authorities, not just the ones
contained in the consensus.  Implements #5812.
2012-10-27 18:25:06 -04:00
Karsten Loesing
f7b58361aa Recognize when data sources become stale.
Implements #3850.
2012-10-27 13:52:42 -04:00
Karsten Loesing
0336ad311f Wait until all threads are done before exiting. 2012-10-26 21:19:39 -04:00
Karsten Loesing
663cdab721 Make copying to rsync/ more efficient.
Each module now writes to its own subdirectory in rsync/ in parallel to
writing to its output directory for making tarballs.  This should be more
efficient than going over the output directories and copying files to
rsync/.
2012-10-26 21:19:33 -04:00
Karsten Loesing
a2db139da1 Prepare for running relay descriptor downloader twice per hour. 2012-10-26 17:24:19 -04:00
Karsten Loesing
7db75dbedf Run five data-processing modules in parallel. 2012-10-26 15:13:21 -04:00
Karsten Loesing
5bdf0b0121 Take out options for copying to rsync directory. 2012-10-26 14:20:02 -04:00
Karsten Loesing
9766f4c4e9 Make each module copy its own files to rsync/. 2012-10-26 14:15:16 -04:00
Karsten Loesing
0df849df62 Move all processing logic to sub modules. 2012-10-26 13:24:39 -04:00
Karsten Loesing
f106b16b76 Organize imports, fix whitespace issues. 2012-10-26 12:47:42 -04:00
Karsten Loesing
8746badd1b Group classes by kind of processed metrics data.
Also remove unused unit tests.
2012-10-26 12:44:42 -04:00
Karsten Loesing
7b69045600 Add Faravahar to default list of directory authorities. 2012-09-21 12:37:56 -04:00
Karsten Loesing
f0b4f989a1 Correctly parse IPv6 addresses with :: in the middle. 2012-09-19 17:47:55 -04:00
Karsten Loesing
592fe79cc9 Update .gitignore and build.xml to match what's on yatei. 2012-08-21 15:31:19 +02:00
Karsten Loesing
0f6a01e9ce Remove html log file handler. 2012-08-21 15:23:30 +02:00
Karsten Loesing
8d8bdf1331 Remove GetTor statistics processing code. 2012-08-07 12:46:12 +02:00
Karsten Loesing
7d2ec97550 Close a few readers/streams, spotted by Eclipse. 2012-07-25 08:59:53 +02:00
Karsten Loesing
ba749152d3 Bump to @type bridge-extra-info 1.1. 2012-06-29 13:56:04 +02:00
Karsten Loesing
4d82879488 Sanitize pluggable transport lines in bridge descriptors.
Implements #4957.
2012-06-29 12:43:10 +02:00
Karsten Loesing
1b62b6cdd9 Provide torperf files via rsync. 2012-06-01 14:02:54 +02:00
Karsten Loesing
b920fb0798 Add @type annotation and Downloaded line to exit lists. 2012-05-31 15:44:52 +02:00
Karsten Loesing
f4c952bc8c Only include active descriptor sinks in rsync/. 2012-05-31 15:16:32 +02:00
Karsten Loesing
843ae9b36c Include GetTor stats in rsync/. 2012-05-31 15:14:25 +02:00
Karsten Loesing
193dad7033 Add @type annotation to GetTor stats file. 2012-05-31 15:13:21 +02:00
Karsten Loesing
a76ade4725 Add @type annotation to bridge pool assignment files. 2012-05-31 11:45:40 +02:00
Karsten Loesing
1e683fea7f Don't scrub entry-* lines from bridge descriptors, either. 2012-05-30 17:37:37 +02:00
Karsten Loesing
2db5c12fe9 Make some major changes to the bridge descriptor sanitizer.
- Bridge network statuses contain a "published" line containing the
  publication timestamp, so that parsers don't have to learn that
  timestamp from the file name anymore.

- Bridge network status entries are ordered by hex-encoded
  fingerprint, not by base64-encoded fingerprint, which is mostly a
  cosmetic change.

- Server descriptors and extra-info descriptors are stored under the
  SHA1 hashes of the descriptor identifiers of their non-scrubbed
  forms.  Previously, descriptors were (supposed to be; see #5607)
  stored under the digests of their scrubbed forms.  The reason for
  hashing digests is to prevent looking up an existing descriptor
  from the bridge authority by its non-scrubbed descriptor digest.
  With this change, we don't have to repair references between
  statuses, server descriptors, and extra-info descriptors anymore
  which turned out to be error-prone (#5608).  Server descriptors and
  extra-info descriptors contain a new "router-digest" line with the
  hex-formatted descriptor identifier.  These lines are necessary,
  because we cannot calculate the identifier anymore and because we
  don't want to rely on the file name.

- Stop sanitizing bridge nicknames (#5684).

- Stop sanitizing *-stats lines (#5807).

- All sanitized bridge descriptors contain @type annotations (#5651).
2012-05-30 11:54:13 +02:00
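The "hash of the digest" idea from the commit above can be sketched as follows. This is a minimal illustration in Python, not the project's Java code: the function name `sanitized_storage_name` is hypothetical, and hashing the raw digest bytes (rather than the hex string) is an assumption that the commit message does not confirm.

```python
import hashlib


def sanitized_storage_name(descriptor_digest_hex: str) -> str:
    """Apply SHA-1 once more to an original descriptor digest.

    descriptor_digest_hex is the hex-encoded digest of the non-scrubbed
    descriptor. Hashing the decoded bytes is an assumption made here.
    """
    digest_bytes = bytes.fromhex(descriptor_digest_hex)
    return hashlib.sha1(digest_bytes).hexdigest()


if __name__ == "__main__":
    # Stand-in for a real descriptor; the content is made up for the demo.
    original = hashlib.sha1(b"example non-scrubbed descriptor").hexdigest()
    stored_as = sanitized_storage_name(original)
    # The stored name is derived from, but differs from, the original digest.
    print(original != stored_as, len(stored_as))
```

Because SHA-1 is one-way, a published name of this kind does not reveal the original digest, so it cannot be used to request the non-scrubbed descriptor from the bridge authority.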
Karsten Loesing
2288810bad Tweak some Torperf-related log messages. 2012-05-30 08:28:21 +02:00
Karsten Loesing
a77d67de24 Print out a warning if we're missing Torperf data lines. 2012-05-29 20:17:41 +02:00
Karsten Loesing
8b45d2fb00 Merge Torperf files into new .tpf file format.
Implements part of #3036.
2012-05-29 20:17:40 +02:00
Karsten Loesing
65c525c2ac Handle @type annotations when reading descriptors. 2012-05-11 16:17:09 +02:00
Karsten Loesing
bba639f852 Remove binaries.
Don't tell Sebastian, or he'll do his I-told-you-so dance...
2012-05-11 15:45:55 +02:00