
[base64](https://crates.io/crates/base64)
===

[![](https://img.shields.io/crates/v/base64.svg)](https://crates.io/crates/base64) [![Docs](https://docs.rs/base64/badge.svg)](https://docs.rs/base64) [![Build](https://travis-ci.org/marshallpierce/rust-base64.svg?branch=master)](https://travis-ci.org/marshallpierce/rust-base64) [![codecov](https://codecov.io/gh/marshallpierce/rust-base64/branch/master/graph/badge.svg)](https://codecov.io/gh/marshallpierce/rust-base64) [![unsafe forbidden](https://img.shields.io/badge/unsafe-forbidden-success.svg)](https://github.com/rust-secure-code/safety-dance/)

<a href="https://www.jetbrains.com/?from=rust-base64"><img src="/icon_CLion.svg" height="40px"/></a>

Made with CLion. Thanks to JetBrains for supporting open source!

It's base64. What more could anyone want?

This library's goals are to be *correct* and *fast*. It's thoroughly tested and widely used. It exposes functionality at multiple levels of abstraction so you can choose the trade-off between convenience and performance that suits you. For example, `decode_config_slice` decodes into an existing `&mut [u8]` and is pretty fast (2.6 GiB/s for a 3 KiB input), whereas `decode_config` allocates a new `Vec<u8>` and returns it, which may be more convenient in some cases but is slower at 2.1 GiB/s (although still fast enough for almost any purpose).

Example
---

```rust
extern crate base64;

use base64::{encode, decode};

fn main() {
    let a = b"hello world";
    let b = "aGVsbG8gd29ybGQ=";

    assert_eq!(encode(a), b);
    assert_eq!(a, &decode(b).unwrap()[..]);
}
```

See the [docs](https://docs.rs/base64) for all the details.
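
To make the trade-off between the slice-based and the allocating functions concrete, here is a toy sketch that uses only the standard library. It is not this crate's implementation: `sextet`, `decode_into`, and `decode_to_vec` are hypothetical names, and the decoder handles only the standard alphabet with minimal validation.

```rust
// Toy illustration only: none of these functions are exported by this crate.

/// Map one base64 character (standard alphabet) to its 6-bit value.
fn sextet(c: u8) -> Option<u32> {
    match c {
        b'A'..=b'Z' => Some((c - b'A') as u32),
        b'a'..=b'z' => Some((c - b'a') as u32 + 26),
        b'0'..=b'9' => Some((c - b'0') as u32 + 52),
        b'+' => Some(62),
        b'/' => Some(63),
        _ => None,
    }
}

/// Lower-level shape: decode into a caller-supplied buffer and return the
/// number of bytes written. Returns `None` if the input is invalid or `out`
/// is too small.
fn decode_into(input: &[u8], out: &mut [u8]) -> Option<usize> {
    // Drop up to two trailing '=' padding characters.
    let mut end = input.len();
    while end > 0 && input.len() - end < 2 && input[end - 1] == b'=' {
        end -= 1;
    }

    let mut written = 0;
    for group in input[..end].chunks(4) {
        if group.len() == 1 {
            return None; // a lone character carries only 6 bits
        }
        // Pack the 6-bit values, left-aligned, into a 24-bit accumulator.
        let mut accum: u32 = 0;
        for (i, &c) in group.iter().enumerate() {
            accum |= sextet(c)? << (18 - 6 * i as u32);
        }
        // 4 chars -> 3 bytes, 3 chars -> 2 bytes, 2 chars -> 1 byte.
        let n = group.len() - 1;
        if out.len() < written + n {
            return None;
        }
        for i in 0..n {
            out[written + i] = (accum >> (16 - 8 * i)) as u8;
        }
        written += n;
    }
    Some(written)
}

/// Convenience shape: allocate a fresh `Vec<u8>` on every call and return it.
fn decode_to_vec(input: &[u8]) -> Option<Vec<u8>> {
    // Every 4 input characters produce at most 3 bytes.
    let mut out = vec![0u8; (input.len() + 3) / 4 * 3];
    let written = decode_into(input, &mut out)?;
    out.truncate(written);
    Some(out)
}
```

A caller that decodes many inputs can keep one buffer and pass it to `decode_into` repeatedly, whereas `decode_to_vec` pays for a fresh allocation on every call.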

Rust version compatibility
---

The minimum required Rust version is 1.34.0.

Developing
---

Benchmarks are in `benches/`. Running them requires nightly Rust, but `rustup` makes it easy:

```bash
rustup run nightly cargo bench
```

Decoding is aided by some pre-calculated tables, which are generated by:

```bash
cargo run --example make_tables > src/tables.rs.tmp && mv src/tables.rs.tmp src/tables.rs
```
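
As a rough idea of what such a generated table contains, the sketch below builds a 256-entry lookup table for the standard alphabet and renders it as Rust source. It is an illustration under assumptions, not the actual `make_tables` example: `STANDARD_ALPHABET`, `build_decode_table`, and `render_table` are hypothetical names, and `0xFF` as the invalid marker is inferred from the `cmp $0xff` in the profiling output further down.

```rust
// Sketch only: the real generator may differ in names and output format.

const STANDARD_ALPHABET: &[u8; 64] =
    b"ABCDEFGHIJKLMNOPQRSTUVWXYZabcdefghijklmnopqrstuvwxyz0123456789+/";

/// Marker stored for bytes that are not part of the alphabet (assumed value).
const INVALID_VALUE: u8 = 0xFF;

/// Build a table mapping every possible input byte to its 6-bit value,
/// or to `INVALID_VALUE` if the byte is not in the alphabet.
fn build_decode_table(alphabet: &[u8; 64]) -> [u8; 256] {
    let mut table = [INVALID_VALUE; 256];
    for (value, &symbol) in alphabet.iter().enumerate() {
        table[symbol as usize] = value as u8;
    }
    table
}

/// Render the table as Rust source, 16 entries per row, roughly what a
/// generator might print before its output is redirected into a file.
fn render_table(name: &str, table: &[u8; 256]) -> String {
    let mut src = format!("pub const {}: &[u8; 256] = &[\n", name);
    for row in table.chunks(16) {
        let cells: Vec<String> = row.iter().map(|b| format!("{}", b)).collect();
        src.push_str(&format!("    {},\n", cells.join(", ")));
    }
    src.push_str("];\n");
    src
}
```

With a table like this, decoding one character is a single indexed load followed by a comparison against `INVALID_VALUE`, with no branching on character ranges.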

no_std
---

This crate supports `no_std`. By default the crate targets `std` via the `std` feature. Disable `default-features` to target `core` instead. In that case you lose all the functionality that depends on `std::io`, `std::error::Error`, and heap allocation. An additional `alloc` feature brings back support for heap allocation.
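
As a sketch, a `Cargo.toml` entry for a `no_std` build that keeps the heap-allocating functions might look like the following. The version number is an assumption; substitute the release you actually depend on.

```toml
[dependencies]
# Assumed version. Omit `features` entirely for a core-only build without allocation.
base64 = { version = "0.11", default-features = false, features = ["alloc"] }
```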

Profiling
---

On Linux, you can use [perf](https://perf.wiki.kernel.org/index.php/Main_Page) for profiling. First compile the benchmarks with `rustup run nightly cargo bench --no-run`.

Run the benchmark binary with `perf` (shown here filtering to one particular benchmark, which makes the results easier to read). On most systems `perf` is only available to the root user because it fiddles with event counters in your CPU, so use `sudo`. We need to run the actual benchmark binary, hence the path into `target`. To see the full path, run `rustup run nightly cargo bench -v`, which prints the commands it runs. If you use the exact path that `bench` outputs, make sure you pick the one for the benchmarks, not the tests. You may also want to run `cargo clean` first so that you have only one `benchmarks-` binary (they tend to accumulate).

```bash
sudo perf record target/release/deps/benchmarks-* --bench decode_10mib_reuse
```

Then analyze the results, again with perf:

```bash
sudo perf annotate -l
```

You'll see a bunch of interleaved Rust source and assembly like this. The section with `lib.rs:327` is telling us that 4.02% of samples saw the `movzbl` (a zero-extending byte load, here the decode table lookup) as the active instruction. However, this percentage is not as exact as it seems due to a phenomenon called *skid*. Basically, a consequence of how fancy modern CPUs are is that this sort of instruction profiling is inherently inaccurate, especially in branch-heavy code.

```text
 lib.rs:322    0.70 :     10698:       mov    %rdi,%rax
    2.82 :        1069b:       shr    $0x38,%rax
         :                  if morsel == decode_tables::INVALID_VALUE {
         :                      bad_byte_index = input_index;
         :                      break;
         :                  };
         :                  accum = (morsel as u64) << 58;
 lib.rs:327    4.02 :     1069f:       movzbl (%r9,%rax,1),%r15d
         :              // fast loop of 8 bytes at a time
         :              while input_index < length_of_full_chunks {
         :                  let mut accum: u64;
         :
         :                  let input_chunk = BigEndian::read_u64(&input_bytes[input_index..(input_index + 8)]);
         :                  morsel = decode_table[(input_chunk >> 56) as usize];
 lib.rs:322    3.68 :     106a4:       cmp    $0xff,%r15
         :                  if morsel == decode_tables::INVALID_VALUE {
    0.00 :        106ab:       je     1090e <base64::decode_config_buf::hbf68a45fefa299c1+0x46e>
```


Fuzzing
---

This uses [cargo-fuzz](https://github.com/rust-fuzz/cargo-fuzz). See `fuzz/fuzzers` for the available fuzzing scripts. To run one, use an invocation like one of these:

```bash
cargo +nightly fuzz run roundtrip
cargo +nightly fuzz run roundtrip_no_pad
cargo +nightly fuzz run roundtrip_random_config -- -max_len=10240
cargo +nightly fuzz run decode_random
```


License
---

This project is dual-licensed under MIT and Apache 2.0.