rocksdb

Go to file

Peter Dillinger 5f8f2fda0e Refactor / clean up / optimize FullFilterBitsReader (#5941 )

Summary:
FullFilterBitsReader, after creating in BloomFilterPolicy, was
responsible for decoding metadata bits. This meant that
FullFilterBitsReader::MayMatch had some metadata checks in order to
implement "always true" or "always false" functionality in the case
of inconsistent or trivial metadata. This made for ugly
mixing-of-concerns code and probably had some runtime cost. It also
didn't really support plugging in alternative filter implementations
with extensions to the existing metadata schema.

BloomFilterPolicy::GetFilterBitsReader is now (exclusively) responsible
for decoding filter metadata bits and constructing appropriate instances
deriving from FilterBitsReader. "Always false" and "always true" derived
classes allow FullFilterBitsReader not to be concerned with handling of
trivial or inconsistent metadata. This also makes for easy expansion
to alternative filter implementations in new, alternative derived
classes. This change makes calls to FilterBitsReader::MayMatch
*necessarily* virtual because there's now more than one built-in
implementation. Compared with the previous implementation's extra
'if' checks in MayMatch, there's no consistent performance difference,
measured by (an older revision of) filter_bench (differences here seem
to be within noise):

Inside queries...
- Dry run (407) ns/op: 35.9996
+ Dry run (407) ns/op: 35.2034
- Single filter ns/op: 47.5483
+ Single filter ns/op: 47.4034
- Batched, prepared ns/op: 43.1559
+ Batched, prepared ns/op: 42.2923
...
- Random filter ns/op: 150.697
+ Random filter ns/op: 149.403
----------------------------
Outside queries...
- Dry run (980) ns/op: 34.6114
+ Dry run (980) ns/op: 34.0405
- Single filter ns/op: 56.8326
+ Single filter ns/op: 55.8414
- Batched, prepared ns/op: 48.2346
+ Batched, prepared ns/op: 47.5667
- Random filter ns/op: 155.377
+ Random filter ns/op: 153.942
Average FP rate %: 1.1386

Also, the FullFilterBitsReader ctor was responsible for a surprising
amount of CPU in production, due in part to inefficient determination of
the CACHE_LINE_SIZE used to construct the filter being read. The
overwhelming common case (same as my CACHE_LINE_SIZE) is now
substantially optimized, as shown with filter_bench with
-new_reader_every=1 (old option - see below) (repeatable result):

Inside queries...
- Dry run (453) ns/op: 118.799
+ Dry run (453) ns/op: 105.869
- Single filter ns/op: 82.5831
+ Single filter ns/op: 74.2509
...
- Random filter ns/op: 224.936
+ Random filter ns/op: 194.833
----------------------------
Outside queries...
- Dry run (aa1) ns/op: 118.503
+ Dry run (aa1) ns/op: 104.925
- Single filter ns/op: 90.3023
+ Single filter ns/op: 83.425
...
- Random filter ns/op: 220.455
+ Random filter ns/op: 175.7
Average FP rate %: 1.13886

However PR#5936 has/will reclaim most of this cost. After that PR, the optimization of this code path is likely negligible, but nonetheless it's clear we aren't making performance any worse.

Also fixed inadequate check of consistency between filter data size and
num_lines. (Unit test updated.)
Pull Request resolved: https://github.com/facebook/rocksdb/pull/5941

Test Plan:
previously added unit tests FullBloomTest.CorruptFilters and
FullBloomTest.RawSchema

Differential Revision: D18018353

Pulled By: pdillinger

fbshipit-source-id: 8e04c2b4a7d93223f49a237fd52ef2483929ed9c

2019-10-18 14:50:52 -07:00

buckifier

Change buckifier to support parameterized dependencies (#5648 )

2019-08-02 10:55:17 -07:00

build_tools

Remove deprecated RocksDBCommonHelper and cont_integration.sh (#5889 )

2019-10-09 07:40:35 -07:00

cache

Apply formatter to recent 200+ commits. (#5830 )

2019-09-20 12:04:26 -07:00

cmake

cmake: s/SNAPPY_LIBRARIES/snappy_LIBRARIES/ (#5687 )

2019-08-16 15:49:23 -07:00

coverage

Fix interpreter lines for files with python2-only syntax.

2019-07-09 10:51:37 -07:00

Fix PlainTableReader not to crash sst_dump (#5940 )

2019-10-18 14:44:42 -07:00

docs

Blog post for write_unprepared (#5711 )

2019-08-15 14:41:13 -07:00

env

Add Env::SanitizeEnvOptions (#5885 )

2019-10-14 12:25:00 -07:00

examples

Apply formatter to recent 200+ commits. (#5830 )

2019-09-20 12:04:26 -07:00

file

Apply formatter to recent 200+ commits. (#5830 )

2019-09-20 12:04:26 -07:00

hdfs

Add copyright headers per FB open-source checkup tool. (#5199 )

2019-04-18 10:55:01 -07:00

include/rocksdb

Expose db stress tests (#5937 )

2019-10-18 09:46:44 -07:00

java

Fix the rocksjava release Vagrant build on CentOS (#5901 )

2019-10-10 17:21:18 -07:00

logging

Apply formatter to recent 200+ commits. (#5830 )

2019-09-20 12:04:26 -07:00

memory

Charge block cache for cache internal usage (#5797 )

2019-09-16 15:26:21 -07:00

memtable

Charge block cache for cache internal usage (#5797 )

2019-09-16 15:26:21 -07:00

monitoring

Apply formatter to recent 200+ commits. (#5830 )

2019-09-20 12:04:26 -07:00

options

Apply formatter to recent 200+ commits. (#5830 )

2019-09-20 12:04:26 -07:00

port

Fix block cache ID uniqueness for Windows builds (#5844 )

2019-10-11 18:19:31 -07:00

table

Fix PlainTableReader not to crash sst_dump (#5940 )

2019-10-18 14:44:42 -07:00

test_util

Apply formatter to recent 200+ commits. (#5830 )

2019-09-20 12:04:26 -07:00

third-party

Refactor/consolidate legacy Bloom implementation details (#5784 )

2019-09-16 16:17:09 -07:00

tools

Fix PlainTableReader not to crash sst_dump (#5940 )

2019-10-18 14:44:42 -07:00

trace_replay

Enable trace_replay with multi-threads (#5934 )

2019-10-18 14:13:50 -07:00

util

Refactor / clean up / optimize FullFilterBitsReader (#5941 )

2019-10-18 14:50:52 -07:00

utilities

Move blob_index.h to db/ (#5919 )

2019-10-14 12:54:05 -07:00

.clang-format

A script that automatically reformat affected lines

2014-01-14 12:21:24 -08:00

.gitignore

Block cache simulator: Add pysim to simulate caches using reinforcement learning. (#5610 )

2019-07-26 14:41:13 -07:00

.lgtm.yml

Create lgtm.yml for LGTM.com C/C++ analysis (#4058 )

2018-06-26 12:43:04 -07:00

.travis.yml

Remove a webhook due to potential security concern (#5902 )

2019-10-10 18:05:16 -07:00

.watchmanconfig

Added .watchmanconfig file to rocksdb repo (#5593 )

2019-07-19 15:00:33 -07:00

appveyor.yml

New API to get all merge operands for a Key (#5604 )

2019-08-06 14:26:44 -07:00

AUTHORS

Update RocksDB Authors File

2017-10-18 14:42:10 -07:00

CMakeLists.txt

filter_bench - a prelim tool for SST filter benchmarking (#5825 )

2019-10-07 20:10:53 -07:00

CODE_OF_CONDUCT.md

Adopt Contributor Covenant

2019-08-29 23:21:01 -07:00

CONTRIBUTING.md

Add Code of Conduct

2017-12-05 18:42:35 -08:00

COPYING

Add GPLv2 as an alternative license.

2017-04-27 18:06:12 -07:00

DEFAULT_OPTIONS_HISTORY.md

options.delayed_write_rate use the rate of rate_limiter by default.

2017-05-24 09:58:24 -07:00

defs.bzl

Change buckifier to support parameterized dependencies (#5648 )

2019-08-02 10:55:17 -07:00

DUMP_FORMAT.md

First version of rocksdb_dump and rocksdb_undump.

2015-06-19 16:24:36 -07:00

HISTORY.md

Update HISTORY.md with recent BlobDB adjacent changes

2019-10-18 10:24:23 -07:00

INSTALL.md

Update the version of the dependencies used by the RocksJava static build (#4761 )

2018-12-18 20:25:43 -08:00

issue_template.md

Add a template for issues

2017-09-29 11:41:28 -07:00

LANGUAGE-BINDINGS.md

LANGUAGE-BINDINGS.md: mention python-rocksdb

2019-03-20 11:10:48 -07:00

LICENSE.Apache

Change RocksDB License

2017-07-15 16:11:23 -07:00

LICENSE.leveldb

Add back the LevelDB license file

2017-07-16 18:42:18 -07:00

Makefile

Expose db stress tests (#5937 )

2019-10-18 09:46:44 -07:00

README.md

Replaced some words (#5877 )

2019-10-07 12:28:09 -07:00

ROCKSDB_LITE.md

Fix some typos in comments and docs.

2018-03-08 10:27:25 -08:00

src.mk

Divide file_reader_writer.h and .cc (#5803 )

2019-09-16 10:33:51 -07:00

TARGETS

Divide file_reader_writer.h and .cc (#5803 )

2019-09-16 10:33:51 -07:00

thirdparty.inc

Fix build jemalloc api (#5470 )

2019-06-24 17:40:32 -07:00

USERS.md

Add avrio to USERS.md (#5748 )

2019-09-15 21:29:09 -07:00

Vagrantfile

Adding CentOS 7 Vagrantfile & build script

2018-02-26 15:27:17 -08:00

WINDOWS_PORT.md

#5145 , rename port/dirent.h to port/port_dirent.h to avoid compile err when use port dir as header dir output (#5152 )

2019-04-04 11:38:19 -07:00

README.md

RocksDB: A Persistent Key-Value Store for Flash and RAM Storage

RocksDB is developed and maintained by Facebook Database Engineering Team. It is built on earlier work on LevelDB by Sanjay Ghemawat (sanjay@google.com) and Jeff Dean (jeff@google.com)

This code is a library that forms the core building block for a fast key-value server, especially suited for storing data on flash drives. It has a Log-Structured-Merge-Database (LSM) design with flexible tradeoffs between Write-Amplification-Factor (WAF), Read-Amplification-Factor (RAF) and Space-Amplification-Factor (SAF). It has multi-threaded compactions, making it especially suitable for storing multiple terabytes of data in a single database.

Start with example usage here: https://github.com/facebook/rocksdb/tree/master/examples

See the github wiki for more explanation.

The public interface is in include/. Callers should not include or rely on the details of any other header files in this package. Those internal APIs may be changed without warning.

Design discussions are conducted in https://www.facebook.com/groups/rocksdb.dev/

License

RocksDB is dual-licensed under both the GPLv2 (found in the COPYING file in the root directory) and Apache 2.0 License (found in the LICENSE.Apache file in the root directory). You may select, at your option, one of the above-listed licenses.

Languages

C++ 82.1%

Java 10.3%

C 2.5%

Python 1.7%

Perl 1.1%

Other 2.1%