rocksdb

Andrew Kryczka d904233d2f Limit buffering for collecting samples for compression dictionary (#7970 )

Summary:
For dictionary compression, we need to collect some representative samples of the data to be compressed, which we use to either generate or train (when `CompressionOptions::zstd_max_train_bytes > 0`) a dictionary. Previously, the strategy was to buffer all the data blocks during flush, and up to the target file size during compaction. That strategy allowed us to randomly pick samples from as wide a range as possible that'd be guaranteed to land in a single output file.
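A minimal sketch of the sampling idea described above, assuming the buffered data blocks are available as strings and that samples are simply concatenated up to a byte budget. `CollectSamples` and its parameters are hypothetical names for illustration and are not RocksDB code. Shuffling the block indices means each block is picked at most once.

```cpp
#include <algorithm>
#include <cassert>
#include <cstddef>
#include <numeric>
#include <random>
#include <string>
#include <vector>

// Hypothetical helper, not RocksDB code: visits the buffered blocks in random
// order and concatenates them until `max_sample_bytes` is reached.
std::string CollectSamples(const std::vector<std::string>& buffered_blocks,
                           std::size_t max_sample_bytes, std::mt19937& rng) {
  // Shuffle the block indices so that every block is chosen at most once.
  std::vector<std::size_t> order(buffered_blocks.size());
  std::iota(order.begin(), order.end(), std::size_t{0});
  std::shuffle(order.begin(), order.end(), rng);

  std::string samples;
  for (std::size_t idx : order) {
    if (samples.size() >= max_sample_bytes) break;
    const std::string& block = buffered_blocks[idx];
    // Truncate the last sample so the total never exceeds the budget.
    const std::size_t room = max_sample_bytes - samples.size();
    samples.append(block, 0, std::min(room, block.size()));
  }
  assert(samples.size() <= max_sample_bytes);
  return samples;
}
```

In this sketch the returned bytes would either serve directly as the dictionary or be handed to a trainer; that split is an assumption drawn from the "generate or train" wording above.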

However, some users try to make huge files in memory-constrained environments, where this strategy can cause OOM. This PR introduces an option, `CompressionOptions::max_dict_buffer_bytes`, that limits how many bytes of data blocks are buffered before we switch to unbuffered mode (which means creating the per-SST dictionary, writing out the buffered data, and compressing/writing new blocks as soon as they are built). The limit is not strict, as we currently buffer more than just data blocks: keys are buffered as well. But it does make a step towards giving users predictable memory usage.
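The switch from buffered to unbuffered mode can be modeled roughly as follows. This is a simplified sketch, not the actual table builder: `SketchTableBuilder` and its methods are hypothetical, compression and file I/O are replaced by a counter, and treating a limit of 0 as "no limit" is an assumption of the sketch.

```cpp
#include <cassert>
#include <cstddef>
#include <string>
#include <utility>
#include <vector>

// Hypothetical model of the mode switch; names are illustrative only.
class SketchTableBuilder {
 public:
  // Assumption of this sketch: a limit of 0 means "buffer without limit".
  explicit SketchTableBuilder(std::size_t max_dict_buffer_bytes)
      : limit_(max_dict_buffer_bytes) {}

  void AddDataBlock(std::string block) {
    if (!buffering_) {
      WriteCompressed(block);  // Unbuffered: compress and write right away.
      return;
    }
    buffered_bytes_ += block.size();
    buffered_.push_back(std::move(block));
    if (limit_ != 0 && buffered_bytes_ >= limit_) EnterUnbuffered();
  }

  // At the end of the file, anything still buffered must be written out.
  void Finish() {
    if (buffering_) EnterUnbuffered();
  }

  bool buffering() const { return buffering_; }
  std::size_t blocks_written() const { return blocks_written_; }

 private:
  void EnterUnbuffered() {
    // A real builder would create the per-SST dictionary from samples of
    // `buffered_` at this point; the sketch only performs the mode switch.
    buffering_ = false;
    for (const std::string& block : buffered_) WriteCompressed(block);
    buffered_.clear();
    buffered_bytes_ = 0;
  }

  void WriteCompressed(const std::string& block) {
    assert(!buffering_);
    (void)block;  // Stand-in for compressing with the dictionary and writing.
    ++blocks_written_;
  }

  std::size_t limit_;
  std::size_t buffered_bytes_ = 0;
  std::size_t blocks_written_ = 0;
  bool buffering_ = true;
  std::vector<std::string> buffered_;
};
```

Only data block bytes are counted here, which mirrors the caveat above that the limit is not strict: other buffered state, such as keys, would not be covered by the counter.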

Related changes include:

- Changed sampling for dictionary compression to select unique data blocks when there is limited availability of data blocks
- Made use of `BlockBuilder::SwapAndReset()` to save an allocation+memcpy when buffering data blocks for building a dictionary
- Changed `ParseBoolean()` to accept an input containing characters after the boolean. This is necessary since, with this PR, a value for `CompressionOptions::enabled` is no longer necessarily the final component in the `CompressionOptions` string.
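To illustrate the last item, here is a sketch of a boolean parser that tolerates trailing characters. `ParseBooleanPrefix` is a hypothetical stand-in and not the real `ParseBoolean()`; the accepted literals (`true`, `false`, `1`, `0`) and the colon-separated example inputs are assumptions made for the illustration.

```cpp
#include <cassert>
#include <cstddef>
#include <stdexcept>
#include <string>

// Hypothetical sketch: accepts any input that starts with a boolean literal
// and ignores whatever characters follow it.
bool ParseBooleanPrefix(const std::string& value) {
  // compare(pos, len, s) compares at most `len` characters starting at `pos`,
  // so inputs shorter than the literal simply fail to match.
  if (value.compare(0, 4, "true") == 0 || value.compare(0, 1, "1") == 0) {
    return true;
  }
  if (value.compare(0, 5, "false") == 0 || value.compare(0, 1, "0") == 0) {
    return false;
  }
  throw std::invalid_argument("not a boolean: " + value);
}
```

With such a parser, an input like `true:1048576` yields `true` even though another component follows the boolean, whereas a parser that requires an exact match would reject it.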

Pull Request resolved: https://github.com/facebook/rocksdb/pull/7970

Test Plan:
- updated `CompressionOptions` unit tests to verify limit is respected (to the extent expected in the current implementation) in various scenarios of flush/compaction to bottommost/non-bottommost level
- looked at jemalloc heap profiles right before and after switching to unbuffered mode during flush/compaction. Verified memory usage in buffering is proportional to the limit set.

Reviewed By: pdillinger

Differential Revision: D26467994

Pulled By: ajkr

fbshipit-source-id: 3da4ef9fba59974e4ef40e40c01611002c861465

2021-02-19 14:09:54 -08:00

.circleci

Add circleci format_compatible nightly build (#7926 )

2021-02-09 20:48:53 -08:00

.github/workflows

Update clang-format-diff.py path (#7944 )

2021-02-09 12:49:38 -08:00

buckifier

Update zstd in buck build (#7923 )

2021-02-08 14:46:01 -08:00

build_tools

Use actual url instead of tinyurl.com (#7950 )

2021-02-10 10:08:09 -08:00

cache

Add a SystemClock class to capture the time functions of an Env (#7858 )

2021-01-25 22:09:11 -08:00

cmake

Add find_dependency() in cmake config file. (#6791 )

2020-05-12 21:18:29 -07:00

coverage

Find the correct gcov (#6904 )

2020-06-01 16:33:05 -07:00

Limit buffering for collecting samples for compression dictionary (#7970 )

2021-02-19 14:09:54 -08:00

db_stress_tool

Limit buffering for collecting samples for compression dictionary (#7970 )

2021-02-19 14:09:54 -08:00

docs

Update github-pages and dependencies (#7850 )

2021-01-11 12:48:01 -08:00

env

Wal recovery failure with encryption due to zero bytes WAL size. (#7924 )

2021-02-05 12:40:52 -08:00

examples

Bring the Configurable options together (#5753 )

2020-09-14 17:01:01 -07:00

file

Fix typo: replace readadhead with readahead (#7953 )

2021-02-18 14:31:20 -08:00

fuzz

Remove Legacy and Custom FileWrapper classes from header files (#7851 )

2021-01-28 22:10:32 -08:00

hdfs

fix build with 'USE_HDFS' on windows (#6950 )

2020-06-12 16:21:50 -07:00

include/rocksdb

Limit buffering for collecting samples for compression dictionary (#7970 )

2021-02-19 14:09:54 -08:00

java

Limit buffering for collecting samples for compression dictionary (#7970 )

2021-02-19 14:09:54 -08:00

logging

Add a SystemClock class to capture the time functions of an Env (#7858 )

2021-01-25 22:09:11 -08:00

memory

slightly improve jemalloc allocator API header (#7592 )

2020-10-28 13:47:12 -07:00

memtable

Add a SystemClock class to capture the time functions of an Env (#7858 )

2021-01-25 22:09:11 -08:00

monitoring

Add a SystemClock class to capture the time functions of an Env (#7858 )

2021-01-25 22:09:11 -08:00

options

Limit buffering for collecting samples for compression dictionary (#7970 )

2021-02-19 14:09:54 -08:00

plugin

Makefile support to statically link external plugin code (#7918 )

2021-02-10 08:35:34 -08:00

port

Update win_logger.cc : assert failed when return value not checked. (-DROCKSDB_ASSERT_STATUS_CHECKED) (#7955 )

2021-02-18 16:34:10 -08:00

table

Limit buffering for collecting samples for compression dictionary (#7970 )

2021-02-19 14:09:54 -08:00

test_util

Add a SystemClock class to capture the time functions of an Env (#7858 )

2021-01-25 22:09:11 -08:00

third-party

Fix Compilation on ppc64le using Clang 11 (#7713 )

2020-12-01 11:21:44 -08:00

tools

Limit buffering for collecting samples for compression dictionary (#7970 )

2021-02-19 14:09:54 -08:00

trace_replay

Introduce a new trace file format (v 0.2) for better extension (#7977 )

2021-02-18 23:05:35 -08:00

util

Limit buffering for collecting samples for compression dictionary (#7970 )

2021-02-19 14:09:54 -08:00

utilities

Fix an assertion failure in range locking, locktree code. (#7938 )

2021-02-18 18:15:19 -08:00

.clang-format

A script that automatically reformat affected lines

2014-01-14 12:21:24 -08:00

.gitignore

gitignore cmake-build-* for CLion integration (#7933 )

2021-02-19 13:43:15 -08:00

.lgtm.yml

Create lgtm.yml for LGTM.com C/C++ analysis (#4058 )

2018-06-26 12:43:04 -07:00

.travis.yml

Cleanup Travis CI config (#7848 )

2021-01-11 10:30:28 -08:00

.watchmanconfig

Added .watchmanconfig file to rocksdb repo (#5593 )

2019-07-19 15:00:33 -07:00

appveyor.yml

Remove 2019 from appveyor (#7038 )

2020-06-29 14:31:41 -07:00

AUTHORS

Update RocksDB Authors File

2017-10-18 14:42:10 -07:00

CMakeLists.txt

Build a full RocksDB on M1 macs (#7943 )

2021-02-10 10:13:59 -08:00

CODE_OF_CONDUCT.md

Adopt Contributor Covenant

2019-08-29 23:21:01 -07:00

CONTRIBUTING.md

Add Code of Conduct

2017-12-05 18:42:35 -08:00

COPYING

Add GPLv2 as an alternative license.

2017-04-27 18:06:12 -07:00

DEFAULT_OPTIONS_HISTORY.md

options.delayed_write_rate use the rate of rate_limiter by default.

2017-05-24 09:58:24 -07:00

defs.bzl

Make testpilot recognize that these tests have coverage instrumentation

2020-03-20 11:23:23 -07:00

DUMP_FORMAT.md

First version of rocksdb_dump and rocksdb_undump.

2015-06-19 16:24:36 -07:00

HISTORY.md

Limit buffering for collecting samples for compression dictionary (#7970 )

2021-02-19 14:09:54 -08:00

INSTALL.md

Update the version of the dependencies used by the RocksJava static build (#4761 )

2018-12-18 20:25:43 -08:00

issue_template.md

Add Google Group to Issue Template

2020-01-28 14:40:37 -08:00

LANGUAGE-BINDINGS.md

Add RestoreDBFromLatestBackup to C API, add new C# package (#7092 )

2020-07-08 11:56:41 -07:00

LICENSE.Apache

Change RocksDB License

2017-07-15 16:11:23 -07:00

LICENSE.leveldb

Add back the LevelDB license file

2017-07-16 18:42:18 -07:00

Makefile

Makefile support to statically link external plugin code (#7918 )

2021-02-10 08:35:34 -08:00

PLUGINS.md

Makefile support to statically link external plugin code (#7918 )

2021-02-10 08:35:34 -08:00

README.md

Fix the CI badge for ppc64le Jenkins (#7561 )

2020-10-16 09:00:56 -07:00

ROCKSDB_LITE.md

Fix some typos in comments and docs.

2018-03-08 10:27:25 -08:00

src.mk

Integrity protection for live updates to WriteBatch (#7748 )

2021-01-29 12:18:58 -08:00

TARGETS

Update zstd in buck build (#7923 )

2021-02-08 14:46:01 -08:00

thirdparty.inc

Fix build jemalloc api (#5470 )

2019-06-24 17:40:32 -07:00

USERS.md

Add Apache Doris to USERS (#7865 )

2021-01-19 15:31:56 -08:00

Vagrantfile

Adding CentOS 7 Vagrantfile & build script

2018-02-26 15:27:17 -08:00

WINDOWS_PORT.md

#5145 , rename port/dirent.h to port/port_dirent.h to avoid compile err when use port dir as header dir output (#5152 )

2019-04-04 11:38:19 -07:00

README.md

RocksDB: A Persistent Key-Value Store for Flash and RAM Storage

RocksDB is developed and maintained by the Facebook Database Engineering Team. It is built on earlier work on LevelDB by Sanjay Ghemawat (sanjay@google.com) and Jeff Dean (jeff@google.com).

This code is a library that forms the core building block for a fast key-value server, especially suited for storing data on flash drives. It has a Log-Structured-Merge-Database (LSM) design with flexible tradeoffs between Write-Amplification-Factor (WAF), Read-Amplification-Factor (RAF) and Space-Amplification-Factor (SAF). It has multi-threaded compactions, making it especially suitable for storing multiple terabytes of data in a single database.

Start with example usage here: https://github.com/facebook/rocksdb/tree/master/examples

See the GitHub wiki for more explanation.

The public interface is in include/. Callers should not include or rely on the details of any other header files in this package. Those internal APIs may be changed without warning.

Design discussions are conducted in https://www.facebook.com/groups/rocksdb.dev/ and https://rocksdb.slack.com/

License

RocksDB is dual-licensed under both the GPLv2 (found in the COPYING file in the root directory) and Apache 2.0 License (found in the LICENSE.Apache file in the root directory). You may select, at your option, one of the above-listed licenses.

Languages

C++ 82.1%

Java 10.3%

C 2.5%

Python 1.7%

Perl 1.1%

Other 2.1%