rocksdb

A library that provides an embeddable, persistent key-value store for fast storage.

Andrew Kryczka 62f70f6d14 Reduce scope of compression dictionary to single SST (#4952)	2019-02-11 19:47:32 -08:00

Summary: Our previous approach was to train one compression dictionary per compaction, using the first output SST to train a dictionary, and then applying it on subsequent SSTs in the same compaction. While this was great for minimizing CPU/memory/I/O overhead, it did not achieve good compression ratios in practice. In our most promising potential use case, moderate reductions in a dictionary's scope make a major difference on compression ratio.

So, this PR changes compression dictionary to be scoped per-SST. It accepts the tradeoff during table building to use more memory and CPU. Important changes include:

- The `BlockBasedTableBuilder` has a new state when dictionary compression is in-use: `kBuffered`. In that state it accumulates uncompressed data in-memory whenever `Add` is called.
- After accumulating target file size bytes or calling `BlockBasedTableBuilder::Finish`, a `BlockBasedTableBuilder` moves to the `kUnbuffered` state. The transition (`EnterUnbuffered()`) involves sampling the buffered data, training a dictionary, and compressing/writing out all buffered data. In the `kUnbuffered` state, a `BlockBasedTableBuilder` behaves the same as before -- blocks are compressed/written out as soon as they fill up.
- Samples are now whole uncompressed data blocks, except the final sample may be a partial data block so we don't breach the user's configured `max_dict_bytes` or `zstd_max_train_bytes`. The dictionary trainer is supposed to work better when we pass it real units of compression. Previously we were passing 64-byte KV samples which was not realistic.

Pull Request resolved: https://github.com/facebook/rocksdb/pull/4952
Differential Revision: D13967980
Pulled By: ajkr
fbshipit-source-id: 82bea6f7537e1529c7a1a4cdee84585f5949300f

buckifier	Fix skylark incompatible build files in rocksdb	2019-01-07 13:37:40 -08:00
build_tools	Add latest toolchain (gcc-8, etc.) build support for fbcode users (#4923 )	2019-01-28 11:26:32 -08:00
cache	Revert "Move MemoryAllocator option from Cache to BlockBasedTableOpti… (#4697 )	2018-11-21 11:29:57 -08:00
cmake	Make FindZLIB consistent with official definitions (#4823 )	2019-01-02 12:49:57 -08:00
coverage	Remove unused imports, from python scripts. (#4057 )	2018-06-26 12:43:04 -07:00
db	Reduce scope of compression dictionary to single SST (#4952 )	2019-02-11 19:47:32 -08:00
docs	Insane line length detected (#4813 )	2018-12-21 14:54:34 -08:00
env	fix for nvme device path (#4866 )	2019-01-31 19:08:37 -08:00
examples	Pin top-level index on partitioned index/filter blocks (#4037 )	2018-06-22 15:27:46 -07:00
hdfs	Update all unique/shared_ptr instances to be qualified with namespace std (#4638 )	2018-11-09 11:19:58 -08:00
include/rocksdb	WritePrepared: add private options to TransactionDBOptions (#4966 )	2019-02-11 14:44:02 -08:00
java	Remove PlainTable's feature store_index_in_file (#4914 )	2019-01-28 12:50:22 -08:00
memtable	Remove cuckoo hash memtable (#4953 )	2019-02-07 16:15:27 -08:00
monitoring	Allow copy for PerfContext objects (#4919 )	2019-02-05 14:29:08 -08:00
options	Remove cuckoo hash memtable (#4953 )	2019-02-07 16:15:27 -08:00
port	Detect if Jemalloc is linked with the binary (#4844 )	2019-01-03 16:30:12 -08:00
table	Reduce scope of compression dictionary to single SST (#4952 )	2019-02-11 19:47:32 -08:00
third-party	Support pragma once in all header files and cleanup some warnings (#4339 )	2018-09-05 18:13:31 -07:00
tools	Reduce scope of compression dictionary to single SST (#4952 )	2019-02-11 19:47:32 -08:00
util	Reduce scope of compression dictionary to single SST (#4952 )	2019-02-11 19:47:32 -08:00
utilities	Enhance transaction_test_util with delays (#4970 )	2019-02-11 16:02:37 -08:00
.clang-format	A script that automatically reformat affected lines	2014-01-14 12:21:24 -08:00
.gitignore	RocksDB Trace Analyzer (#4091 )	2018-08-13 11:44:02 -07:00
.lgtm.yml	Create lgtm.yml for LGTM.com C/C++ analysis (#4058 )	2018-06-26 12:43:04 -07:00
.travis.yml	Fix printf formatting on MacOS (#4533 )	2018-10-19 14:46:09 -07:00
appveyor.yml	Add RocksJava build to AppVeyor	2019-01-03 10:44:44 -08:00
AUTHORS	Update RocksDB Authors File	2017-10-18 14:42:10 -07:00
CMakeLists.txt	Remove cuckoo hash memtable (#4953 )	2019-02-07 16:15:27 -08:00
CODE_OF_CONDUCT.md	Add Code of Conduct	2017-12-05 18:42:35 -08:00
CONTRIBUTING.md	Add Code of Conduct	2017-12-05 18:42:35 -08:00
COPYING	Add GPLv2 as an alternative license.	2017-04-27 18:06:12 -07:00
DEFAULT_OPTIONS_HISTORY.md	options.delayed_write_rate use the rate of rate_limiter by default.	2017-05-24 09:58:24 -07:00
DUMP_FORMAT.md	First version of rocksdb_dump and rocksdb_undump.	2015-06-19 16:24:36 -07:00
HISTORY.md	Reduce scope of compression dictionary to single SST (#4952 )	2019-02-11 19:47:32 -08:00
INSTALL.md	Update the version of the dependencies used by the RocksJava static build (#4761 )	2018-12-18 20:25:43 -08:00
issue_template.md	Add a template for issues	2017-09-29 11:41:28 -07:00
LANGUAGE-BINDINGS.md	Added PingCaps Rust RocksDB and ObjectiveRocks (#4065 )	2018-06-27 15:43:21 -07:00
LICENSE.Apache	Change RocksDB License	2017-07-15 16:11:23 -07:00
LICENSE.leveldb	Add back the LevelDB license file	2017-07-16 18:42:18 -07:00
Makefile	Change the command to invoke parallel tests (#4922 )	2019-01-28 15:02:26 -08:00
README.md	Create lgtm.yml for LGTM.com C/C++ analysis (#4058 )	2018-06-26 12:43:04 -07:00
ROCKSDB_LITE.md	Fix some typos in comments and docs.	2018-03-08 10:27:25 -08:00
src.mk	Remove cuckoo hash memtable (#4953 )	2019-02-07 16:15:27 -08:00
TARGETS	Remove cuckoo hash memtable (#4953 )	2019-02-07 16:15:27 -08:00
thirdparty.inc	Provide a way to override windows memory allocator with jemalloc for ZSTD	2018-06-04 12:12:48 -07:00
USERS.md	Adding IOTA Foundation to USERS.MD (#4436 )	2018-10-02 10:03:46 -07:00
Vagrantfile	Adding CentOS 7 Vagrantfile & build script	2018-02-26 15:27:17 -08:00
WINDOWS_PORT.md	Add GCC 8 to Travis (#3433 )	2018-07-13 10:58:06 -07:00

README.md

RocksDB: A Persistent Key-Value Store for Flash and RAM Storage

RocksDB is developed and maintained by the Facebook Database Engineering Team. It is built on earlier work on LevelDB by Sanjay Ghemawat (sanjay@google.com) and Jeff Dean (jeff@google.com).

This code is a library that forms the core building block for a fast key-value server, especially suited for storing data on flash drives. It has a Log-Structured-Merge-Database (LSM) design with flexible tradeoffs between Write-Amplification-Factor (WAF), Read-Amplification-Factor (RAF) and Space-Amplification-Factor (SAF). It has multi-threaded compactions, making it especially suitable for storing multiple terabytes of data in a single database.
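
To make the amplification tradeoffs above more concrete, here is a minimal toy sketch of an LSM-style store. It is not RocksDB's implementation and uses none of its API: the class `ToyLsm`, its methods, and the in-memory "runs" standing in for on-disk files are all hypothetical names chosen for illustration. It only shows the general idea that flushing writes into several sorted runs makes reads probe more runs and keeps stale copies around, while compaction rewrites data to reduce both.

```cpp
#include <cassert>
#include <cstddef>
#include <map>
#include <string>
#include <utility>
#include <vector>

// Toy LSM-style store for illustration only (hypothetical, not RocksDB code).
// "Runs" are sorted in-memory maps standing in for immutable files on disk.
class ToyLsm {
 public:
  explicit ToyLsm(std::size_t memtable_limit)
      : memtable_limit_(memtable_limit) {
    assert(memtable_limit > 0);
  }

  // Writes land in the memtable; a full memtable is flushed as a new run.
  void Put(const std::string& key, const std::string& value) {
    memtable_[key] = value;
    if (memtable_.size() >= memtable_limit_) Flush();
  }

  // Reads check the memtable, then the runs from newest to oldest.
  // *runs_probed reports how many runs were searched, a rough proxy
  // for read amplification.
  bool Get(const std::string& key, std::string* value,
           std::size_t* runs_probed = nullptr) const {
    std::size_t probed = 0;
    bool found = false;
    auto mem_hit = memtable_.find(key);
    if (mem_hit != memtable_.end()) {
      *value = mem_hit->second;
      found = true;
    } else {
      for (auto run = runs_.rbegin(); run != runs_.rend(); ++run) {
        ++probed;
        auto hit = run->find(key);
        if (hit != run->end()) {
          *value = hit->second;
          found = true;
          break;
        }
      }
    }
    if (runs_probed != nullptr) *runs_probed = probed;
    return found;
  }

  // Compaction merges all runs into one, keeping the newest value per key.
  // Every surviving entry is rewritten, which is the write-amplification cost.
  void Compact() {
    if (runs_.size() < 2) return;
    std::map<std::string, std::string> merged;
    for (const auto& run : runs_) {  // oldest to newest: newer overwrites older
      for (const auto& kv : run) merged[kv.first] = kv.second;
    }
    runs_.clear();
    runs_.push_back(std::move(merged));
  }

  std::size_t NumRuns() const { return runs_.size(); }

  // Total entries held in runs, including stale duplicates; a rough proxy
  // for space amplification.
  std::size_t EntriesInRuns() const {
    std::size_t total = 0;
    for (const auto& run : runs_) total += run.size();
    return total;
  }

 private:
  void Flush() {
    runs_.push_back(std::move(memtable_));
    memtable_.clear();  // leave the moved-from map in a known empty state
  }

  std::size_t memtable_limit_;
  std::map<std::string, std::string> memtable_;
  std::vector<std::map<std::string, std::string>> runs_;
};
```

In this sketch, overwriting a key after a flush leaves the old value in an older run until `Compact()` is called, so `EntriesInRuns()` can exceed the number of live keys, and a lookup for a key that lives only in the oldest run has to probe every newer run first. Compacting more often lowers those two costs at the price of rewriting data more often.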

Start with example usage here: https://github.com/facebook/rocksdb/tree/master/examples

See the GitHub wiki for more explanation.

The public interface is in include/. Callers should not include or rely on the details of any other header files in this package. Those internal APIs may be changed without warning.

Design discussions take place at https://www.facebook.com/groups/rocksdb.dev/

License

RocksDB is dual-licensed under both the GPLv2 (found in the COPYING file in the root directory) and Apache 2.0 License (found in the LICENSE.Apache file in the root directory). You may select, at your option, one of the above-listed licenses.