A library that provides an embeddable, persistent key-value store for fast storage.
Go to file
Maysam Yabandeh 58410aee44 Fix the overflow bug in AwaitState
Summary:
https://github.com/facebook/rocksdb/issues/2559 reports an overflow in AwaitState. nbronson has debugged the issue and presented the fix, which is applied to this patch. Moreover this patch adds more comments to clarify the logic in AwaitState.

I tried with both 16 and 64 threads on update benchmark. The fix lowers cpu usage by 1.6 but also lowers the throughput by 1.6 and 2% respectively. Apparently the bug had favored using the spinning more often.

Benchmarks:
TEST_TMPDIR=/dev/shm/tmpdb time ./db_bench --benchmarks="fillrandom" --threads=16 --num=2000000
TEST_TMPDIR=/dev/shm/tmpdb time ./db_bench --use_existing_db=1 --benchmarks="updaterandom[X3]" --threads=16 --num=2000000
TEST_TMPDIR=/dev/shm/tmpdb time ./db_bench --use_existing_db=1 --benchmarks="updaterandom[X3]" --threads=64 --num=200000

Results
$ cat update-16t-bug.txt | tail -4
updaterandom [AVG    3 runs] : 234117 ops/sec;   51.8 MB/sec
updaterandom [MEDIAN 3 runs] : 233581 ops/sec;   51.7 MB/sec
3896.42user 1539.12system 6:50.61elapsed 1323%CPU (0avgtext+0avgdata 331308maxresident)k
0inputs+0outputs (0major+1281001minor)pagefaults 0swaps
$ cat update-16t-fixed.txt | tail -4
updaterandom [AVG    3 runs] : 230364 ops/sec;   51.0 MB/sec
updaterandom [MEDIAN 3 runs] : 226169 ops/sec;   50.0 MB/sec
3865.46user 1568.32system 6:57.63elapsed 1301%CPU (0avgtext+0avgdata 315012maxresident)k
0inputs+0outputs (0major+1342568minor)pagefaults 0swaps

$ cat update-64t-bug.txt | tail -4
updaterandom [AVG    3 runs] : 261878 ops/sec;   57.9 MB/sec
updaterandom [MEDIAN 3 runs] : 262859 ops/sec;   58.2 MB/sec
926.27user 578.06system 2:27.46elapsed 1020%CPU (0avgtext+0avgdata 475480maxresident)k
0inputs+0outputs (0major+1058728minor)pagefaults 0swaps
$ cat update-64t-fixed.txt | tail -4
updaterandom [AVG    3 runs] : 256699 ops/sec;   56.8 MB/sec
updaterandom [MEDIAN 3 runs] : 256380 ops/sec;   56.7 MB/sec
933.47user 575.37system 2:30.41elapsed 1003%CPU (0avgtext+0avgdata 482340maxresident)k
0inputs+0outputs (0major+1078557minor)pagefaults 0swaps
Closes https://github.com/facebook/rocksdb/pull/2679

Differential Revision: D5553732

Pulled By: maysamyabandeh

fbshipit-source-id: 98b72dc3a8e0f22ea29d4f7c7790af10c369c5bb
2017-08-03 10:43:28 -07:00
arcanist_util Remove arcanist_util directory 2017-07-19 16:49:55 -07:00
buckifier TARGETS file not setting sse explicitly 2017-07-27 17:41:36 -07:00
build_tools Fix use of RocksDBCommonHelper in cont_integration.sh 2017-07-26 19:31:36 -07:00
cache Replace dynamic_cast<> 2017-07-28 16:27:16 -07:00
cmake/modules CMake: more MinGW fixes 2017-04-06 14:09:13 -07:00
coverage Fix coverage script 2014-11-03 14:53:00 -08:00
db Fix the overflow bug in AwaitState 2017-08-03 10:43:28 -07:00
docs 5.6.1 release blog post 2017-07-25 12:27:22 -07:00
env Replace dynamic_cast<> 2017-07-28 16:27:16 -07:00
examples Replace dynamic_cast<> 2017-07-28 16:27:16 -07:00
hdfs Revert "comment out unused parameters" 2017-07-21 18:26:26 -07:00
include/rocksdb Replace dynamic_cast<> 2017-07-28 16:27:16 -07:00
java Fix statistics in RocksJava sample 2017-08-01 16:58:26 -07:00
memtable Revert "comment out unused parameters" 2017-07-21 18:26:26 -07:00
monitoring Replace dynamic_cast<> 2017-07-28 16:27:16 -07:00
options Replace dynamic_cast<> 2017-07-28 16:27:16 -07:00
port LRUCacheShard cache line size alignment 2017-07-24 10:54:37 -07:00
table Replace dynamic_cast<> 2017-07-28 16:27:16 -07:00
third-party Revert "comment out unused parameters" 2017-07-21 18:26:26 -07:00
tools support multiple CFs with OPTIONS file 2017-08-02 16:27:01 -07:00
util Add a missing "once" in .h 2017-07-31 12:12:03 -07:00
utilities Refactor TransactionImpl 2017-08-03 08:57:22 -07:00
.clang-format A script that automatically reformat affected lines 2014-01-14 12:21:24 -08:00
.gitignore Simple blob file dumper 2017-05-23 10:42:59 -07:00
.travis.yml Build fewer tests in Travis platform_dependent tests 2017-07-27 17:29:01 -07:00
appveyor.yml Rework test running script. 2017-04-05 11:39:20 -07:00
AUTHORS Add AUTHORS file. Fix #203 2014-09-29 10:52:18 -07:00
CMakeLists.txt Refactor TransactionImpl 2017-08-03 08:57:22 -07:00
CONTRIBUTING.md Remove the licensing description in CONTRIBUTING.md 2017-07-16 15:57:18 -07:00
COPYING Add GPLv2 as an alternative license. 2017-04-27 18:06:12 -07:00
DEFAULT_OPTIONS_HISTORY.md options.delayed_write_rate use the rate of rate_limiter by default. 2017-05-24 09:58:24 -07:00
DUMP_FORMAT.md First version of rocksdb_dump and rocksdb_undump. 2015-06-19 16:24:36 -07:00
HISTORY.md fix db get/write stats 2017-07-31 12:12:03 -07:00
INSTALL.md add vcpkg as an windows option 2017-07-24 15:12:45 -07:00
LANGUAGE-BINDINGS.md Adding Dlang to the list 2017-02-16 17:24:10 -08:00
LICENSE.Apache Change RocksDB License 2017-07-15 16:11:23 -07:00
LICENSE.leveldb Add back the LevelDB license file 2017-07-16 18:42:18 -07:00
Makefile Replace dynamic_cast<> 2017-07-28 16:27:16 -07:00
README.md Appveyor badge to show master branch 2016-07-26 13:54:08 -07:00
ROCKSDB_LITE.md Optimistic Transactions 2015-05-29 14:36:35 -07:00
src.mk Refactor TransactionImpl 2017-08-03 08:57:22 -07:00
TARGETS Dump Blob DB options to info log 2017-08-01 13:01:47 -07:00
thirdparty.inc Introduce XPRESS compresssion on Windows. (#1081) 2016-04-19 22:54:24 -07:00
USERS.md fixed typo 2017-06-13 16:58:01 -07:00
Vagrantfile Update Vagrant file (test internal phabricator workflow) 2016-10-28 15:39:19 -07:00
WINDOWS_PORT.md Commit both PR and internal code review changes 2015-07-07 16:58:20 -07:00

RocksDB: A Persistent Key-Value Store for Flash and RAM Storage

Build Status Build status

RocksDB is developed and maintained by Facebook Database Engineering Team. It is built on earlier work on LevelDB by Sanjay Ghemawat (sanjay@google.com) and Jeff Dean (jeff@google.com)

This code is a library that forms the core building block for a fast key value server, especially suited for storing data on flash drives. It has a Log-Structured-Merge-Database (LSM) design with flexible tradeoffs between Write-Amplification-Factor (WAF), Read-Amplification-Factor (RAF) and Space-Amplification-Factor (SAF). It has multi-threaded compactions, making it specially suitable for storing multiple terabytes of data in a single database.

Start with example usage here: https://github.com/facebook/rocksdb/tree/master/examples

See the github wiki for more explanation.

The public interface is in include/. Callers should not include or rely on the details of any other header files in this package. Those internal APIs may be changed without warning.

Design discussions are conducted in https://www.facebook.com/groups/rocksdb.dev/