A library that provides an embeddable, persistent key-value store for fast storage.
Go to file
Nathan Bronson 1ae27113c7 reduce comparisons by skiplist
Summary:
Key comparison is the single largest CPU user for CPU-bound
workloads. This diff reduces the number of comparisons in two ways.

The first is that it moves predecessor array gathering from
FindGreaterOrEqual to FindLessThan, so that FindGreaterOrEqual can
return immediately if compare_ returns 0.  As part of this change I
moved the sequential insertion optimization into Insert, to remove the
undocumented (and smelly) requirement that prev must be equal to prev_
if it is non-null.

The second optimization is that all of the search functions skip calling
compare_ when moving to a lower level that has the same Next pointer.
With a branching factor of 4 we would expect this to happen 1/4 of
the time.

On a single-threaded CPU-bound workload (-benchmarks=fillrandom -threads=1
-batch_size=1 -memtablerep=skip_list -value_size=0 --num=1600000
-level0_slowdown_writes_trigger=9999 -level0_stop_writes_trigger=9999
-disable_auto_compactions --max_write_buffer_number=8
-max_background_flushes=8 --disable_wal --write_buffer_size=160000000)
on my dev server this is good for a 7% perf win.

Test Plan: unit tests

Reviewers: rven, ljin, yhchiang, sdong, igor

Reviewed By: igor

Subscribers: dhruba

Differential Revision: https://reviews.facebook.net/D43233
2015-08-11 11:25:22 -07:00
arcanist_util Modernize RocksDB linters 2015-08-10 13:58:55 -07:00
build_tools Upgrading jemalloc from 3.6.0 to the latest for fbcode+gcc 4.8.1 2015-08-04 16:35:26 -07:00
coverage Fix coverage script 2014-11-03 14:53:00 -08:00
db reduce comparisons by skiplist 2015-08-11 11:25:22 -07:00
doc Remove seek compaction 2014-06-20 10:23:02 +02:00
examples [API Change] Improve EventListener::OnFlushCompleted interface 2015-06-05 12:28:51 -07:00
hdfs Improved FileExists API 2015-07-20 17:20:40 -07:00
include/rocksdb Better CompactionJob testing 2015-08-07 21:59:51 -07:00
java Update JAVA-HISTORY.md for v3.13 2015-08-04 18:12:58 -07:00
port Fix WinEnv::NowMicrosec 2015-07-22 14:36:43 -07:00
table Better CompactionJob testing 2015-08-07 21:59:51 -07:00
third-party "make format" against last 10 commits 2015-07-13 13:50:18 -07:00
tools crash_test cleans up directory before testing if TEST_TMPDIR is set 2015-08-04 14:59:28 -07:00
util Fixed Segmentation Fault in db_stress on OSX. 2015-08-11 10:55:27 -07:00
utilities Add function 'GetInfoLogList()' 2015-08-05 16:16:46 -07:00
.arcconfig Integrate Jenkins with Phabricator 2015-04-07 11:56:29 -07:00
.clang-format A script that automatically reformat affected lines 2014-01-14 12:21:24 -08:00
.gitignore Windows Port from Microsoft 2015-07-01 16:13:56 -07:00
.travis.yml Another attempt at adding the Java API and tests to the travis build 2015-07-28 11:52:11 +01:00
appveyor.yml Add auto-build manifest for appveyor 2015-08-07 15:37:46 -07:00
AUTHORS Add AUTHORS file. Fix #203 2014-09-29 10:52:18 -07:00
CMakeLists.txt Add util/delete_scheduler_impl.cc to CMakeLists.txt 2015-08-05 20:56:04 -07:00
CONTRIBUTING.md facebook accounts are not required for CLA signers 2014-07-08 05:57:54 -04:00
DUMP_FORMAT.md First version of rocksdb_dump and rocksdb_undump. 2015-06-19 16:24:36 -07:00
HISTORY.md In HISTORY.md Switch unreleased notes to 3.13 2015-08-06 14:02:44 -07:00
INSTALL.md Commit both PR and internal code review changes 2015-07-07 16:58:20 -07:00
LICENSE Fix copyright year 2014-03-12 12:06:58 -07:00
Makefile valgrind_check to exit on test failures 2015-08-05 17:46:09 -07:00
PATENTS Update Patent Grant. 2015-04-13 10:33:43 +01:00
README.md Replaced "built on on earlier work" by "built on earlier work" in README.md 2014-09-17 01:16:17 -07:00
ROCKSDB_LITE.md Optimistic Transactions 2015-05-29 14:36:35 -07:00
src.mk simple ManagedSnapshot wrapper 2015-08-06 17:59:05 -07:00
thirdparty.inc Conditional use of third-party libraries 2015-07-09 14:42:41 -07:00
USERS.md Add Yahoo's blog post about Sherpa to USERS.md 2015-06-09 12:55:58 -07:00
Vagrantfile RocksDB on FreeBSD support 2015-02-26 15:19:17 -08:00
WINDOWS_PORT.md Commit both PR and internal code review changes 2015-07-07 16:58:20 -07:00

RocksDB: A Persistent Key-Value Store for Flash and RAM Storage

Build Status

RocksDB is developed and maintained by Facebook Database Engineering Team. It is built on earlier work on LevelDB by Sanjay Ghemawat (sanjay@google.com) and Jeff Dean (jeff@google.com)

This code is a library that forms the core building block for a fast key value server, especially suited for storing data on flash drives. It has a Log-Structured-Merge-Database (LSM) design with flexible tradeoffs between Write-Amplification-Factor (WAF), Read-Amplification-Factor (RAF) and Space-Amplification-Factor (SAF). It has multi-threaded compactions, making it specially suitable for storing multiple terabytes of data in a single database.

Start with example usage here: https://github.com/facebook/rocksdb/tree/master/examples

See the github wiki for more explanation.

The public interface is in include/. Callers should not include or rely on the details of any other header files in this package. Those internal APIs may be changed without warning.

Design discussions are conducted in https://www.facebook.com/groups/rocksdb.dev/