A library that provides an embeddable, persistent key-value store for fast storage.
Go to file
Alexey Maykov ee95cae9a4 Modifed the LRU cache eviction code so that it doesn't evict blocks which have exteranl references
Summary:
Currently, blocks which have more than one reference (ie referenced by something other than cache itself) are evicted from cache. This doesn't make much sense:
- blocks are still in RAM, so the RAM usage reported by the cache is incorrect
- if the same block is needed by another iterator, it will be loaded and decompressed again

This diff changes the reference counting scheme a bit. Previously, if the cache contained the block, this was accounted for in its refcount. After this change, the refcount is only used to track external references. There is a boolean flag which indicates whether or not the block is contained in the cache.
This diff also changes how LRU list is used. Previously, both hashtable and the LRU list contained all blocks. After this change, the LRU list contains blocks with the refcount==0, ie those which can be evicted from the cache.

Note that this change still allows for cache to grow beyond its capacity. This happens when all blocks are pinned (ie refcount>0). This is consistent with the current behavior. The cache's insert function never fails. I spent lots of time trying to make table_reader and other places work with the insert which might failed. It turned out to be pretty hard. It might really destabilize some customers, so finally, I decided against doing this.

table_cache_remove_scan_count_limit option will be unneeded after this change, but I will remove it in the following diff, if this one gets approved

Test Plan: Ran tests, made sure they pass

Reviewers: sdong, ljin

Differential Revision: https://reviews.facebook.net/D25503
2014-12-10 22:28:53 -08:00
build_tools Turn on -Wshorten-64-to-32 and fix all the errors 2014-11-11 16:47:22 -05:00
coverage Fix coverage script 2014-11-03 14:53:00 -08:00
db VersionBuilder to use unordered set and map to store added and deleted files 2014-12-10 18:53:30 -08:00
doc Remove seek compaction 2014-06-20 10:23:02 +02:00
examples Add rocksdb::ToString() to address cases where std::to_string is not available. 2014-11-24 20:44:49 -08:00
hdfs Replace exception by abort() in dummy HdfsEnv implementation. 2014-12-05 13:30:57 -08:00
helpers/memenv Turn -Wshadow back on 2014-11-06 11:14:28 -08:00
include Fix calculation of max_total_wal_size in db_options_.max_total_wal_size == 0 case 2014-12-08 15:26:35 -08:00
java Merge pull request #422 from fyrz/RocksJava-Quality-Improvements 2014-12-05 21:38:05 -08:00
linters Fix linters 2014-12-02 13:53:39 -05:00
port Add rocksdb::ToString() to address cases where std::to_string is not available. 2014-11-24 20:44:49 -08:00
table Enforce write buffer memory limit across column families 2014-12-02 12:09:20 -08:00
third-party/rapidjson Fix a rapidjson compile error in mac. 2014-06-23 17:09:24 -06:00
tools add range scan test to benchmark script 2014-12-10 13:04:58 -08:00
util Modifed the LRU cache eviction code so that it doesn't evict blocks which have exteranl references 2014-12-10 22:28:53 -08:00
utilities Clean up StringSplit 2014-11-21 11:05:28 -05:00
.arcconfig Improve/fix bugs for the cpp linter 2014-02-13 17:48:11 -08:00
.clang-format A script that automatically reformat affected lines 2014-01-14 12:21:24 -08:00
.gitignore Ignore IntelliJ idea project files and ignore java/out folder 2014-10-21 15:52:27 +01:00
.travis.yml Don't parallelize the build in travis 2014-11-14 16:23:56 -08:00
AUTHORS Add AUTHORS file. Fix #203 2014-09-29 10:52:18 -07:00
CONTRIBUTING.md facebook accounts are not required for CLA signers 2014-07-08 05:57:54 -04:00
HISTORY.md Modifed the LRU cache eviction code so that it doesn't evict blocks which have exteranl references 2014-12-10 22:28:53 -08:00
INSTALL.md Update INSTALL.md 2014-10-01 11:15:42 -07:00
LICENSE Fix copyright year 2014-03-12 12:06:58 -07:00
Makefile Add -fno-exceptions flag to ROCKSDB_LITE. 2014-12-05 21:34:20 -08:00
PATENTS Fix the patent format 2013-10-16 15:37:32 -07:00
README.md Replaced "built on on earlier work" by "built on earlier work" in README.md 2014-09-17 01:16:17 -07:00
ROCKSDB_LITE.md RocksDBLite 2014-04-15 13:39:26 -07:00
Vagrantfile Package generation for Ubuntu and CentOS 2014-09-29 16:09:46 -07:00

RocksDB: A Persistent Key-Value Store for Flash and RAM Storage

Build Status

RocksDB is developed and maintained by Facebook Database Engineering Team. It is built on earlier work on LevelDB by Sanjay Ghemawat (sanjay@google.com) and Jeff Dean (jeff@google.com)

This code is a library that forms the core building block for a fast key value server, especially suited for storing data on flash drives. It has a Log-Structured-Merge-Database (LSM) design with flexible tradeoffs between Write-Amplification-Factor (WAF), Read-Amplification-Factor (RAF) and Space-Amplification-Factor (SAF). It has multi-threaded compactions, making it specially suitable for storing multiple terabytes of data in a single database.

Start with example usage here: https://github.com/facebook/rocksdb/tree/master/examples

See the github wiki for more explanation.

The public interface is in include/. Callers should not include or rely on the details of any other header files in this package. Those internal APIs may be changed without warning.

Design discussions are conducted in https://www.facebook.com/groups/rocksdb.dev/