rocksdb/java/understanding_options.md
Alan Paxton 8d615a2b1d New-style blob option bindings, Java option getter and improve/fix option parsing (#8999)
Summary:
Implementation of https://github.com/facebook/rocksdb/issues/8221, plus/including extension of Java options API to allow the get() of options from RocksDB. The extension allows more comprehensive testing of options at the Java side, by validating that the options are set at the C++ side.

Variations on methods:
MutableColumnFamilyOptions.MutableColumnFamilyOptionsBuilder getOptions()
MutableDBOptions.MutableDBOptionsBuilder getDBOptions()

retrieve the options via RocksDB C++ interfaces, and parse the resulting string into one of the Java-style option objects.

This necessitated generalising the parsing of option strings in Java, which now parses the full range of option strings returned by the C++ interface, rather than a useful subset. This necessitates the list-separator being changed to :(colon) from , (comma).

Pull Request resolved: https://github.com/facebook/rocksdb/pull/8999

Reviewed By: jay-zhuang

Differential Revision: D31655487

Pulled By: ltamasi

fbshipit-source-id: c38e98145c81c61dc38238b0df580db176ce4efd
2021-10-19 09:21:52 -07:00

5.2 KiB

How RocksDB Options and their Java Wrappers Work

Options in RocksDB come in many different flavours. This is an attempt at a taxonomy and explanation.

RocksDB Options

Initially, I believe, RocksDB had only database options. I don't know if any of these were mutable. Column families came later. Read on to understand the terminology.

So to begin, one sets up a collection of options and starts/creates a database with these options. That's a useful way to think about it, because from a Java point-of-view (and I didn't realise this initially and got very confused), despite making native calls to C++, the APIs are just manipulating a native C++ configuration object. This object is just a record of configuration, and it must later be passed to the database (at create or open time) in order to apply the options.

Database versus Column Family

The concept of the column family or CF is widespread within RocksDB. I think of it as a data namespace, but conveniently transactions can operate across these namespaces. The concept of a default column family exists, and when operations do not refer to a particular CF, it refers to the default.

We raise this w.r.t. options because many options, perhaps most that users encounter, are column family options. That is to say they apply individually to a particular column family, or to the default column family. Crucially also, many/most/all of these same options are exposed as database options and then apply as the default for column families which do not have the option set explicitly. Obviously some database options are naturally database-wide; they apply to the operation of the database and don't make any sense applied to a column family.

Mutability

There are 2 kinds of options

  • Mutable options
  • Immutable options. We name these in contrast to the mutable ones, but they are usually referred to unqualified.

Mutable options are those which can be changed on a running RocksDB instance. Immutable options can only be configured prior to the start of a database. Of course, we can configure the immutable options at this time too; The entirety of options is a strict superset of the mutable options.

Mutable options (whether column-family specific or database-wide) are manipulated at runtime with builders, so we have MutableDBOptions.MutableDBOptionsBuilder and MutableColumnFamilyOptions.MutableColumnFamilyOptionsBuilder which share tooling classes/hierarchy and maintain and manipulate the relevant options as a (key,value) map.

Mutable options are then passed using setOptions() and setDBOptions() methods on the live RocksDB, and then take effect immediately (depending on the semantics of the option) on the database.

Advanced

There are 2 classes of options

  • Advanced options
  • Non-advanced options

It's not clear to me what the conceptual distinction is between advanced and not. However, the Java code takes care to reflect it from the underlying C++.

This leads to 2 separate type hierarchies within column family options, one for each class of options. The kinds are represented by where the options appear in their hierarchy.

interface ColumnFamilyOptionsInterface<T extends ColumnFamilyOptionsInterface<T>>
    extends AdvancedColumnFamilyOptionsInterface<T>
interface MutableColumnFamilyOptionsInterface<T extends MutableColumnFamilyOptionsInterface<T>>
    extends AdvancedMutableColumnFamilyOptionsInterface<T>

And then there is ultimately a single concrete implementation class for CF options:

class ColumnFamilyOptions extends RocksObject
    implements ColumnFamilyOptionsInterface<ColumnFamilyOptions>,
    MutableColumnFamilyOptionsInterface<ColumnFamilyOptions>

as there is a single concrete implementation class for DB options:

class DBOptions extends RocksObject
    implements DBOptionsInterface<DBOptions>,
    MutableDBOptionsInterface<DBOptions>

Interestingly DBOptionsInterface doesn't extend MutableDBOptionsInterface, if only in order to disrupt our belief in consistent basic laws of the Universe.

Startup/Creation Options

class Options extends RocksObject
    implements DBOptionsInterface<Options>,
    MutableDBOptionsInterface<Options>,
    ColumnFamilyOptionsInterface<Options>,
    MutableColumnFamilyOptionsInterface<Options>

Example - Blob Options

The enable_blob_files and min_blob_size options are per-column-family, and are mutable. The options also appear in the unqualified database options. So by initial configuration, we can set up a RocksDB database where for every (key,value) with a value of size at least min_blob_size, the value is written (indirected) to a blob file. Blobs may share a blob file, subject to the configuration values set. Later, using the MutableColumnFamilyOptionsInterface of the ColumnFamilyOptions, we can choose to turn this off (enable_blob_files=false) , or alter the min_blob_size for the default column family, or any other column family. It seems to me that we cannot, though, mutate the column family options for all column families using the setOptions() mechanism, either for all existing column families or for all future column families; but maybe we can do the latter on a re-`open()/create()'