99ec2412e5
Summary: This makes run_flash_bench.sh configurable. Previously it was hardwired for 1B keys and tests ran for 12 hours each. That kept me from using it. This makes it configuable, adds more tests, makes the duration per-test configurable and refactors the test scripts. Adds the seekrandomwhilemerging test to db_bench which is the same as seekrandomwhilewriting except the writer thread does Merge rather than Put. Forces the stall-time column in compaction IO stats to use a fixed format (H:M:S) which makes it easier to scrape and parse. Also adds an option to AppendHumanMicros to force a fixed format. Sometimes automation and humans want different format. Calls thread->stats.AddBytes(bytes); in db_bench for more tests to get the MB/sec summary stats in the output at test end. Adds the average ingest rate to compaction IO stats. Output now looks like: https://gist.github.com/mdcallag/2bd64d18be1b93adc494 More information on the benchmark output is at https://gist.github.com/mdcallag/db43a58bd5ac624f01e1 For benchmark.sh changes default RocksDB configuration to reduce stalls: * min_level_to_compress from 2 to 3 * hard_rate_limit from 2 to 3 * max_grandparent_overlap_factor and max_bytes_for_level_multiplier from 10 to 8 * L0 file count triggers from 4,8,12 to 4,12,20 for (start,stall,stop) Task ID: #6596829 Blame Rev: Test Plan: run tools/run_flash_bench.sh Revert Plan: Database Impact: Memcache Impact: Other Notes: EImportant: - begin *PUBLIC* platform impact section - Bugzilla: # - end platform impact - Reviewers: igor Reviewed By: igor Subscribers: dhruba Differential Revision: https://reviews.facebook.net/D36075
223 lines
7.6 KiB
Bash
Executable File
223 lines
7.6 KiB
Bash
Executable File
#!/bin/bash
|
|
# REQUIRE: benchmark.sh exists in the current directory
|
|
# After execution of this script, log files are generated in $output_dir.
|
|
# report.txt provides a high level statistics
|
|
|
|
# This should be run from the parent of the tools directory. The command line is:
|
|
# [$env_vars] tools/run_flash_bench.sh [list-of-threads]
|
|
#
|
|
# This runs a sequence of tests in the following sequence:
|
|
# step 1) load - bulkload, compact, fillseq, overwrite
|
|
# step 2) read-only for each number of threads
|
|
# step 3) read-write for each number of threads
|
|
# step 4) merge for each number of threads
|
|
#
|
|
# The list of threads is optional and when not set is equivalent to "24".
|
|
# Were list-of-threads specified as "1 2 4" then the tests in steps 2, 3 and
|
|
# 4 above would be repeated for 1, 2 and 4 threads. The tests in step 1 are
|
|
# only run for 1 thread.
|
|
|
|
# Test output is written to $OUTPUT_DIR, currently /tmp/output. The performance
|
|
# summary is in $OUTPUT_DIR/report.txt. There is one file in $OUTPUT_DIR per
|
|
# test and the tests are listed below.
|
|
#
|
|
# The environment variables are also optional. The variables are:
|
|
# NKEYS - number of key/value pairs to load
|
|
# NWRITESPERSEC - the writes/second rate limit for the *whilewriting* tests.
|
|
# If this is too large then the non-writer threads can get
|
|
# starved.
|
|
# NSECONDS - number of seconds for which to run each test in steps 2,
|
|
# 3 and 4. There are currently 15 tests in those steps and
|
|
# they are repeated for each entry in list-of-threads so
|
|
# this variable lets you control the total duration to
|
|
# finish the benchmark.
|
|
# RANGE_LIMIT - the number of rows to read per range query for tests that
|
|
# do range queries.
|
|
# VAL_SIZE - the length of the value in the key/value pairs loaded.
|
|
# You can estimate the size of the test database from this,
|
|
# NKEYS and the compression rate (--compression_ratio) set
|
|
# in tools/benchmark.sh
|
|
# CACHE_BYTES - the size of the RocksDB block cache in bytes
|
|
# DATA_DIR - directory in which to create database files
|
|
# LOG_DIR - directory in which to create WAL files, may be the same
|
|
# as DATA_DIR
|
|
# DO_SETUP - when set to 0 then a backup of the database is copied from
|
|
# $DATA_DIR.bak to $DATA_DIR and the load tests from step 1
|
|
# The WAL directory is also copied from a backup if
|
|
# DATA_DIR != LOG_DIR. This allows tests from steps 2, 3, 4
|
|
# to be repeated faster.
|
|
# SAVE_SETUP - saves a copy of the database at the end of step 1 to
|
|
# $DATA_DIR.bak. When LOG_DIR != DATA_DIR then it is copied
|
|
# to $LOG_DIR.bak.
|
|
|
|
# Size constants
|
|
K=1024
|
|
M=$((1024 * K))
|
|
G=$((1024 * M))
|
|
|
|
num_keys=${NKEYS:-$((1 * G))}
|
|
wps=${NWRITESPERSEC:-$((10 * K))}
|
|
duration=${NSECONDS:-$((60 * 60))}
|
|
nps=${RANGE_LIMIT:-10}
|
|
vs=${VAL_SIZE:-400}
|
|
cs=${CACHE_BYTES:-$(( 1 * G ))}
|
|
|
|
# If no command line arguments then run for 24 threads.
|
|
if [[ $# -eq 0 ]]; then
|
|
nthreads=( 24 )
|
|
else
|
|
nthreads=( "$@" )
|
|
fi
|
|
|
|
for num_thr in "${nthreads[@]}" ; do
|
|
echo Will run for $num_thr threads
|
|
done
|
|
|
|
# Update these parameters before execution !!!
|
|
db_dir=${DATA_DIR:-"/tmp/rocksdb/"}
|
|
wal_dir=${LOG_DIR:-"/tmp/rocksdb/"}
|
|
|
|
do_setup=${DO_SETUP:-1}
|
|
save_setup=${SAVE_SETUP:-0}
|
|
|
|
output_dir="/tmp/output"
|
|
|
|
ARGS="\
|
|
OUTPUT_DIR=$output_dir \
|
|
NUM_KEYS=$num_keys \
|
|
DB_DIR=$db_dir \
|
|
WAL_DIR=$wal_dir \
|
|
VALUE_SIZE=$vs \
|
|
CACHE_SIZE=$cs"
|
|
|
|
mkdir -p $output_dir
|
|
echo -e "ops/sec\tmb/sec\tL0_GB\tSum_GB\tW-Amp\tW-MB/s\tusec/op\tp50\tp75\tp99\tUptime\tStall-time\tStall%\tTest" \
|
|
> $output_dir/report.txt
|
|
|
|
# Notes on test sequence:
|
|
# step 1) Setup database via sequential fill followed by overwrite to fragment it.
|
|
# Done without setting DURATION to make sure that overwrite does $num_keys writes
|
|
# step 2) read-only tests for all levels of concurrency requested
|
|
# step 3) non read-only tests for all levels of concurrency requested
|
|
# step 4) merge tests for all levels of concurrency requested. These must come last.
|
|
|
|
###### Setup the database
|
|
|
|
if [[ $do_setup != 0 ]]; then
|
|
echo Doing setup
|
|
|
|
# Test 1: bulk load
|
|
env $ARGS ./tools/benchmark.sh bulkload
|
|
|
|
# Test 2: sequential fill
|
|
env $ARGS ./tools/benchmark.sh fillseq
|
|
|
|
# Test 3: single-threaded overwrite
|
|
env $ARGS NUM_THREADS=1 DB_BENCH_NO_SYNC=1 ./tools/benchmark.sh overwrite
|
|
|
|
else
|
|
echo Restoring from backup
|
|
|
|
rm -rf $db_dir
|
|
|
|
if [ ! -d ${db_dir}.bak ]; then
|
|
echo Database backup does not exist at ${db_dir}.bak
|
|
exit -1
|
|
fi
|
|
|
|
echo Restore database from ${db_dir}.bak
|
|
cp -p -r ${db_dir}.bak $db_dir
|
|
|
|
if [[ $db_dir != $wal_dir ]]; then
|
|
rm -rf $wal_dir
|
|
|
|
if [ ! -d ${wal_dir}.bak ]; then
|
|
echo WAL backup does not exist at ${wal_dir}.bak
|
|
exit -1
|
|
fi
|
|
|
|
echo Restore WAL from ${wal_dir}.bak
|
|
cp -p -r ${wal_dir}.bak $wal_dir
|
|
fi
|
|
fi
|
|
|
|
if [[ $save_setup != 0 ]]; then
|
|
echo Save database to ${db_dir}.bak
|
|
cp -p -r $db_dir ${db_dir}.bak
|
|
|
|
if [[ $db_dir != $wal_dir ]]; then
|
|
echo Save WAL to ${wal_dir}.bak
|
|
cp -p -r $wal_dir ${wal_dir}.bak
|
|
fi
|
|
fi
|
|
|
|
###### Read-only tests
|
|
|
|
for num_thr in "${nthreads[@]}" ; do
|
|
# Test 4: random read
|
|
env $ARGS DURATION=$duration NUM_THREADS=$num_thr ./tools/benchmark.sh readrandom
|
|
|
|
# Test 5: random range scans
|
|
env $ARGS DURATION=$duration NUM_THREADS=$num_thr NUM_NEXTS_PER_SEEK=$nps \
|
|
./tools/benchmark.sh fwdrange
|
|
|
|
# Test 6: random reverse range scans
|
|
env $ARGS DURATION=$duration NUM_THREADS=$num_thr NUM_NEXTS_PER_SEEK=$nps \
|
|
./tools/benchmark.sh revrange
|
|
done
|
|
|
|
###### Non read-only tests
|
|
|
|
for num_thr in "${nthreads[@]}" ; do
|
|
# Test 7: overwrite with sync=0
|
|
env $ARGS DURATION=$duration NUM_THREADS=$num_thr DB_BENCH_NO_SYNC=1 \
|
|
./tools/benchmark.sh overwrite
|
|
|
|
# Test 8: overwrite with sync=1
|
|
env $ARGS DURATION=$duration NUM_THREADS=$num_thr ./tools/benchmark.sh overwrite
|
|
|
|
# Test 9: random update with sync=0
|
|
env $ARGS DURATION=$duration NUM_THREADS=$num_thr DB_BENCH_NO_SYNC=1 \
|
|
./tools/benchmark.sh updaterandom
|
|
|
|
# Test 10: random update with sync=1
|
|
env $ARGS DURATION=$duration NUM_THREADS=$num_thr ./tools/benchmark.sh updaterandom
|
|
|
|
# Test 11: random read while writing
|
|
env $ARGS DURATION=$duration NUM_THREADS=$num_thr WRITES_PER_SECOND=$wps \
|
|
./tools/benchmark.sh readwhilewriting
|
|
|
|
# Test 12: range scan while writing
|
|
env $ARGS DURATION=$duration NUM_THREADS=$num_thr WRITES_PER_SECOND=$wps \
|
|
NUM_NEXTS_PER_SEEK=$nps ./tools/benchmark.sh fwdrangewhilewriting
|
|
|
|
# Test 13: reverse range scan while writing
|
|
env $ARGS DURATION=$duration NUM_THREADS=$num_thr WRITES_PER_SECOND=$wps \
|
|
NUM_NEXTS_PER_SEEK=$nps ./tools/benchmark.sh revrangewhilewriting
|
|
done
|
|
|
|
###### Merge tests
|
|
|
|
for num_thr in "${nthreads[@]}" ; do
|
|
# Test 14: random merge with sync=0
|
|
env $ARGS DURATION=$duration NUM_THREADS=$num_thr DB_BENCH_NO_SYNC=1 \
|
|
./tools/benchmark.sh mergerandom
|
|
|
|
# Test 15: random merge with sync=1
|
|
env $ARGS DURATION=$duration NUM_THREADS=$num_thr ./tools/benchmark.sh mergerandom
|
|
|
|
# Test 16: random read while merging
|
|
env $ARGS DURATION=$duration NUM_THREADS=$num_thr WRITES_PER_SECOND=$wps \
|
|
./tools/benchmark.sh readwhilemerging
|
|
|
|
# Test 17: range scan while merging
|
|
env $ARGS DURATION=$duration NUM_THREADS=$num_thr WRITES_PER_SECOND=$wps \
|
|
NUM_NEXTS_PER_SEEK=$nps ./tools/benchmark.sh fwdrangewhilemerging
|
|
|
|
# Test 18: reverse range scan while merging
|
|
env $ARGS DURATION=$duration NUM_THREADS=$num_thr WRITES_PER_SECOND=$wps \
|
|
NUM_NEXTS_PER_SEEK=$nps ./tools/benchmark.sh revrangewhilemerging
|
|
done
|
|
|
|
cat $output_dir/report.txt
|