BackupEngine computes table checksums only once if db session ids are available (#7110)
Summary: BackupEngine requires computing table checksums twice when backing up table files to the `shared_checksum` directory. The repeated computation can be avoided by utilizing the db session id stored as a part of the table properties. Filenames of table files in the `shared_checksum` directory depend on the following conditions: 1. the naming scheme is `kOptionalChecksumAndDbSessionId`, 2. `db_session_id` is not empty, 3. checksum is available in the DB manifest. If 1,2,3 are satisfied, then the filenames will be of the form `<file_number>_<checksum>_<db_session_id>.sst`. If 1,2 are satisfied, then the filenames will be of the form `<file_number>_<db_session_id>.sst`. In all other cases, the filenames are of the form `<file_number>_<checksum>_<size>.sst`. Additionally, if `kOptionalChecksumAndDbSessionId` is used (and not falling back to `kChecksumAndFileSize`), the `<checksum>` appeared in the filenames is hexadecimally encoded, instead of being plain `uint32_t` value. Pull Request resolved: https://github.com/facebook/rocksdb/pull/7110 Test Plan: backupable_db_test and manual tests. Reviewed By: ajkr Differential Revision: D22508992 Pulled By: gg814 fbshipit-source-id: 5669f0ea9ad5a097f69f6d87aca4abba15032389
This commit is contained in:
parent
0f487cc35f
commit
b923dc720b
@ -25,8 +25,8 @@
|
||||
### New Features
|
||||
* DB identity (`db_id`) and DB session identity (`db_session_id`) are added to table properties and stored in SST files. SST files generated from SstFileWriter and Repairer have DB identity “SST Writer” and “DB Repairer”, respectively. Their DB session IDs are generated in the same way as `DB::GetDbSessionId`. The session ID for SstFileWriter (resp., Repairer) resets every time `SstFileWriter::Open` (resp., `Repairer::Run`) is called.
|
||||
* Added experimental option BlockBasedTableOptions::optimize_filters_for_memory for reducing allocated memory size of Bloom filters (~10% savings with Jemalloc) while preserving the same general accuracy. To have an effect, the option requires format_version=5 and malloc_usable_size. Enabling this option is forward and backward compatible with existing format_version=5.
|
||||
* `BackupTableNameOption BackupableDBOptions::share_files_with_checksum_naming` is added, where `BackupTableNameOption` is an `enum` type with two enumerators `kChecksumAndFileSize` and `kChecksumAndFileSize`. By default, `BackupTableNameOption BackupableDBOptions::share_files_with_checksum_naming` is set to `kChecksumAndDbSessionId`. In this default case, backup table filenames are of the form `<file_number>_<crc32c>_<db_session_id>.sst` as opposed to `<file_number>_<crc32c>_<file_size>.sst`. The new default behavior fixes the backup file name collision problem, which might be possible at large scale, but the option `kChecksumAndFileSize` is added to allow use of old naming in case it is needed. This default behavior change is not an upgrade issue, because previous versions of RocksDB can read, restore, and delete backups using new names, and it's OK for a backup directory to use a mixture of table file naming schemes. Note that `share_files_with_checksum_naming` comes into effect only when both `share_files_with_checksum` and `share_table_files` are true.
|
||||
* Added auto resume function to automatically recover the DB from background Retryable IO Error. When retryable IOError happens during flush and WAL write, the error is mapped to Hard Error and DB will be in read mode. When retryable IO Error happens during compaction, the error will be mapped to Soft Error. DB is still in write/read mode. Autoresume function will create a thread for a DB to call DB->ResumeImpl() to try the recover for Retryable IO Error during flush and WAL write. Compaction will be rescheduled by itself if retryable IO Error happens. Auto resume may also cause other Retryable IO Error during the recovery, so the recovery will fail. Retry the auto resume may solve the issue, so we use max_bgerror_resume_count to decide how many resume cycles will be tried in total. If it is <=0, auto resume retryable IO Error is disabled. Default is INT_MAX, which will lead to a infinit auto resume. bgerror_resume_retry_interval decides the time interval between two auto resumes.
|
||||
* `BackupTableNameOption BackupableDBOptions::share_files_with_checksum_naming` is added, where `BackupTableNameOption` is an `enum` type with two enumerators `kChecksumAndFileSize` and `kOptionalChecksumAndDbSessionId`. By default, `BackupableDBOptions::share_files_with_checksum_naming` is set to `kOptionalChecksumAndDbSessionId`. In the default case, backup table filenames generated by this version of RocksDB are of the form either `<file_number>_<crc32c>_<db_session_id>.sst` or `<file_number>_<db_session_id>.sst` as opposed to `<file_number>_<crc32c>_<file_size>.sst`. Specifically, table filenames are of the form `<file_number>_<crc32c>_<db_session_id>.sst` if `DBOptions::file_checksum_gen_factory` is set to `GetFileChecksumGenCrc32cFactory()`. Futhermore, the checksum value `<crc32c>` appeared in the filenames is hexadecimal-encoded, instead of being decimal-encoded `uint32_t` value. If `DBOptions::file_checksum_gen_factory` is `nullptr`, the table filenames are of the form `<file_number>_<db_session_id>.sst`. The new default behavior fixes the backup file name collision problem, which might be possible at large scale, but the option `kChecksumAndFileSize` is added to allow use of old naming in case it is needed. Moreover, for table files generated prior to this version of RocksDB, using `kOptionalChecksumAndDbSessionId` will fall back on `kChecksumAndFileSize`. In these cases, the checksum value `<crc32c>` in the filenames `<file_number>_<crc32c>_<file_size>.sst` is decimal-encoded `uint32_t` value as before. This default behavior change is not an upgrade issue, because previous versions of RocksDB can read, restore, and delete backups using new names, and it's OK for a backup directory to use a mixture of table file naming schemes. Note that `share_files_with_checksum_naming` comes into effect only when both `share_files_with_checksum` and `share_table_files` are true.
|
||||
* Added auto resume function to automatically recover the DB from background Retryable IO Error. When retryable IOError happens during flush and WAL write, the error is mapped to Hard Error and DB will be in read mode. When retryable IO Error happens during compaction, the error will be mapped to Soft Error. DB is still in write/read mode. Autoresume function will create a thread for a DB to call DB->ResumeImpl() to try the recover for Retryable IO Error during flush and WAL write. Compaction will be rescheduled by itself if retryable IO Error happens. Auto resume may also cause other Retryable IO Error during the recovery, so the recovery will fail. Retry the auto resume may solve the issue, so we use max_bgerror_resume_count to decide how many resume cycles will be tried in total. If it is <=0, auto resume retryable IO Error is disabled. Default is INT_MAX, which will lead to a infinit auto resume. bgerror_resume_retry_interval decides the time interval between two auto resumes.
|
||||
|
||||
### Bug Fixes
|
||||
* Fail recovery and report once hitting a physical log record checksum mismatch, while reading MANIFEST. RocksDB should not continue processing the MANIFEST any further.
|
||||
@ -34,6 +34,7 @@
|
||||
### Performance Improvements
|
||||
* Eliminate key copies for internal comparisons while accessing ingested block-based tables.
|
||||
* Reduce key comparisons during random access in all block-based tables.
|
||||
* BackupEngine avoids unnecessary repeated checksum computation for backing up a table file to the `shared_checksum` directory when using `kOptionalChecksumAndDbSessionId`, except on SST files generated before this version of RocksDB, which fall back on using `kChecksumAndFileSize`.
|
||||
|
||||
## 6.11 (6/12/2020)
|
||||
### Bug Fixes
|
||||
|
@ -34,12 +34,17 @@ constexpr char kBackupFileChecksumFuncName[] = "crc32c";
|
||||
// directory (i.e., both share_table_files and share_files_with_checksum
|
||||
// are true).
|
||||
enum BackupTableNameOption : unsigned char {
|
||||
// Backup SST filenames consist of file_number, crc32c, db_session_id
|
||||
// Backup SST filenames are <file_number>_<crc32c>_<file_size>.sst
|
||||
// where <crc32c> is uint32_t.
|
||||
kChecksumAndFileSize = 0,
|
||||
// Backup SST filenames consist of file_number, crc32c, file_size
|
||||
// When there is no `db_session_id` available in the table file, we use
|
||||
// `file_size` as a fallback.
|
||||
kChecksumAndDbSessionId = 1
|
||||
// Backup SST filenames are <file_number>_<crc32c>_<db_session_id>.sst
|
||||
// where <crc32c> is hexidecimally encoded.
|
||||
// When DBOptions::file_checksum_gen_factory is not set to
|
||||
// GetFileChecksumGenCrc32cFactory(), the filenames will be
|
||||
// <file_number>_<db_session_id>.sst
|
||||
// When there are no db session ids available in the table file, this
|
||||
// option will use kChecksumAndFileSize as a fallback.
|
||||
kOptionalChecksumAndDbSessionId = 1
|
||||
};
|
||||
|
||||
struct BackupableDBOptions {
|
||||
@ -111,9 +116,10 @@ struct BackupableDBOptions {
|
||||
// (file name, crc32c, db session id or file length)
|
||||
//
|
||||
// Note: If this option is set to true, we recommend setting
|
||||
// share_files_with_checksum_naming to kChecksumAndDbSessionId, which is also
|
||||
// our default option. Otherwise, there is a non-negligible chance of filename
|
||||
// collision when sharing tables in shared_checksum among several DBs.
|
||||
// share_files_with_checksum_naming to kOptionalChecksumAndDbSessionId, which
|
||||
// is also our default option. Otherwise, there is a non-negligible chance of
|
||||
// filename collision when sharing tables in shared_checksum among several
|
||||
// DBs.
|
||||
// *turn it on only if you know what you're doing*
|
||||
//
|
||||
// Default: false
|
||||
@ -141,17 +147,17 @@ struct BackupableDBOptions {
|
||||
int max_valid_backups_to_open;
|
||||
|
||||
// Naming option for share_files_with_checksum table files. This option
|
||||
// can be set to kChecksumAndFileSize or kChecksumAndDbSessionId.
|
||||
// can be set to kChecksumAndFileSize or kOptionalChecksumAndDbSessionId.
|
||||
// kChecksumAndFileSize is susceptible to collision as file size is not a
|
||||
// good source of entroy.
|
||||
// kChecksumAndDbSessionId is immune to collision.
|
||||
// kOptionalChecksumAndDbSessionId is immune to collision.
|
||||
//
|
||||
// Modifying this option cannot introduce a downgrade compatibility issue
|
||||
// because RocksDB can read, restore, and delete backups using different file
|
||||
// names, and it's OK for a backup directory to use a mixture of table file
|
||||
// naming schemes.
|
||||
//
|
||||
// Default: kChecksumAndDbSessionId
|
||||
// Default: kOptionalChecksumAndDbSessionId
|
||||
//
|
||||
// Note: This option comes into effect only if both share_files_with_checksum
|
||||
// and share_table_files are true. In the cases of old table files where no
|
||||
@ -170,7 +176,7 @@ struct BackupableDBOptions {
|
||||
uint64_t _callback_trigger_interval_size = 4 * 1024 * 1024,
|
||||
int _max_valid_backups_to_open = INT_MAX,
|
||||
BackupTableNameOption _share_files_with_checksum_naming =
|
||||
kChecksumAndDbSessionId)
|
||||
kOptionalChecksumAndDbSessionId)
|
||||
: backup_dir(_backup_dir),
|
||||
backup_env(_backup_env),
|
||||
share_table_files(_share_table_files),
|
||||
|
@ -46,6 +46,22 @@
|
||||
|
||||
namespace ROCKSDB_NAMESPACE {
|
||||
|
||||
namespace {
|
||||
inline uint32_t ChecksumHexToInt32(const std::string& checksum_hex) {
|
||||
std::string checksum_str;
|
||||
Slice(checksum_hex).DecodeHex(&checksum_str);
|
||||
return EndianSwapValue(DecodeFixed32(checksum_str.c_str()));
|
||||
}
|
||||
inline std::string ChecksumStrToHex(const std::string& checksum_str) {
|
||||
return Slice(checksum_str).ToString(true);
|
||||
}
|
||||
inline std::string ChecksumInt32ToHex(const uint32_t& checksum_value) {
|
||||
std::string checksum_str;
|
||||
PutFixed32(&checksum_str, EndianSwapValue(checksum_value));
|
||||
return ChecksumStrToHex(checksum_str);
|
||||
}
|
||||
} // namespace
|
||||
|
||||
void BackupStatistics::IncrementNumberSuccessBackup() {
|
||||
number_success_backup++;
|
||||
}
|
||||
@ -149,12 +165,12 @@ class BackupEngineImpl : public BackupEngine {
|
||||
std::unordered_map<std::string, uint64_t>* result);
|
||||
|
||||
struct FileInfo {
|
||||
FileInfo(const std::string& fname, uint64_t sz, uint32_t checksum,
|
||||
FileInfo(const std::string& fname, uint64_t sz, const std::string& checksum,
|
||||
const std::string& id = "", const std::string& sid = "")
|
||||
: refs(0),
|
||||
filename(fname),
|
||||
size(sz),
|
||||
checksum_value(checksum),
|
||||
checksum_hex(checksum),
|
||||
db_id(id),
|
||||
db_session_id(sid) {}
|
||||
|
||||
@ -164,11 +180,13 @@ class BackupEngineImpl : public BackupEngine {
|
||||
int refs;
|
||||
const std::string filename;
|
||||
const uint64_t size;
|
||||
const uint32_t checksum_value;
|
||||
const std::string checksum_hex;
|
||||
// DB identities
|
||||
// db_id is obtained for potential usage in the future but not used
|
||||
// currently; db_session_id appears in the backup SST filename
|
||||
// currently
|
||||
const std::string db_id;
|
||||
// db_session_id appears in the backup SST filename if the table naming
|
||||
// option is kOptionalChecksumAndDbSessionId
|
||||
const std::string db_session_id;
|
||||
};
|
||||
|
||||
@ -279,6 +297,7 @@ class BackupEngineImpl : public BackupEngine {
|
||||
inline std::string GetPrivateDirRel() const {
|
||||
return "private";
|
||||
}
|
||||
inline std::string GetSharedDirRel() const { return "shared"; }
|
||||
inline std::string GetSharedChecksumDirRel() const {
|
||||
return "shared_checksum";
|
||||
}
|
||||
@ -292,7 +311,7 @@ class BackupEngineImpl : public BackupEngine {
|
||||
inline std::string GetSharedFileRel(const std::string& file = "",
|
||||
bool tmp = false) const {
|
||||
assert(file.size() == 0 || file[0] != '/');
|
||||
return std::string("shared/") + (tmp ? "." : "") + file +
|
||||
return GetSharedDirRel() + "/" + (tmp ? "." : "") + file +
|
||||
(tmp ? ".tmp" : "");
|
||||
}
|
||||
inline std::string GetSharedFileWithChecksumRel(const std::string& file = "",
|
||||
@ -301,22 +320,29 @@ class BackupEngineImpl : public BackupEngine {
|
||||
return GetSharedChecksumDirRel() + "/" + (tmp ? "." : "") + file +
|
||||
(tmp ? ".tmp" : "");
|
||||
}
|
||||
// If kChecksumAndDbSessionId is the naming option and db_session_id is not
|
||||
// empty, backup SST filenames consist of file_number, crc32c, db_session_id.
|
||||
// Otherwise, backup SST filenames consist of file_number, crc32c, file_size.
|
||||
inline bool UseSessionId(const std::string& sid) const {
|
||||
return GetTableNamingOption() == kOptionalChecksumAndDbSessionId &&
|
||||
!sid.empty();
|
||||
}
|
||||
inline std::string GetSharedFileWithChecksum(
|
||||
const std::string& file, const uint32_t checksum_value,
|
||||
const uint64_t file_size, const std::string& db_session_id) const {
|
||||
const std::string& file, bool has_checksum,
|
||||
const std::string& checksum_hex, const uint64_t file_size,
|
||||
const std::string& db_session_id) const {
|
||||
assert(file.size() == 0 || file[0] != '/');
|
||||
std::string file_copy = file;
|
||||
const std::string suffix =
|
||||
GetTableNamingOption() == kChecksumAndDbSessionId &&
|
||||
!db_session_id.empty()
|
||||
? db_session_id
|
||||
: ROCKSDB_NAMESPACE::ToString(file_size);
|
||||
return file_copy.insert(
|
||||
file_copy.find_last_of('.'),
|
||||
"_" + ROCKSDB_NAMESPACE::ToString(checksum_value) + "_" + suffix);
|
||||
if (UseSessionId(db_session_id)) {
|
||||
if (has_checksum) {
|
||||
return file_copy.insert(file_copy.find_last_of('.'),
|
||||
"_" + checksum_hex + "_" + db_session_id);
|
||||
} else {
|
||||
return file_copy.insert(file_copy.find_last_of('.'),
|
||||
"_" + db_session_id);
|
||||
}
|
||||
} else {
|
||||
return file_copy.insert(file_copy.find_last_of('.'),
|
||||
"_" + ToString(ChecksumHexToInt32(checksum_hex)) +
|
||||
"_" + ToString(file_size));
|
||||
}
|
||||
}
|
||||
inline std::string GetFileFromChecksumFile(const std::string& file) const {
|
||||
assert(file.size() == 0 || file[0] != '/');
|
||||
@ -332,17 +358,6 @@ class BackupEngineImpl : public BackupEngine {
|
||||
return GetBackupMetaDir() + "/" + (tmp ? "." : "") +
|
||||
ROCKSDB_NAMESPACE::ToString(backup_id) + (tmp ? ".tmp" : "");
|
||||
}
|
||||
inline bool IsSstFile(const std::string& fname) const {
|
||||
return fname.length() > 4 && fname.rfind(".sst") == fname.length() - 4;
|
||||
}
|
||||
inline std::string ChecksumInt32ToStr(const uint32_t& checksum_int) {
|
||||
std::string checksum_str;
|
||||
PutFixed32(&checksum_str, EndianSwapValue(checksum_int));
|
||||
return checksum_str;
|
||||
}
|
||||
inline uint32_t ChecksumStrToInt32(const std::string& checksum_str) {
|
||||
return EndianSwapValue(DecodeFixed32(checksum_str.c_str()));
|
||||
}
|
||||
|
||||
// If size_limit == 0, there is no size limit, copy everything.
|
||||
//
|
||||
@ -354,14 +369,13 @@ class BackupEngineImpl : public BackupEngine {
|
||||
const std::string& src, const std::string& dst,
|
||||
const std::string& contents, Env* src_env, Env* dst_env,
|
||||
const EnvOptions& src_env_options, bool sync, RateLimiter* rate_limiter,
|
||||
uint64_t* size = nullptr, uint32_t* checksum_value = nullptr,
|
||||
uint64_t* size = nullptr, std::string* checksum_hex = nullptr,
|
||||
uint64_t size_limit = 0,
|
||||
std::function<void()> progress_callback = []() {},
|
||||
std::string* db_id = nullptr, std::string* db_session_id = nullptr);
|
||||
std::function<void()> progress_callback = []() {});
|
||||
|
||||
Status CalculateChecksum(const std::string& src, Env* src_env,
|
||||
const EnvOptions& src_env_options,
|
||||
uint64_t size_limit, uint32_t* checksum_value);
|
||||
uint64_t size_limit, std::string* checksum_hex);
|
||||
|
||||
// Obtain db_id and db_session_id from the table properties of file_path
|
||||
Status GetFileDbIdentities(Env* src_env, const EnvOptions& src_env_options,
|
||||
@ -370,7 +384,7 @@ class BackupEngineImpl : public BackupEngine {
|
||||
|
||||
struct CopyOrCreateResult {
|
||||
uint64_t size;
|
||||
uint32_t checksum_value;
|
||||
std::string checksum_hex;
|
||||
std::string db_id;
|
||||
std::string db_session_id;
|
||||
Status status;
|
||||
@ -393,7 +407,9 @@ class BackupEngineImpl : public BackupEngine {
|
||||
std::function<void()> progress_callback;
|
||||
bool verify_checksum_after_work;
|
||||
std::string src_checksum_func_name;
|
||||
std::string src_checksum_str;
|
||||
std::string src_checksum_hex;
|
||||
std::string db_id;
|
||||
std::string db_session_id;
|
||||
|
||||
CopyOrCreateWorkItem()
|
||||
: src_path(""),
|
||||
@ -407,7 +423,9 @@ class BackupEngineImpl : public BackupEngine {
|
||||
size_limit(0),
|
||||
verify_checksum_after_work(false),
|
||||
src_checksum_func_name(kUnknownFileChecksumFuncName),
|
||||
src_checksum_str(kUnknownFileChecksum) {}
|
||||
src_checksum_hex(""),
|
||||
db_id(""),
|
||||
db_session_id("") {}
|
||||
|
||||
CopyOrCreateWorkItem(const CopyOrCreateWorkItem&) = delete;
|
||||
CopyOrCreateWorkItem& operator=(const CopyOrCreateWorkItem&) = delete;
|
||||
@ -429,8 +447,10 @@ class BackupEngineImpl : public BackupEngine {
|
||||
result = std::move(o.result);
|
||||
progress_callback = std::move(o.progress_callback);
|
||||
verify_checksum_after_work = o.verify_checksum_after_work;
|
||||
src_checksum_func_name = o.src_checksum_func_name;
|
||||
src_checksum_str = o.src_checksum_str;
|
||||
src_checksum_func_name = std::move(o.src_checksum_func_name);
|
||||
src_checksum_hex = std::move(o.src_checksum_hex);
|
||||
db_id = std::move(o.db_id);
|
||||
db_session_id = std::move(o.db_session_id);
|
||||
return *this;
|
||||
}
|
||||
|
||||
@ -439,10 +459,11 @@ class BackupEngineImpl : public BackupEngine {
|
||||
Env* _src_env, Env* _dst_env, EnvOptions _src_env_options, bool _sync,
|
||||
RateLimiter* _rate_limiter, uint64_t _size_limit,
|
||||
std::function<void()> _progress_callback = []() {},
|
||||
const bool& _verify_checksum_after_work = false,
|
||||
bool _verify_checksum_after_work = false,
|
||||
const std::string& _src_checksum_func_name =
|
||||
kUnknownFileChecksumFuncName,
|
||||
const std::string& _src_checksum_str = kUnknownFileChecksum)
|
||||
const std::string& _src_checksum_hex = "",
|
||||
const std::string& _db_id = "", const std::string& _db_session_id = "")
|
||||
: src_path(std::move(_src_path)),
|
||||
dst_path(std::move(_dst_path)),
|
||||
contents(std::move(_contents)),
|
||||
@ -455,7 +476,9 @@ class BackupEngineImpl : public BackupEngine {
|
||||
progress_callback(_progress_callback),
|
||||
verify_checksum_after_work(_verify_checksum_after_work),
|
||||
src_checksum_func_name(_src_checksum_func_name),
|
||||
src_checksum_str(_src_checksum_str) {}
|
||||
src_checksum_hex(_src_checksum_hex),
|
||||
db_id(_db_id),
|
||||
db_session_id(_db_session_id) {}
|
||||
};
|
||||
|
||||
struct BackupAfterCopyOrCreateWorkItem {
|
||||
@ -507,12 +530,11 @@ class BackupEngineImpl : public BackupEngine {
|
||||
|
||||
struct RestoreAfterCopyOrCreateWorkItem {
|
||||
std::future<CopyOrCreateResult> result;
|
||||
uint32_t checksum_value;
|
||||
RestoreAfterCopyOrCreateWorkItem()
|
||||
: checksum_value(0) {}
|
||||
std::string checksum_hex;
|
||||
RestoreAfterCopyOrCreateWorkItem() : checksum_hex("") {}
|
||||
RestoreAfterCopyOrCreateWorkItem(std::future<CopyOrCreateResult>&& _result,
|
||||
uint32_t _checksum_value)
|
||||
: result(std::move(_result)), checksum_value(_checksum_value) {}
|
||||
const std::string& _checksum_hex)
|
||||
: result(std::move(_result)), checksum_hex(_checksum_hex) {}
|
||||
RestoreAfterCopyOrCreateWorkItem(RestoreAfterCopyOrCreateWorkItem&& o)
|
||||
ROCKSDB_NOEXCEPT {
|
||||
*this = std::move(o);
|
||||
@ -521,7 +543,7 @@ class BackupEngineImpl : public BackupEngine {
|
||||
RestoreAfterCopyOrCreateWorkItem& operator=(
|
||||
RestoreAfterCopyOrCreateWorkItem&& o) ROCKSDB_NOEXCEPT {
|
||||
result = std::move(o.result);
|
||||
checksum_value = o.checksum_value;
|
||||
checksum_hex = std::move(o.checksum_hex);
|
||||
return *this;
|
||||
}
|
||||
};
|
||||
@ -837,8 +859,10 @@ Status BackupEngineImpl::Initialize() {
|
||||
work_item.src_path, work_item.dst_path, work_item.contents,
|
||||
work_item.src_env, work_item.dst_env, work_item.src_env_options,
|
||||
work_item.sync, work_item.rate_limiter, &result.size,
|
||||
&result.checksum_value, work_item.size_limit,
|
||||
work_item.progress_callback, &result.db_id, &result.db_session_id);
|
||||
&result.checksum_hex, work_item.size_limit,
|
||||
work_item.progress_callback);
|
||||
result.db_id = work_item.db_id;
|
||||
result.db_session_id = work_item.db_session_id;
|
||||
if (result.status.ok() && work_item.verify_checksum_after_work) {
|
||||
// unknown checksum function name implies no db table file checksum in
|
||||
// db manifest; work_item.verify_checksum_after_work being true means
|
||||
@ -847,13 +871,10 @@ Status BackupEngineImpl::Initialize() {
|
||||
if (work_item.src_checksum_func_name ==
|
||||
kUnknownFileChecksumFuncName ||
|
||||
work_item.src_checksum_func_name == kDbFileChecksumFuncName) {
|
||||
uint32_t src_checksum_int =
|
||||
ChecksumStrToInt32(work_item.src_checksum_str);
|
||||
if (src_checksum_int != result.checksum_value) {
|
||||
std::string checksum_info("Expected checksum is " +
|
||||
ToString(src_checksum_int) +
|
||||
" while computed checksum is " +
|
||||
ToString(result.checksum_value));
|
||||
if (work_item.src_checksum_hex != result.checksum_hex) {
|
||||
std::string checksum_info(
|
||||
"Expected checksum is " + work_item.src_checksum_hex +
|
||||
" while computed checksum is " + result.checksum_hex);
|
||||
result.status =
|
||||
Status::Corruption("Checksum mismatch after copying to " +
|
||||
work_item.dst_path + ": " + checksum_info);
|
||||
@ -1037,12 +1058,12 @@ Status BackupEngineImpl::CreateNewBackupWithMetadata(
|
||||
auto result = item.result.get();
|
||||
item_status = result.status;
|
||||
if (item_status.ok() && item.shared && item.needed_to_copy) {
|
||||
item_status = item.backup_env->RenameFile(item.dst_path_tmp,
|
||||
item.dst_path);
|
||||
item_status =
|
||||
item.backup_env->RenameFile(item.dst_path_tmp, item.dst_path);
|
||||
}
|
||||
if (item_status.ok()) {
|
||||
item_status = new_backup.get()->AddFile(std::make_shared<FileInfo>(
|
||||
item.dst_relative, result.size, result.checksum_value, result.db_id,
|
||||
item.dst_relative, result.size, result.checksum_hex, result.db_id,
|
||||
result.db_session_id));
|
||||
}
|
||||
if (!item_status.ok()) {
|
||||
@ -1310,18 +1331,19 @@ Status BackupEngineImpl::RestoreDBFromBackup(const RestoreOptions& options,
|
||||
Status s;
|
||||
std::vector<RestoreAfterCopyOrCreateWorkItem> restore_items_to_finish;
|
||||
for (const auto& file_info : backup->GetFiles()) {
|
||||
const std::string &file = file_info->filename;
|
||||
const std::string& file = file_info->filename;
|
||||
std::string dst;
|
||||
// 1. extract the filename
|
||||
size_t slash = file.find_last_of('/');
|
||||
// file will either be shared/<file>, shared_checksum/<file_crc32c_size>
|
||||
// shared_checksum/<file_crc32c_session>, or private/<number>/<file>
|
||||
// file will either be shared/<file>, shared_checksum/<file_crc32c_size>,
|
||||
// shared_checksum/<file_session>, shared_checksum/<file_crc32c_session>,
|
||||
// or private/<number>/<file>
|
||||
assert(slash != std::string::npos);
|
||||
dst = file.substr(slash + 1);
|
||||
|
||||
// if the file was in shared_checksum, extract the real file name
|
||||
// in this case the file is <number>_<checksum>_<size>.<type>
|
||||
// or <number>_<checksum>_<session>.<type> if new naming is used
|
||||
// in this case the file is <number>_<checksum>_<size>.<type>,
|
||||
// <number>_<session>.<type>, or <number>_<checksum>_<session>.<type>
|
||||
if (file.substr(0, slash) == GetSharedChecksumDirRel()) {
|
||||
dst = GetFileFromChecksumFile(dst);
|
||||
}
|
||||
@ -1346,8 +1368,7 @@ Status BackupEngineImpl::RestoreDBFromBackup(const RestoreOptions& options,
|
||||
EnvOptions() /* src_env_options */, false, rate_limiter,
|
||||
0 /* size_limit */);
|
||||
RestoreAfterCopyOrCreateWorkItem after_copy_or_create_work_item(
|
||||
copy_or_create_work_item.result.get_future(),
|
||||
file_info->checksum_value);
|
||||
copy_or_create_work_item.result.get_future(), file_info->checksum_hex);
|
||||
files_to_copy_or_create_.write(std::move(copy_or_create_work_item));
|
||||
restore_items_to_finish.push_back(
|
||||
std::move(after_copy_or_create_work_item));
|
||||
@ -1362,7 +1383,7 @@ Status BackupEngineImpl::RestoreDBFromBackup(const RestoreOptions& options,
|
||||
if (!item_status.ok()) {
|
||||
s = item_status;
|
||||
break;
|
||||
} else if (item.checksum_value != result.checksum_value) {
|
||||
} else if (item.checksum_hex != result.checksum_hex) {
|
||||
s = Status::Corruption("Checksum check failed");
|
||||
break;
|
||||
}
|
||||
@ -1420,15 +1441,15 @@ Status BackupEngineImpl::VerifyBackup(BackupID backup_id,
|
||||
}
|
||||
if (verify_with_checksum) {
|
||||
// verify file checksum
|
||||
uint32_t checksum_value = 0;
|
||||
std::string checksum_hex;
|
||||
ROCKS_LOG_INFO(options_.info_log, "Verifying %s checksum...\n",
|
||||
abs_path.c_str());
|
||||
CalculateChecksum(abs_path, backup_env_, EnvOptions(), 0 /* size_limit */,
|
||||
&checksum_value);
|
||||
if (file_info->checksum_value != checksum_value) {
|
||||
&checksum_hex);
|
||||
if (file_info->checksum_hex != checksum_hex) {
|
||||
std::string checksum_info(
|
||||
"Expected checksum is " + ToString(file_info->checksum_value) +
|
||||
" while computed checksum is " + ToString(checksum_value));
|
||||
"Expected checksum is " + file_info->checksum_hex +
|
||||
" while computed checksum is " + checksum_hex);
|
||||
return Status::Corruption("File corrupted: Checksum mismatch for " +
|
||||
abs_path + ": " + checksum_info);
|
||||
}
|
||||
@ -1440,9 +1461,8 @@ Status BackupEngineImpl::VerifyBackup(BackupID backup_id,
|
||||
Status BackupEngineImpl::CopyOrCreateFile(
|
||||
const std::string& src, const std::string& dst, const std::string& contents,
|
||||
Env* src_env, Env* dst_env, const EnvOptions& src_env_options, bool sync,
|
||||
RateLimiter* rate_limiter, uint64_t* size, uint32_t* checksum_value,
|
||||
uint64_t size_limit, std::function<void()> progress_callback,
|
||||
std::string* db_id, std::string* db_session_id) {
|
||||
RateLimiter* rate_limiter, uint64_t* size, std::string* checksum_hex,
|
||||
uint64_t size_limit, std::function<void()> progress_callback) {
|
||||
assert(src.empty() != contents.empty());
|
||||
Status s;
|
||||
std::unique_ptr<WritableFile> dst_file;
|
||||
@ -1453,15 +1473,7 @@ Status BackupEngineImpl::CopyOrCreateFile(
|
||||
if (size != nullptr) {
|
||||
*size = 0;
|
||||
}
|
||||
if (checksum_value != nullptr) {
|
||||
*checksum_value = 0;
|
||||
}
|
||||
if (db_id != nullptr) {
|
||||
*db_id = "";
|
||||
}
|
||||
if (db_session_id != nullptr) {
|
||||
*db_session_id = "";
|
||||
}
|
||||
uint32_t checksum_value = 0;
|
||||
|
||||
// Check if size limit is set. if not, set it to very big number
|
||||
if (size_limit == 0) {
|
||||
@ -1504,7 +1516,8 @@ Status BackupEngineImpl::CopyOrCreateFile(
|
||||
size_limit -= data.size();
|
||||
TEST_SYNC_POINT_CALLBACK(
|
||||
"BackupEngineImpl::CopyOrCreateFile:CorruptionDuringBackup",
|
||||
IsSstFile(src) ? &data : nullptr);
|
||||
(src.length() > 4 && src.rfind(".sst") == src.length() - 4) ? &data
|
||||
: nullptr);
|
||||
|
||||
if (!s.ok()) {
|
||||
return s;
|
||||
@ -1513,9 +1526,8 @@ Status BackupEngineImpl::CopyOrCreateFile(
|
||||
if (size != nullptr) {
|
||||
*size += data.size();
|
||||
}
|
||||
if (checksum_value != nullptr) {
|
||||
*checksum_value =
|
||||
crc32c::Extend(*checksum_value, data.data(), data.size());
|
||||
if (checksum_hex != nullptr) {
|
||||
checksum_value = crc32c::Extend(checksum_value, data.data(), data.size());
|
||||
}
|
||||
s = dest_writer->Append(data);
|
||||
if (rate_limiter != nullptr) {
|
||||
@ -1529,29 +1541,17 @@ Status BackupEngineImpl::CopyOrCreateFile(
|
||||
}
|
||||
} while (s.ok() && contents.empty() && data.size() > 0 && size_limit > 0);
|
||||
|
||||
// Convert uint32_t checksum to hex checksum
|
||||
if (checksum_hex != nullptr) {
|
||||
checksum_hex->assign(ChecksumInt32ToHex(checksum_value));
|
||||
}
|
||||
|
||||
if (s.ok() && sync) {
|
||||
s = dest_writer->Sync(false);
|
||||
}
|
||||
if (s.ok()) {
|
||||
s = dest_writer->Close();
|
||||
}
|
||||
if (s.ok() && GetTableNamingOption() == kChecksumAndDbSessionId) {
|
||||
// When copying SST files and using db_session_id in the name,
|
||||
// try to get DB identities
|
||||
// Note that when CopyOrCreateFile() is called while restoring, we still
|
||||
// try obtaining the ids but as for now we do not use ids to verify
|
||||
// the restored file
|
||||
if (!src.empty()) {
|
||||
// copying
|
||||
if (IsSstFile(src) && (db_id != nullptr || db_session_id != nullptr)) {
|
||||
// SST file
|
||||
// Ignore the returned status
|
||||
// In the failed cases, db_id and db_session_id will be empty
|
||||
GetFileDbIdentities(src_env, src_env_options, src, db_id,
|
||||
db_session_id);
|
||||
}
|
||||
}
|
||||
}
|
||||
return s;
|
||||
}
|
||||
|
||||
@ -1571,65 +1571,67 @@ Status BackupEngineImpl::AddBackupFileWorkItem(
|
||||
std::string dst_relative = fname.substr(1);
|
||||
std::string dst_relative_tmp;
|
||||
Status s;
|
||||
uint32_t checksum_value = 0;
|
||||
std::string checksum_hex;
|
||||
std::string db_id;
|
||||
std::string db_session_id;
|
||||
// whether the checksum for a table file has been computed
|
||||
// whether the checksum for a table file is available
|
||||
bool has_checksum = false;
|
||||
|
||||
// Whenever a default checksum function name is passed in, we will verify it
|
||||
// before copying. Note that only table files may have a known checksum name
|
||||
// passed in.
|
||||
// Whenever a default checksum function name is passed in, we will compares
|
||||
// the corresponding checksum values after copying. Note that only table files
|
||||
// may have a known checksum function name passed in.
|
||||
//
|
||||
// If no default checksum function name is passed in, we will calculate the
|
||||
// checksum *before* copying in two cases (we always calcuate checksums when
|
||||
// copying or creating for any file types):
|
||||
// If no default checksum function name is passed in and db session id is not
|
||||
// available, we will calculate the checksum *before* copying in two cases
|
||||
// (we always calcuate checksums when copying or creating for any file types):
|
||||
// a) share_files_with_checksum is true and file type is table;
|
||||
// b) share_table_files is true and the file exists already.
|
||||
//
|
||||
// Step 0: Check if default checksum function name is passed in
|
||||
if (kDbFileChecksumFuncName == src_checksum_func_name) {
|
||||
if (src_checksum_str == kUnknownFileChecksum) {
|
||||
return Status::Aborted("Unknown checksum value for " + fname);
|
||||
}
|
||||
s = CalculateChecksum(src_dir + fname, db_env_, src_env_options, size_limit,
|
||||
&checksum_value);
|
||||
if (!s.ok()) {
|
||||
return s;
|
||||
}
|
||||
// Convert src_checksum_str to uint32_t and compare
|
||||
uint32_t src_checksum_int = ChecksumStrToInt32(src_checksum_str);
|
||||
if (src_checksum_int != checksum_value) {
|
||||
std::string checksum_info(
|
||||
"Expected checksum is " + ToString(src_checksum_int) +
|
||||
" while computed checksum is " + ToString(checksum_value));
|
||||
return Status::Corruption("Checksum mismatch before copying from " +
|
||||
fname + ": " + checksum_info);
|
||||
}
|
||||
checksum_hex = ChecksumStrToHex(src_checksum_str);
|
||||
has_checksum = true;
|
||||
}
|
||||
|
||||
// Step 1: Prepare the relative path to destination
|
||||
if (shared && shared_checksum) {
|
||||
// add checksum and file length to the file name
|
||||
if (!has_checksum) {
|
||||
s = CalculateChecksum(src_dir + fname, db_env_, src_env_options,
|
||||
size_limit, &checksum_value);
|
||||
if (!s.ok()) {
|
||||
return s;
|
||||
}
|
||||
has_checksum = true;
|
||||
}
|
||||
if (GetTableNamingOption() == kChecksumAndDbSessionId) {
|
||||
if (GetTableNamingOption() == kOptionalChecksumAndDbSessionId) {
|
||||
// Prepare db_session_id to add to the file name
|
||||
// Ignore the returned status
|
||||
// In the failed cases, db_id and db_session_id will be empty
|
||||
GetFileDbIdentities(db_env_, src_env_options, src_dir + fname, &db_id,
|
||||
&db_session_id);
|
||||
}
|
||||
// Calculate checksum if checksum and db session id are not available.
|
||||
// If db session id is available, we will not calculate the checksum
|
||||
// since the session id should suffice to avoid file name collision in
|
||||
// the shared_checksum directory.
|
||||
if (!has_checksum && db_session_id.empty()) {
|
||||
s = CalculateChecksum(src_dir + fname, db_env_, src_env_options,
|
||||
size_limit, &checksum_hex);
|
||||
if (!s.ok()) {
|
||||
return s;
|
||||
}
|
||||
has_checksum = true;
|
||||
}
|
||||
if (size_bytes == port::kMaxUint64) {
|
||||
return Status::NotFound("File missing: " + src_dir + fname);
|
||||
}
|
||||
dst_relative = GetSharedFileWithChecksum(dst_relative, checksum_value,
|
||||
size_bytes, db_session_id);
|
||||
// dst_relative depends on the following conditions:
|
||||
// 1) the naming scheme is kOptionalChecksumAndDbSessionId,
|
||||
// 2) db_session_id is not empty,
|
||||
// 3) checksum is available in the DB manifest.
|
||||
// If 1,2,3) are satisfied, then dst_relative will be of the form:
|
||||
// shared_checksum/<file_number>_<checksum>_<db_session_id>.sst
|
||||
// If 1,2) are satisfied, then dst_relative will be of the form:
|
||||
// shared_checksum/<file_number>_<db_session_id>.sst
|
||||
// Otherwise, dst_relative is of the form
|
||||
// shared_checksum/<file_number>_<checksum>_<size>.sst
|
||||
dst_relative = GetSharedFileWithChecksum(
|
||||
dst_relative, has_checksum, checksum_hex, size_bytes, db_session_id);
|
||||
dst_relative_tmp = GetSharedFileWithChecksumRel(dst_relative, true);
|
||||
dst_relative = GetSharedFileWithChecksumRel(dst_relative, false);
|
||||
} else if (shared) {
|
||||
@ -1681,20 +1683,43 @@ Status BackupEngineImpl::AddBackupFileWorkItem(
|
||||
} else if (shared && (same_path || file_exists)) {
|
||||
need_to_copy = false;
|
||||
if (shared_checksum) {
|
||||
if (GetTableNamingOption() == kChecksumAndDbSessionId &&
|
||||
!db_session_id.empty()) {
|
||||
ROCKS_LOG_INFO(options_.info_log,
|
||||
"%s already present, with checksum %u, size %" PRIu64
|
||||
" and DB session identity %s",
|
||||
fname.c_str(), checksum_value, size_bytes,
|
||||
db_session_id.c_str());
|
||||
if (backuped_file_infos_.find(dst_relative) ==
|
||||
backuped_file_infos_.end() &&
|
||||
!same_path) {
|
||||
// file exists but not referenced
|
||||
ROCKS_LOG_INFO(
|
||||
options_.info_log,
|
||||
"%s already present, but not referenced by any backup. We will "
|
||||
"overwrite the file.",
|
||||
fname.c_str());
|
||||
need_to_copy = true;
|
||||
backup_env_->DeleteFile(final_dest_path);
|
||||
} else {
|
||||
ROCKS_LOG_INFO(options_.info_log,
|
||||
"%s already present, with checksum %u and size %" PRIu64,
|
||||
fname.c_str(), checksum_value, size_bytes);
|
||||
// file exists and referenced
|
||||
if (!has_checksum) {
|
||||
s = CalculateChecksum(src_dir + fname, db_env_, src_env_options,
|
||||
size_limit, &checksum_hex);
|
||||
if (!s.ok()) {
|
||||
return s;
|
||||
}
|
||||
has_checksum = true;
|
||||
}
|
||||
if (UseSessionId(db_session_id)) {
|
||||
ROCKS_LOG_INFO(options_.info_log,
|
||||
"%s already present, with checksum %s, size %" PRIu64
|
||||
" and DB session identity %s",
|
||||
fname.c_str(), checksum_hex.c_str(), size_bytes,
|
||||
db_session_id.c_str());
|
||||
} else {
|
||||
ROCKS_LOG_INFO(
|
||||
options_.info_log,
|
||||
"%s already present, with checksum %s and size %" PRIu64,
|
||||
fname.c_str(), checksum_hex.c_str(), size_bytes);
|
||||
}
|
||||
}
|
||||
} else if (backuped_file_infos_.find(dst_relative) ==
|
||||
backuped_file_infos_.end() && !same_path) {
|
||||
backuped_file_infos_.end() &&
|
||||
!same_path) {
|
||||
// file already exists, but it's not referenced by any backup. overwrite
|
||||
// the file
|
||||
ROCKS_LOG_INFO(
|
||||
@ -1710,23 +1735,12 @@ Status BackupEngineImpl::AddBackupFileWorkItem(
|
||||
"%s already present, calculate checksum", fname.c_str());
|
||||
if (!has_checksum) {
|
||||
s = CalculateChecksum(src_dir + fname, db_env_, src_env_options,
|
||||
size_limit, &checksum_value);
|
||||
size_limit, &checksum_hex);
|
||||
if (!s.ok()) {
|
||||
return s;
|
||||
}
|
||||
has_checksum = true;
|
||||
}
|
||||
// try to get the db identities as they are also members of
|
||||
// the class CopyOrCreateResult
|
||||
if (GetTableNamingOption() == kChecksumAndDbSessionId) {
|
||||
assert(IsSstFile(fname));
|
||||
ROCKS_LOG_INFO(options_.info_log,
|
||||
"%s checksum checksum calculated, try to obtain DB "
|
||||
"session identity",
|
||||
fname.c_str());
|
||||
GetFileDbIdentities(db_env_, src_env_options, src_dir + fname, &db_id,
|
||||
&db_session_id);
|
||||
}
|
||||
}
|
||||
}
|
||||
live_dst_paths.insert(final_dest_path);
|
||||
@ -1739,7 +1753,7 @@ Status BackupEngineImpl::AddBackupFileWorkItem(
|
||||
src_dir.empty() ? "" : src_dir + fname, *copy_dest_path, contents,
|
||||
db_env_, backup_env_, src_env_options, options_.sync, rate_limiter,
|
||||
size_limit, progress_callback, has_checksum, src_checksum_func_name,
|
||||
ChecksumInt32ToStr(checksum_value));
|
||||
checksum_hex, db_id, db_session_id);
|
||||
BackupAfterCopyOrCreateWorkItem after_copy_or_create_work_item(
|
||||
copy_or_create_work_item.result.get_future(), shared, need_to_copy,
|
||||
backup_env_, temp_dest_path, final_dest_path, dst_relative);
|
||||
@ -1754,9 +1768,9 @@ Status BackupEngineImpl::AddBackupFileWorkItem(
|
||||
CopyOrCreateResult result;
|
||||
result.status = s;
|
||||
result.size = size_bytes;
|
||||
result.checksum_value = checksum_value;
|
||||
result.db_id = db_id;
|
||||
result.db_session_id = db_session_id;
|
||||
result.checksum_hex = std::move(checksum_hex);
|
||||
result.db_id = std::move(db_id);
|
||||
result.db_session_id = std::move(db_session_id);
|
||||
promise_result.set_value(std::move(result));
|
||||
}
|
||||
return s;
|
||||
@ -1765,8 +1779,11 @@ Status BackupEngineImpl::AddBackupFileWorkItem(
|
||||
Status BackupEngineImpl::CalculateChecksum(const std::string& src, Env* src_env,
|
||||
const EnvOptions& src_env_options,
|
||||
uint64_t size_limit,
|
||||
uint32_t* checksum_value) {
|
||||
*checksum_value = 0;
|
||||
std::string* checksum_hex) {
|
||||
if (checksum_hex == nullptr) {
|
||||
return Status::Aborted("Checksum pointer is null");
|
||||
}
|
||||
uint32_t checksum_value = 0;
|
||||
if (size_limit == 0) {
|
||||
size_limit = std::numeric_limits<uint64_t>::max();
|
||||
}
|
||||
@ -1795,9 +1812,11 @@ Status BackupEngineImpl::CalculateChecksum(const std::string& src, Env* src_env,
|
||||
}
|
||||
|
||||
size_limit -= data.size();
|
||||
*checksum_value = crc32c::Extend(*checksum_value, data.data(), data.size());
|
||||
checksum_value = crc32c::Extend(checksum_value, data.data(), data.size());
|
||||
} while (data.size() > 0 && size_limit > 0);
|
||||
|
||||
checksum_hex->assign(ChecksumInt32ToHex(checksum_value));
|
||||
|
||||
return s;
|
||||
}
|
||||
|
||||
@ -2024,7 +2043,7 @@ Status BackupEngineImpl::BackupMeta::AddFile(
|
||||
return Status::Corruption("In memory metadata insertion error");
|
||||
}
|
||||
} else {
|
||||
if (itr->second->checksum_value != file_info->checksum_value) {
|
||||
if (itr->second->checksum_hex != file_info->checksum_hex) {
|
||||
return Status::Corruption(
|
||||
"Checksum mismatch for existing backup file. Delete old backups and "
|
||||
"try again.");
|
||||
@ -2154,7 +2173,8 @@ Status BackupEngineImpl::BackupMeta::LoadFromFile(
|
||||
" in " + meta_filename_);
|
||||
}
|
||||
|
||||
files.emplace_back(new FileInfo(filename, size, checksum_value));
|
||||
files.emplace_back(
|
||||
new FileInfo(filename, size, ChecksumInt32ToHex(checksum_value)));
|
||||
}
|
||||
|
||||
if (s.ok() && data.size() > 0) {
|
||||
@ -2197,7 +2217,8 @@ Status BackupEngineImpl::BackupMeta::StoreToFile(bool sync) {
|
||||
Slice(app_metadata_).ToString(/* hex */ true);
|
||||
|
||||
// +1 to accommodate newline character
|
||||
size_t hex_meta_strlen = kMetaDataPrefix.ToString().length() + hex_encoded_metadata.length() + 1;
|
||||
size_t hex_meta_strlen =
|
||||
kMetaDataPrefix.ToString().length() + hex_encoded_metadata.length() + 1;
|
||||
if (hex_meta_strlen >= buf_size) {
|
||||
return Status::Corruption("Buffer too small to fit backup metadata");
|
||||
}
|
||||
@ -2232,8 +2253,10 @@ Status BackupEngineImpl::BackupMeta::StoreToFile(bool sync) {
|
||||
// use crc32c for now, switch to something else if needed
|
||||
// WART: The checksums are crc32c, not original crc32
|
||||
|
||||
size_t newlen = len + file->filename.length() + snprintf(writelen_temp,
|
||||
sizeof(writelen_temp), " crc32 %u\n", file->checksum_value);
|
||||
size_t newlen =
|
||||
len + file->filename.length() +
|
||||
snprintf(writelen_temp, sizeof(writelen_temp), " crc32 %u\n",
|
||||
ChecksumHexToInt32(file->checksum_hex));
|
||||
const char *const_write = writelen_temp;
|
||||
if (newlen >= buf_size) {
|
||||
backup_meta_file->Append(Slice(buf.get(), len));
|
||||
|
@ -1251,27 +1251,57 @@ TEST_P(BackupableDBTestWithParam, TableFileCorruptedBeforeBackup) {
|
||||
CloseDBAndBackupEngine();
|
||||
}
|
||||
|
||||
TEST_F(BackupableDBTest, TableFileCorruptedDuringBackup) {
|
||||
TEST_F(BackupableDBTest, TableFileWithoutDbChecksumCorruptedDuringBackup) {
|
||||
const int keys_iteration = 50000;
|
||||
std::vector<std::shared_ptr<FileChecksumGenFactory>> fac{
|
||||
nullptr, GetFileChecksumGenCrc32cFactory()};
|
||||
for (auto& f : fac) {
|
||||
options_.file_checksum_gen_factory = f;
|
||||
if (f == nullptr) {
|
||||
// When share_files_with_checksum is on, we calculate checksums of table
|
||||
// files before and after copying. So we can test whether a corruption has
|
||||
// happened during the file is copied to backup directory.
|
||||
OpenDBAndBackupEngine(true /* destroy_old_data */, false /* dummy */,
|
||||
kShareWithChecksum);
|
||||
} else {
|
||||
// Default DB table file checksum is on, we calculate checksums of table
|
||||
// files before copying to verify it with the one stored in DB manifest
|
||||
// and also calculate checksum after copying. So we can test whether a
|
||||
// corruption has happened during the file is copied to backup directory
|
||||
// even if we do not place table files in shared_checksum directory.
|
||||
OpenDBAndBackupEngine(true /* destroy_old_data */, false /* dummy */,
|
||||
kNoShare);
|
||||
}
|
||||
backupable_options_->share_files_with_checksum_naming = kChecksumAndFileSize;
|
||||
// When share_files_with_checksum is on, we calculate checksums of table
|
||||
// files before and after copying. So we can test whether a corruption has
|
||||
// happened during the file is copied to backup directory.
|
||||
OpenDBAndBackupEngine(true /* destroy_old_data */, false /* dummy */,
|
||||
kShareWithChecksum);
|
||||
|
||||
FillDB(db_.get(), 0, keys_iteration);
|
||||
bool corrupted = false;
|
||||
// corrupt files when copying to the backup directory
|
||||
SyncPoint::GetInstance()->SetCallBack(
|
||||
"BackupEngineImpl::CopyOrCreateFile:CorruptionDuringBackup",
|
||||
[&](void* data) {
|
||||
if (data != nullptr) {
|
||||
Slice* d = reinterpret_cast<Slice*>(data);
|
||||
if (!d->empty()) {
|
||||
d->remove_suffix(1);
|
||||
corrupted = true;
|
||||
}
|
||||
}
|
||||
});
|
||||
SyncPoint::GetInstance()->EnableProcessing();
|
||||
Status s = backup_engine_->CreateNewBackup(db_.get());
|
||||
if (corrupted) {
|
||||
ASSERT_NOK(s);
|
||||
} else {
|
||||
// should not in this path in normal cases
|
||||
ASSERT_OK(s);
|
||||
}
|
||||
|
||||
SyncPoint::GetInstance()->DisableProcessing();
|
||||
SyncPoint::GetInstance()->ClearAllCallBacks();
|
||||
|
||||
CloseDBAndBackupEngine();
|
||||
// delete old files in db
|
||||
ASSERT_OK(DestroyDB(dbname_, options_));
|
||||
}
|
||||
|
||||
TEST_F(BackupableDBTest, TableFileWithDbChecksumCorruptedDuringBackup) {
|
||||
const int keys_iteration = 50000;
|
||||
options_.file_checksum_gen_factory = GetFileChecksumGenCrc32cFactory();
|
||||
for (auto& sopt : kAllShareOptions) {
|
||||
// Since the default DB table file checksum is on, we obtain checksums of
|
||||
// table files from the DB manifest before copying and verify it with the
|
||||
// one calculated during copying.
|
||||
// Therefore, we can test whether a corruption has happened during the file
|
||||
// being copied to backup directory.
|
||||
OpenDBAndBackupEngine(true /* destroy_old_data */, false /* dummy */, sopt);
|
||||
|
||||
FillDB(db_.get(), 0, keys_iteration);
|
||||
bool corrupted = false;
|
||||
// corrupt files when copying to the backup directory
|
||||
@ -1304,43 +1334,6 @@ TEST_F(BackupableDBTest, TableFileCorruptedDuringBackup) {
|
||||
}
|
||||
}
|
||||
|
||||
TEST_P(BackupableDBTestWithParam,
|
||||
TableFileCorruptedDuringBackupWithDefaultDbChecksum) {
|
||||
const int keys_iteration = 100000;
|
||||
|
||||
options_.file_checksum_gen_factory = GetFileChecksumGenCrc32cFactory();
|
||||
OpenDBAndBackupEngine(true /* destroy_old_data */);
|
||||
FillDB(db_.get(), 0, keys_iteration);
|
||||
bool corrupted = false;
|
||||
// corrupt files when copying to the backup directory
|
||||
SyncPoint::GetInstance()->SetCallBack(
|
||||
"BackupEngineImpl::CopyOrCreateFile:CorruptionDuringBackup",
|
||||
[&](void* data) {
|
||||
if (data != nullptr) {
|
||||
Slice* d = reinterpret_cast<Slice*>(data);
|
||||
if (!d->empty()) {
|
||||
d->remove_suffix(1);
|
||||
corrupted = true;
|
||||
}
|
||||
}
|
||||
});
|
||||
SyncPoint::GetInstance()->EnableProcessing();
|
||||
Status s = backup_engine_->CreateNewBackup(db_.get());
|
||||
if (corrupted) {
|
||||
ASSERT_NOK(s);
|
||||
} else {
|
||||
// should not in this path in normal cases
|
||||
ASSERT_OK(s);
|
||||
}
|
||||
|
||||
SyncPoint::GetInstance()->DisableProcessing();
|
||||
SyncPoint::GetInstance()->ClearAllCallBacks();
|
||||
|
||||
CloseDBAndBackupEngine();
|
||||
// delete old files in db
|
||||
ASSERT_OK(DestroyDB(dbname_, options_));
|
||||
}
|
||||
|
||||
TEST_F(BackupableDBTest, InterruptCreationTest) {
|
||||
// Interrupt backup creation by failing new writes and failing cleanup of the
|
||||
// partial state. Then verify a subsequent backup can still succeed.
|
||||
@ -1575,75 +1568,88 @@ TEST_F(BackupableDBTest, ShareTableFilesWithChecksumsTransition) {
|
||||
}
|
||||
|
||||
// Verify backup and restore with share_files_with_checksum on and
|
||||
// share_files_with_checksum_naming = kChecksumAndDbSessionId
|
||||
// share_files_with_checksum_naming = kOptionalChecksumAndDbSessionId
|
||||
TEST_F(BackupableDBTest, ShareTableFilesWithChecksumsNewNaming) {
|
||||
// Use session id in the name of SST file backup
|
||||
// Use session id in the name of SST files
|
||||
ASSERT_TRUE(backupable_options_->share_files_with_checksum_naming ==
|
||||
kChecksumAndDbSessionId);
|
||||
kOptionalChecksumAndDbSessionId);
|
||||
const int keys_iteration = 5000;
|
||||
OpenDBAndBackupEngine(true, false, kShareWithChecksum);
|
||||
for (int i = 0; i < 5; ++i) {
|
||||
FillDB(db_.get(), keys_iteration * i, keys_iteration * (i + 1));
|
||||
ASSERT_OK(backup_engine_->CreateNewBackup(db_.get(), !!(i % 2)));
|
||||
}
|
||||
CloseDBAndBackupEngine();
|
||||
int i = 0;
|
||||
|
||||
for (int i = 0; i < 5; ++i) {
|
||||
AssertBackupConsistency(i + 1, 0, keys_iteration * (i + 1),
|
||||
keys_iteration * 6);
|
||||
}
|
||||
OpenDBAndBackupEngine(true, false, kShareWithChecksum);
|
||||
FillDB(db_.get(), keys_iteration * i, keys_iteration * (i + 1));
|
||||
ASSERT_OK(backup_engine_->CreateNewBackup(db_.get(), !!(i % 2)));
|
||||
CloseDBAndBackupEngine();
|
||||
AssertBackupConsistency(i + 1, 0, keys_iteration * (i + 1),
|
||||
keys_iteration * (i + 2));
|
||||
|
||||
// Both checksum and session id in the name of SST files
|
||||
options_.file_checksum_gen_factory = GetFileChecksumGenCrc32cFactory();
|
||||
OpenDBAndBackupEngine(false, false, kShareWithChecksum);
|
||||
FillDB(db_.get(), keys_iteration * i, keys_iteration * (i + 1));
|
||||
ASSERT_OK(backup_engine_->CreateNewBackup(db_.get(), !!(i % 2)));
|
||||
CloseDBAndBackupEngine();
|
||||
AssertBackupConsistency(i + 1, 0, keys_iteration * (i + 1),
|
||||
keys_iteration * (i + 2));
|
||||
}
|
||||
|
||||
// Verify backup and restore with share_files_with_checksum off and then
|
||||
// transition this option and share_files_with_checksum_naming to be
|
||||
// kChecksumAndDbSessionId
|
||||
// transition this option to on and share_files_with_checksum_naming to be
|
||||
// kOptionalChecksumAndDbSessionId
|
||||
TEST_F(BackupableDBTest, ShareTableFilesWithChecksumsNewNamingTransition) {
|
||||
const int keys_iteration = 5000;
|
||||
// We may set share_files_with_checksum_naming to kChecksumAndFileSize
|
||||
// here but even if we don't, it should have no effect when
|
||||
// share_files_with_checksum is false
|
||||
ASSERT_TRUE(backupable_options_->share_files_with_checksum_naming ==
|
||||
kChecksumAndDbSessionId);
|
||||
kOptionalChecksumAndDbSessionId);
|
||||
// set share_files_with_checksum to false
|
||||
OpenDBAndBackupEngine(true, false, kShareNoChecksum);
|
||||
for (int i = 0; i < 5; ++i) {
|
||||
int j = 3;
|
||||
for (int i = 0; i < j; ++i) {
|
||||
FillDB(db_.get(), keys_iteration * i, keys_iteration * (i + 1));
|
||||
ASSERT_OK(backup_engine_->CreateNewBackup(db_.get(), true));
|
||||
}
|
||||
CloseDBAndBackupEngine();
|
||||
|
||||
for (int i = 0; i < 5; ++i) {
|
||||
for (int i = 0; i < j; ++i) {
|
||||
AssertBackupConsistency(i + 1, 0, keys_iteration * (i + 1),
|
||||
keys_iteration * 6);
|
||||
keys_iteration * (j + 1));
|
||||
}
|
||||
|
||||
// set share_files_with_checksum to true and do some more backups
|
||||
// and use session id in the name of SST file backup
|
||||
ASSERT_TRUE(backupable_options_->share_files_with_checksum_naming ==
|
||||
kChecksumAndDbSessionId);
|
||||
kOptionalChecksumAndDbSessionId);
|
||||
OpenDBAndBackupEngine(false /* destroy_old_data */, false,
|
||||
kShareWithChecksum);
|
||||
for (int i = 5; i < 10; ++i) {
|
||||
FillDB(db_.get(), keys_iteration * i, keys_iteration * (i + 1));
|
||||
ASSERT_OK(backup_engine_->CreateNewBackup(db_.get(), true));
|
||||
}
|
||||
FillDB(db_.get(), keys_iteration * j, keys_iteration * (j + 1));
|
||||
ASSERT_OK(backup_engine_->CreateNewBackup(db_.get(), true));
|
||||
CloseDBAndBackupEngine();
|
||||
// Use checksum in the name as well
|
||||
++j;
|
||||
options_.file_checksum_gen_factory = GetFileChecksumGenCrc32cFactory();
|
||||
OpenDBAndBackupEngine(false /* destroy_old_data */, false,
|
||||
kShareWithChecksum);
|
||||
FillDB(db_.get(), keys_iteration * j, keys_iteration * (j + 1));
|
||||
ASSERT_OK(backup_engine_->CreateNewBackup(db_.get(), true));
|
||||
CloseDBAndBackupEngine();
|
||||
|
||||
// Verify first (about to delete)
|
||||
AssertBackupConsistency(1, 0, keys_iteration, keys_iteration * 11);
|
||||
AssertBackupConsistency(1, 0, keys_iteration, keys_iteration * (j + 1));
|
||||
|
||||
// For an extra challenge, make sure that GarbageCollect / DeleteBackup
|
||||
// is OK even if we open without share_table_files but with
|
||||
// share_files_with_checksum_naming being kChecksumAndDbSessionId
|
||||
// share_files_with_checksum_naming being kOptionalChecksumAndDbSessionId
|
||||
ASSERT_TRUE(backupable_options_->share_files_with_checksum_naming ==
|
||||
kChecksumAndDbSessionId);
|
||||
kOptionalChecksumAndDbSessionId);
|
||||
OpenDBAndBackupEngine(false /* destroy_old_data */, false, kNoShare);
|
||||
backup_engine_->DeleteBackup(1);
|
||||
backup_engine_->GarbageCollect();
|
||||
CloseDBAndBackupEngine();
|
||||
|
||||
// Verify second (about to delete)
|
||||
AssertBackupConsistency(2, 0, keys_iteration * 2, keys_iteration * 11);
|
||||
AssertBackupConsistency(2, 0, keys_iteration * 2, keys_iteration * (j + 1));
|
||||
|
||||
// Use checksum and file size for backup table file names and open without
|
||||
// share_table_files
|
||||
@ -1655,42 +1661,49 @@ TEST_F(BackupableDBTest, ShareTableFilesWithChecksumsNewNamingTransition) {
|
||||
CloseDBAndBackupEngine();
|
||||
|
||||
// Verify rest (not deleted)
|
||||
for (int i = 1; i < 9; ++i) {
|
||||
AssertBackupConsistency(i + 2, 0, keys_iteration * (i + 2),
|
||||
keys_iteration * 11);
|
||||
for (int i = 2; i < j; ++i) {
|
||||
AssertBackupConsistency(i + 1, 0, keys_iteration * (i + 1),
|
||||
keys_iteration * (j + 1));
|
||||
}
|
||||
}
|
||||
|
||||
// Verify backup and restore with share_files_with_checksum on and transition
|
||||
// from kChecksumAndFileSize to kChecksumAndDbSessionId
|
||||
// from kChecksumAndFileSize to kOptionalChecksumAndDbSessionId
|
||||
TEST_F(BackupableDBTest, ShareTableFilesWithChecksumsNewNamingUpgrade) {
|
||||
backupable_options_->share_files_with_checksum_naming = kChecksumAndFileSize;
|
||||
const int keys_iteration = 5000;
|
||||
// set share_files_with_checksum to true
|
||||
OpenDBAndBackupEngine(true, false, kShareWithChecksum);
|
||||
for (int i = 0; i < 5; ++i) {
|
||||
int j = 3;
|
||||
for (int i = 0; i < j; ++i) {
|
||||
FillDB(db_.get(), keys_iteration * i, keys_iteration * (i + 1));
|
||||
ASSERT_OK(backup_engine_->CreateNewBackup(db_.get(), true));
|
||||
}
|
||||
CloseDBAndBackupEngine();
|
||||
|
||||
for (int i = 0; i < 5; ++i) {
|
||||
for (int i = 0; i < j; ++i) {
|
||||
AssertBackupConsistency(i + 1, 0, keys_iteration * (i + 1),
|
||||
keys_iteration * 6);
|
||||
keys_iteration * (j + 1));
|
||||
}
|
||||
|
||||
backupable_options_->share_files_with_checksum_naming =
|
||||
kChecksumAndDbSessionId;
|
||||
kOptionalChecksumAndDbSessionId;
|
||||
OpenDBAndBackupEngine(false /* destroy_old_data */, false,
|
||||
kShareWithChecksum);
|
||||
for (int i = 5; i < 10; ++i) {
|
||||
FillDB(db_.get(), keys_iteration * i, keys_iteration * (i + 1));
|
||||
ASSERT_OK(backup_engine_->CreateNewBackup(db_.get(), true));
|
||||
}
|
||||
FillDB(db_.get(), keys_iteration * j, keys_iteration * (j + 1));
|
||||
ASSERT_OK(backup_engine_->CreateNewBackup(db_.get(), true));
|
||||
CloseDBAndBackupEngine();
|
||||
|
||||
++j;
|
||||
options_.file_checksum_gen_factory = GetFileChecksumGenCrc32cFactory();
|
||||
OpenDBAndBackupEngine(false /* destroy_old_data */, false,
|
||||
kShareWithChecksum);
|
||||
FillDB(db_.get(), keys_iteration * j, keys_iteration * (j + 1));
|
||||
ASSERT_OK(backup_engine_->CreateNewBackup(db_.get(), true));
|
||||
CloseDBAndBackupEngine();
|
||||
|
||||
// Verify first (about to delete)
|
||||
AssertBackupConsistency(1, 0, keys_iteration, keys_iteration * 11);
|
||||
AssertBackupConsistency(1, 0, keys_iteration, keys_iteration * (j + 1));
|
||||
|
||||
// For an extra challenge, make sure that GarbageCollect / DeleteBackup
|
||||
// is OK even if we open without share_table_files
|
||||
@ -1700,7 +1713,7 @@ TEST_F(BackupableDBTest, ShareTableFilesWithChecksumsNewNamingUpgrade) {
|
||||
CloseDBAndBackupEngine();
|
||||
|
||||
// Verify second (about to delete)
|
||||
AssertBackupConsistency(2, 0, keys_iteration * 2, keys_iteration * 11);
|
||||
AssertBackupConsistency(2, 0, keys_iteration * 2, keys_iteration * (j + 1));
|
||||
|
||||
// Use checksum and file size for backup table file names and open without
|
||||
// share_table_files
|
||||
@ -1712,9 +1725,9 @@ TEST_F(BackupableDBTest, ShareTableFilesWithChecksumsNewNamingUpgrade) {
|
||||
CloseDBAndBackupEngine();
|
||||
|
||||
// Verify rest (not deleted)
|
||||
for (int i = 2; i < 10; ++i) {
|
||||
for (int i = 2; i < j; ++i) {
|
||||
AssertBackupConsistency(i + 1, 0, keys_iteration * (i + 1),
|
||||
keys_iteration * 11);
|
||||
keys_iteration * (j + 1));
|
||||
}
|
||||
}
|
||||
|
||||
@ -2259,7 +2272,7 @@ TEST_P(BackupableDBTestWithParam, BackupUsingDirectIO) {
|
||||
|
||||
// Verify backup engine always opened files with direct I/O
|
||||
ASSERT_EQ(0, test_db_env_->num_writers());
|
||||
ASSERT_GT(test_db_env_->num_direct_rand_readers(), 0);
|
||||
ASSERT_GE(test_db_env_->num_direct_rand_readers(), 0);
|
||||
ASSERT_GT(test_db_env_->num_direct_seq_readers(), 0);
|
||||
// Currently the DB doesn't support reading WALs or manifest with direct
|
||||
// I/O, so subtract two.
|
||||
|
Loading…
Reference in New Issue
Block a user