2016-02-10 00:12:00 +01:00
|
|
|
// Copyright (c) 2011-present, Facebook, Inc. All rights reserved.
|
2013-10-16 23:59:46 +02:00
|
|
|
// This source code is licensed under the BSD-style license found in the
|
|
|
|
// LICENSE file in the root directory of this source tree. An additional grant
|
|
|
|
// of patent rights can be found in the PATENTS file in the same directory.
|
|
|
|
//
|
2011-03-18 23:37:00 +01:00
|
|
|
// Copyright (c) 2011 The LevelDB Authors. All rights reserved.
|
|
|
|
// Use of this source code is governed by a BSD-style license that can be
|
|
|
|
// found in the LICENSE file. See the AUTHORS file for names of contributors.
|
|
|
|
|
|
|
|
#include "table/two_level_iterator.h"
|
|
|
|
|
2016-04-26 21:41:07 +02:00
|
|
|
#include "db/pinned_iterators_manager.h"
|
2013-10-29 01:54:09 +01:00
|
|
|
#include "rocksdb/options.h"
|
|
|
|
#include "rocksdb/table.h"
|
2011-03-18 23:37:00 +01:00
|
|
|
#include "table/block.h"
|
|
|
|
#include "table/format.h"
|
In DB::NewIterator(), try to allocate the whole iterator tree in an arena
Summary:
In this patch, try to allocate the whole iterator tree starting from DBIter from an arena
1. ArenaWrappedDBIter is created when serves as the entry point of an iterator tree, with an arena in it.
2. Add an option to create iterator from arena for following iterators: DBIter, MergingIterator, MemtableIterator, all mem table's iterators, all table reader's iterators and two level iterator.
3. MergeIteratorBuilder is created to incrementally build the tree of internal iterators. It is passed to mem table list and version set and add iterators to it.
Limitations:
(1) Only DB::NewIterator() without tailing uses the arena. Other cases, including readonly DB and compactions are still from malloc
(2) Two level iterator itself is allocated in arena, but not iterators inside it.
Test Plan: make all check
Reviewers: ljin, haobo
Reviewed By: haobo
Subscribers: leveldb, dhruba, yhchiang, igor
Differential Revision: https://reviews.facebook.net/D18513
2014-06-03 01:38:00 +02:00
|
|
|
#include "util/arena.h"
|
2011-03-18 23:37:00 +01:00
|
|
|
|
2013-10-04 06:49:15 +02:00
|
|
|
namespace rocksdb {
|
2011-03-18 23:37:00 +01:00
|
|
|
|
|
|
|
namespace {
|
|
|
|
|
2015-10-13 00:06:38 +02:00
|
|
|
class TwoLevelIterator : public InternalIterator {
|
2011-03-18 23:37:00 +01:00
|
|
|
public:
|
2014-04-25 21:22:23 +02:00
|
|
|
explicit TwoLevelIterator(TwoLevelIteratorState* state,
|
2015-10-13 00:06:38 +02:00
|
|
|
InternalIterator* first_level_iter,
|
2015-06-30 23:15:24 +02:00
|
|
|
bool need_free_iter_and_state);
|
2011-03-18 23:37:00 +01:00
|
|
|
|
In DB::NewIterator(), try to allocate the whole iterator tree in an arena
Summary:
In this patch, try to allocate the whole iterator tree starting from DBIter from an arena
1. ArenaWrappedDBIter is created when serves as the entry point of an iterator tree, with an arena in it.
2. Add an option to create iterator from arena for following iterators: DBIter, MergingIterator, MemtableIterator, all mem table's iterators, all table reader's iterators and two level iterator.
3. MergeIteratorBuilder is created to incrementally build the tree of internal iterators. It is passed to mem table list and version set and add iterators to it.
Limitations:
(1) Only DB::NewIterator() without tailing uses the arena. Other cases, including readonly DB and compactions are still from malloc
(2) Two level iterator itself is allocated in arena, but not iterators inside it.
Test Plan: make all check
Reviewers: ljin, haobo
Reviewed By: haobo
Subscribers: leveldb, dhruba, yhchiang, igor
Differential Revision: https://reviews.facebook.net/D18513
2014-06-03 01:38:00 +02:00
|
|
|
virtual ~TwoLevelIterator() {
|
2016-04-26 21:41:07 +02:00
|
|
|
// Assert that the TwoLevelIterator is never deleted while Pinning is
|
|
|
|
// Enabled.
|
|
|
|
assert(!pinned_iters_mgr_ ||
|
|
|
|
(pinned_iters_mgr_ && !pinned_iters_mgr_->PinningEnabled()));
|
2015-06-30 23:15:24 +02:00
|
|
|
first_level_iter_.DeleteIter(!need_free_iter_and_state_);
|
In DB::NewIterator(), try to allocate the whole iterator tree in an arena
Summary:
In this patch, try to allocate the whole iterator tree starting from DBIter from an arena
1. ArenaWrappedDBIter is created when serves as the entry point of an iterator tree, with an arena in it.
2. Add an option to create iterator from arena for following iterators: DBIter, MergingIterator, MemtableIterator, all mem table's iterators, all table reader's iterators and two level iterator.
3. MergeIteratorBuilder is created to incrementally build the tree of internal iterators. It is passed to mem table list and version set and add iterators to it.
Limitations:
(1) Only DB::NewIterator() without tailing uses the arena. Other cases, including readonly DB and compactions are still from malloc
(2) Two level iterator itself is allocated in arena, but not iterators inside it.
Test Plan: make all check
Reviewers: ljin, haobo
Reviewed By: haobo
Subscribers: leveldb, dhruba, yhchiang, igor
Differential Revision: https://reviews.facebook.net/D18513
2014-06-03 01:38:00 +02:00
|
|
|
second_level_iter_.DeleteIter(false);
|
2015-06-30 23:15:24 +02:00
|
|
|
if (need_free_iter_and_state_) {
|
|
|
|
delete state_;
|
|
|
|
} else {
|
|
|
|
state_->~TwoLevelIteratorState();
|
|
|
|
}
|
In DB::NewIterator(), try to allocate the whole iterator tree in an arena
Summary:
In this patch, try to allocate the whole iterator tree starting from DBIter from an arena
1. ArenaWrappedDBIter is created when serves as the entry point of an iterator tree, with an arena in it.
2. Add an option to create iterator from arena for following iterators: DBIter, MergingIterator, MemtableIterator, all mem table's iterators, all table reader's iterators and two level iterator.
3. MergeIteratorBuilder is created to incrementally build the tree of internal iterators. It is passed to mem table list and version set and add iterators to it.
Limitations:
(1) Only DB::NewIterator() without tailing uses the arena. Other cases, including readonly DB and compactions are still from malloc
(2) Two level iterator itself is allocated in arena, but not iterators inside it.
Test Plan: make all check
Reviewers: ljin, haobo
Reviewed By: haobo
Subscribers: leveldb, dhruba, yhchiang, igor
Differential Revision: https://reviews.facebook.net/D18513
2014-06-03 01:38:00 +02:00
|
|
|
}
|
2011-03-18 23:37:00 +01:00
|
|
|
|
2015-02-26 20:28:41 +01:00
|
|
|
virtual void Seek(const Slice& target) override;
|
|
|
|
virtual void SeekToFirst() override;
|
|
|
|
virtual void SeekToLast() override;
|
|
|
|
virtual void Next() override;
|
|
|
|
virtual void Prev() override;
|
|
|
|
|
|
|
|
virtual bool Valid() const override { return second_level_iter_.Valid(); }
|
|
|
|
virtual Slice key() const override {
|
2011-03-18 23:37:00 +01:00
|
|
|
assert(Valid());
|
2014-04-25 21:22:23 +02:00
|
|
|
return second_level_iter_.key();
|
2011-03-18 23:37:00 +01:00
|
|
|
}
|
2015-02-26 20:28:41 +01:00
|
|
|
virtual Slice value() const override {
|
2011-03-18 23:37:00 +01:00
|
|
|
assert(Valid());
|
2014-04-25 21:22:23 +02:00
|
|
|
return second_level_iter_.value();
|
2011-03-18 23:37:00 +01:00
|
|
|
}
|
2015-02-26 20:28:41 +01:00
|
|
|
virtual Status status() const override {
|
2011-03-18 23:37:00 +01:00
|
|
|
// It'd be nice if status() returned a const Status& instead of a Status
|
2014-04-25 21:22:23 +02:00
|
|
|
if (!first_level_iter_.status().ok()) {
|
|
|
|
return first_level_iter_.status();
|
|
|
|
} else if (second_level_iter_.iter() != nullptr &&
|
|
|
|
!second_level_iter_.status().ok()) {
|
|
|
|
return second_level_iter_.status();
|
2011-03-18 23:37:00 +01:00
|
|
|
} else {
|
|
|
|
return status_;
|
|
|
|
}
|
|
|
|
}
|
2016-04-26 21:41:07 +02:00
|
|
|
virtual void SetPinnedItersMgr(
|
|
|
|
PinnedIteratorsManager* pinned_iters_mgr) override {
|
|
|
|
pinned_iters_mgr_ = pinned_iters_mgr;
|
|
|
|
first_level_iter_.SetPinnedItersMgr(pinned_iters_mgr);
|
|
|
|
if (second_level_iter_.iter()) {
|
|
|
|
second_level_iter_.SetPinnedItersMgr(pinned_iters_mgr);
|
|
|
|
}
|
2015-12-16 21:08:30 +01:00
|
|
|
}
|
|
|
|
virtual bool IsKeyPinned() const override {
|
2016-04-26 21:41:07 +02:00
|
|
|
return pinned_iters_mgr_ && pinned_iters_mgr_->PinningEnabled() &&
|
|
|
|
second_level_iter_.iter() && second_level_iter_.IsKeyPinned();
|
2015-12-16 21:08:30 +01:00
|
|
|
}
|
2011-03-18 23:37:00 +01:00
|
|
|
|
|
|
|
private:
|
|
|
|
void SaveError(const Status& s) {
|
|
|
|
if (status_.ok() && !s.ok()) status_ = s;
|
|
|
|
}
|
|
|
|
void SkipEmptyDataBlocksForward();
|
|
|
|
void SkipEmptyDataBlocksBackward();
|
2015-10-13 00:06:38 +02:00
|
|
|
void SetSecondLevelIterator(InternalIterator* iter);
|
2011-03-18 23:37:00 +01:00
|
|
|
void InitDataBlock();
|
|
|
|
|
2015-06-30 23:15:24 +02:00
|
|
|
TwoLevelIteratorState* state_;
|
2014-04-25 21:22:23 +02:00
|
|
|
IteratorWrapper first_level_iter_;
|
|
|
|
IteratorWrapper second_level_iter_; // May be nullptr
|
2015-06-30 23:15:24 +02:00
|
|
|
bool need_free_iter_and_state_;
|
2016-04-26 21:41:07 +02:00
|
|
|
PinnedIteratorsManager* pinned_iters_mgr_;
|
2011-03-18 23:37:00 +01:00
|
|
|
Status status_;
|
2014-04-25 21:22:23 +02:00
|
|
|
// If second_level_iter is non-nullptr, then "data_block_handle_" holds the
|
|
|
|
// "index_value" passed to block_function_ to create the second_level_iter.
|
2011-03-18 23:37:00 +01:00
|
|
|
std::string data_block_handle_;
|
|
|
|
};
|
|
|
|
|
2014-04-25 21:22:23 +02:00
|
|
|
TwoLevelIterator::TwoLevelIterator(TwoLevelIteratorState* state,
|
2015-10-13 00:06:38 +02:00
|
|
|
InternalIterator* first_level_iter,
|
2015-06-30 23:15:24 +02:00
|
|
|
bool need_free_iter_and_state)
|
|
|
|
: state_(state),
|
|
|
|
first_level_iter_(first_level_iter),
|
2016-04-26 21:41:07 +02:00
|
|
|
need_free_iter_and_state_(need_free_iter_and_state),
|
|
|
|
pinned_iters_mgr_(nullptr) {}
|
2011-03-18 23:37:00 +01:00
|
|
|
|
|
|
|
void TwoLevelIterator::Seek(const Slice& target) {
|
2014-04-25 21:23:07 +02:00
|
|
|
if (state_->check_prefix_may_match &&
|
|
|
|
!state_->PrefixMayMatch(target)) {
|
2014-04-25 21:22:23 +02:00
|
|
|
SetSecondLevelIterator(nullptr);
|
2014-04-25 21:23:07 +02:00
|
|
|
return;
|
|
|
|
}
|
|
|
|
first_level_iter_.Seek(target);
|
|
|
|
|
|
|
|
InitDataBlock();
|
|
|
|
if (second_level_iter_.iter() != nullptr) {
|
|
|
|
second_level_iter_.Seek(target);
|
2014-04-25 21:22:23 +02:00
|
|
|
}
|
2014-04-25 21:23:07 +02:00
|
|
|
SkipEmptyDataBlocksForward();
|
2011-03-18 23:37:00 +01:00
|
|
|
}
|
|
|
|
|
|
|
|
void TwoLevelIterator::SeekToFirst() {
|
2014-04-25 21:22:23 +02:00
|
|
|
first_level_iter_.SeekToFirst();
|
2011-03-18 23:37:00 +01:00
|
|
|
InitDataBlock();
|
2014-04-25 21:22:23 +02:00
|
|
|
if (second_level_iter_.iter() != nullptr) {
|
|
|
|
second_level_iter_.SeekToFirst();
|
|
|
|
}
|
2011-03-18 23:37:00 +01:00
|
|
|
SkipEmptyDataBlocksForward();
|
|
|
|
}
|
|
|
|
|
|
|
|
void TwoLevelIterator::SeekToLast() {
|
2014-04-25 21:22:23 +02:00
|
|
|
first_level_iter_.SeekToLast();
|
2011-03-18 23:37:00 +01:00
|
|
|
InitDataBlock();
|
2014-04-25 21:22:23 +02:00
|
|
|
if (second_level_iter_.iter() != nullptr) {
|
|
|
|
second_level_iter_.SeekToLast();
|
|
|
|
}
|
2011-03-18 23:37:00 +01:00
|
|
|
SkipEmptyDataBlocksBackward();
|
|
|
|
}
|
|
|
|
|
|
|
|
void TwoLevelIterator::Next() {
|
|
|
|
assert(Valid());
|
2014-04-25 21:22:23 +02:00
|
|
|
second_level_iter_.Next();
|
2011-03-18 23:37:00 +01:00
|
|
|
SkipEmptyDataBlocksForward();
|
|
|
|
}
|
|
|
|
|
|
|
|
void TwoLevelIterator::Prev() {
|
|
|
|
assert(Valid());
|
2014-04-25 21:22:23 +02:00
|
|
|
second_level_iter_.Prev();
|
2011-03-18 23:37:00 +01:00
|
|
|
SkipEmptyDataBlocksBackward();
|
|
|
|
}
|
|
|
|
|
|
|
|
|
|
|
|
void TwoLevelIterator::SkipEmptyDataBlocksForward() {
|
2014-04-25 21:22:23 +02:00
|
|
|
while (second_level_iter_.iter() == nullptr ||
|
|
|
|
(!second_level_iter_.Valid() &&
|
|
|
|
!second_level_iter_.status().IsIncomplete())) {
|
2011-03-18 23:37:00 +01:00
|
|
|
// Move to next block
|
2014-04-25 21:22:23 +02:00
|
|
|
if (!first_level_iter_.Valid()) {
|
|
|
|
SetSecondLevelIterator(nullptr);
|
2011-03-18 23:37:00 +01:00
|
|
|
return;
|
|
|
|
}
|
2014-04-25 21:22:23 +02:00
|
|
|
first_level_iter_.Next();
|
2011-03-18 23:37:00 +01:00
|
|
|
InitDataBlock();
|
2014-04-25 21:22:23 +02:00
|
|
|
if (second_level_iter_.iter() != nullptr) {
|
|
|
|
second_level_iter_.SeekToFirst();
|
|
|
|
}
|
2011-03-18 23:37:00 +01:00
|
|
|
}
|
|
|
|
}
|
|
|
|
|
|
|
|
void TwoLevelIterator::SkipEmptyDataBlocksBackward() {
|
2014-04-25 21:22:23 +02:00
|
|
|
while (second_level_iter_.iter() == nullptr ||
|
|
|
|
(!second_level_iter_.Valid() &&
|
|
|
|
!second_level_iter_.status().IsIncomplete())) {
|
2011-03-18 23:37:00 +01:00
|
|
|
// Move to next block
|
2014-04-25 21:22:23 +02:00
|
|
|
if (!first_level_iter_.Valid()) {
|
|
|
|
SetSecondLevelIterator(nullptr);
|
2011-03-18 23:37:00 +01:00
|
|
|
return;
|
|
|
|
}
|
2014-04-25 21:22:23 +02:00
|
|
|
first_level_iter_.Prev();
|
2011-03-18 23:37:00 +01:00
|
|
|
InitDataBlock();
|
2014-04-25 21:22:23 +02:00
|
|
|
if (second_level_iter_.iter() != nullptr) {
|
|
|
|
second_level_iter_.SeekToLast();
|
|
|
|
}
|
2011-03-18 23:37:00 +01:00
|
|
|
}
|
|
|
|
}
|
|
|
|
|
2015-10-13 00:06:38 +02:00
|
|
|
void TwoLevelIterator::SetSecondLevelIterator(InternalIterator* iter) {
|
2014-04-25 21:22:23 +02:00
|
|
|
if (second_level_iter_.iter() != nullptr) {
|
|
|
|
SaveError(second_level_iter_.status());
|
|
|
|
}
|
2016-04-26 21:41:07 +02:00
|
|
|
|
|
|
|
if (pinned_iters_mgr_ && iter) {
|
|
|
|
iter->SetPinnedItersMgr(pinned_iters_mgr_);
|
|
|
|
}
|
|
|
|
|
|
|
|
InternalIterator* old_iter = second_level_iter_.Set(iter);
|
|
|
|
if (pinned_iters_mgr_ && pinned_iters_mgr_->PinningEnabled()) {
|
|
|
|
pinned_iters_mgr_->PinIteratorIfNeeded(old_iter);
|
|
|
|
} else {
|
|
|
|
delete old_iter;
|
|
|
|
}
|
2011-03-18 23:37:00 +01:00
|
|
|
}
|
|
|
|
|
|
|
|
void TwoLevelIterator::InitDataBlock() {
|
2014-04-25 21:22:23 +02:00
|
|
|
if (!first_level_iter_.Valid()) {
|
|
|
|
SetSecondLevelIterator(nullptr);
|
2011-03-18 23:37:00 +01:00
|
|
|
} else {
|
2014-04-25 21:22:23 +02:00
|
|
|
Slice handle = first_level_iter_.value();
|
2014-08-29 23:32:37 +02:00
|
|
|
if (second_level_iter_.iter() != nullptr &&
|
|
|
|
!second_level_iter_.status().IsIncomplete() &&
|
|
|
|
handle.compare(data_block_handle_) == 0) {
|
2014-04-25 21:22:23 +02:00
|
|
|
// second_level_iter is already constructed with this iterator, so
|
2011-03-18 23:37:00 +01:00
|
|
|
// no need to change anything
|
|
|
|
} else {
|
2015-10-13 00:06:38 +02:00
|
|
|
InternalIterator* iter = state_->NewSecondaryIterator(handle);
|
2011-03-18 23:37:00 +01:00
|
|
|
data_block_handle_.assign(handle.data(), handle.size());
|
2014-04-25 21:22:23 +02:00
|
|
|
SetSecondLevelIterator(iter);
|
2011-03-18 23:37:00 +01:00
|
|
|
}
|
|
|
|
}
|
|
|
|
}
|
|
|
|
|
2011-10-31 18:22:06 +01:00
|
|
|
} // namespace
|
2011-03-18 23:37:00 +01:00
|
|
|
|
2015-10-13 00:06:38 +02:00
|
|
|
InternalIterator* NewTwoLevelIterator(TwoLevelIteratorState* state,
|
|
|
|
InternalIterator* first_level_iter,
|
|
|
|
Arena* arena,
|
|
|
|
bool need_free_iter_and_state) {
|
In DB::NewIterator(), try to allocate the whole iterator tree in an arena
Summary:
In this patch, try to allocate the whole iterator tree starting from DBIter from an arena
1. ArenaWrappedDBIter is created when serves as the entry point of an iterator tree, with an arena in it.
2. Add an option to create iterator from arena for following iterators: DBIter, MergingIterator, MemtableIterator, all mem table's iterators, all table reader's iterators and two level iterator.
3. MergeIteratorBuilder is created to incrementally build the tree of internal iterators. It is passed to mem table list and version set and add iterators to it.
Limitations:
(1) Only DB::NewIterator() without tailing uses the arena. Other cases, including readonly DB and compactions are still from malloc
(2) Two level iterator itself is allocated in arena, but not iterators inside it.
Test Plan: make all check
Reviewers: ljin, haobo
Reviewed By: haobo
Subscribers: leveldb, dhruba, yhchiang, igor
Differential Revision: https://reviews.facebook.net/D18513
2014-06-03 01:38:00 +02:00
|
|
|
if (arena == nullptr) {
|
2015-06-30 23:15:24 +02:00
|
|
|
return new TwoLevelIterator(state, first_level_iter,
|
|
|
|
need_free_iter_and_state);
|
In DB::NewIterator(), try to allocate the whole iterator tree in an arena
Summary:
In this patch, try to allocate the whole iterator tree starting from DBIter from an arena
1. ArenaWrappedDBIter is created when serves as the entry point of an iterator tree, with an arena in it.
2. Add an option to create iterator from arena for following iterators: DBIter, MergingIterator, MemtableIterator, all mem table's iterators, all table reader's iterators and two level iterator.
3. MergeIteratorBuilder is created to incrementally build the tree of internal iterators. It is passed to mem table list and version set and add iterators to it.
Limitations:
(1) Only DB::NewIterator() without tailing uses the arena. Other cases, including readonly DB and compactions are still from malloc
(2) Two level iterator itself is allocated in arena, but not iterators inside it.
Test Plan: make all check
Reviewers: ljin, haobo
Reviewed By: haobo
Subscribers: leveldb, dhruba, yhchiang, igor
Differential Revision: https://reviews.facebook.net/D18513
2014-06-03 01:38:00 +02:00
|
|
|
} else {
|
|
|
|
auto mem = arena->AllocateAligned(sizeof(TwoLevelIterator));
|
2015-06-30 23:15:24 +02:00
|
|
|
return new (mem)
|
|
|
|
TwoLevelIterator(state, first_level_iter, need_free_iter_and_state);
|
In DB::NewIterator(), try to allocate the whole iterator tree in an arena
Summary:
In this patch, try to allocate the whole iterator tree starting from DBIter from an arena
1. ArenaWrappedDBIter is created when serves as the entry point of an iterator tree, with an arena in it.
2. Add an option to create iterator from arena for following iterators: DBIter, MergingIterator, MemtableIterator, all mem table's iterators, all table reader's iterators and two level iterator.
3. MergeIteratorBuilder is created to incrementally build the tree of internal iterators. It is passed to mem table list and version set and add iterators to it.
Limitations:
(1) Only DB::NewIterator() without tailing uses the arena. Other cases, including readonly DB and compactions are still from malloc
(2) Two level iterator itself is allocated in arena, but not iterators inside it.
Test Plan: make all check
Reviewers: ljin, haobo
Reviewed By: haobo
Subscribers: leveldb, dhruba, yhchiang, igor
Differential Revision: https://reviews.facebook.net/D18513
2014-06-03 01:38:00 +02:00
|
|
|
}
|
2011-03-18 23:37:00 +01:00
|
|
|
}
|
|
|
|
|
2013-10-04 06:49:15 +02:00
|
|
|
} // namespace rocksdb
|