提交 · 898e803fc5cb198d048b7be97f4fbdac04720abb · kvdb / rocksdb

03 6月, 2015 2 次提交

Add a stats counter for DB_WRITE back which was mistakenly removed. · 898e803f

由 Yueh-Hsuan Chiang 提交于 6月 02, 2015

Summary: Add a stats counter for DB_WRITE back which was mistakenly removed.

Test Plan: augment GroupCommitTest

Reviewers: sdong

Reviewed By: sdong

Subscribers: dhruba, leveldb

Differential Revision: https://reviews.facebook.net/D39399

898e803f

Fix Bug: CompactRange() doesn't change to correct level caused by using wrong level · ac81130f

由 sdong 提交于 6月 02, 2015

Summary: In previous change https://reviews.facebook.net/D39099 , while renaming parameters, use a wrong parameter, causing CompactRange() to compact not wrong level.

Test Plan: Run "DBTest.MigrateToDynamicLevelMaxBytesBase" which failed with the patch.

Reviewers: rven, yhchiang, kradhakrishnan, igor, anthony

Subscribers: leveldb, dhruba

Differential Revision: https://reviews.facebook.net/D39393

ac81130f

02 6月, 2015 5 次提交

more times in perf_context and iostats_context · ec7a9443

由 Mike Kolupaev 提交于 6月 02, 2015

Summary:
We occasionally get write stalls (>1s Write() calls) on HDD under read load. The following timers explain almost all of the stalls:
 - perf_context.db_mutex_lock_nanos
 - perf_context.db_condition_wait_nanos
 - iostats_context.open_time
 - iostats_context.allocate_time
 - iostats_context.write_time
 - iostats_context.range_sync_time
 - iostats_context.logger_time

In my experiments each of these occasionally takes >1s on write path under some workload. There are rare cases when Write() takes long but none of these takes long.

Test Plan: Added code to our application to write the listed timings to log for slow writes. They usually add up to almost exactly the time Write() call took.

Reviewers: rven, yhchiang, sdong

Reviewed By: sdong

Subscribers: march, dhruba, tnovak

Differential Revision: https://reviews.facebook.net/D39177

ec7a9443

Allow users to migrate to options.level_compaction_dynamic_level_bytes=true using CompactRange() · 4266d4fd

由 sdong 提交于 5月 28, 2015

Summary: In DB::CompactRange(), change parameter "reduce_level" to "change_level". Users can compact all data to the last level if needed. By doing it, users can migrate the DB to options.level_compaction_dynamic_level_bytes=true.

Test Plan: Add a unit test for it.

Reviewers: yhchiang, anthony, kradhakrishnan, igor, rven

Reviewed By: rven

Subscribers: leveldb, dhruba

Differential Revision: https://reviews.facebook.net/D39099

4266d4fd

Removed DBImpl::notifying_events_ · d333820b

由 Yueh-Hsuan Chiang 提交于 6月 01, 2015

Summary:
DBImpl::notifying_events_ is a internal counter in DBImpl which is
used to prevent DB close when DB is notifying events.  However, as
the current events all rely on either compaction or flush which
already have similar counters to prevent DB close, it is safe to
remove notifying_events_.

Test Plan:
listener_test
examples/compact_files_example

Reviewers: igor, anthony, kradhakrishnan, rven, sdong

Reviewed By: sdong

Subscribers: dhruba, leveldb

Differential Revision: https://reviews.facebook.net/D39315

d333820b

Fixed compile warning in compact_files_example.cc · 495ce601

由 Yueh-Hsuan Chiang 提交于 6月 01, 2015

Summary: Fixed compile warning in compact_files_example.cc

Test Plan: compact_files_example

Reviewers: sdong, igor

Subscribers: dhruba, leveldb

Differential Revision: https://reviews.facebook.net/D39309

495ce601

add rocksdb::WritableFileWrapper similar to rocksdb::EnvWrapper · 2ecac9f9

由 Mike Kolupaev 提交于 6月 01, 2015

Summary: It used to be no good (known to me) non-intrusive way to wrap WritableFile - you can't call protected virtual methods of the wrapped pointer to WritableFile. This diff adds a convenience class WritableFileWrapper that makes wrapping WritableFile both possible and easy.

Test Plan: `make clean; make -j release`, `make clean; OPT=-DROCKSDB_LITE make release`, `make clean; USE_CLANG=1 make -j all`.

Reviewers: sdong, yhchiang, rven

Reviewed By: rven

Subscribers: dhruba, tnovak, march

Differential Revision: https://reviews.facebook.net/D39147

2ecac9f9

01 6月, 2015 1 次提交
- I
  Merge pull request #617 from rdallman/wb-merge-sliceparts · a187e66a
  由 Igor Canadi 提交于 5月 31, 2015
```
WriteBatch.Merge w/ SliceParts support
```
  a187e66a
31 5月, 2015 2 次提交

Fixed db_stress · 16c19762

由 Yueh-Hsuan Chiang 提交于 5月 30, 2015

Summary:
Fixed db_stress by correcting the verification of column family
names in the Listener of db_stress

Test Plan: db_stress

Reviewers: igor, sdong

Subscribers: dhruba, leveldb

Differential Revision: https://reviews.facebook.net/D39255

16c19762

Fix compile on darwin · 4c181f08

由 Igor Canadi 提交于 5月 30, 2015

Summary: As title

Test Plan: make check

Reviewers: anthony

Subscribers: dhruba, leveldb

Differential Revision: https://reviews.facebook.net/D39243

4c181f08

30 5月, 2015 7 次提交

fix LITE build · bc7a7a40

由 agiardullo 提交于 5月 29, 2015

Summary: Broken by optimistic transaction diff.  (I only built 'release' not 'static_lib' when testing).

Test Plan: build

Reviewers: yhchiang, sdong, igor

Reviewed By: igor

Subscribers: dhruba, leveldb

Differential Revision: https://reviews.facebook.net/D39219

bc7a7a40

Fixed a compile warning in db_stress in NDEBUG mode. · 832271f6

由 Yueh-Hsuan Chiang 提交于 5月 29, 2015

Summary: Fixed a compile warning in db_stress in NDEBUG mode.

Test Plan: make OPT=-DNDEBUG db_stress

Reviewers: sdong, anthony

Subscribers: dhruba, leveldb

Differential Revision: https://reviews.facebook.net/D39213

832271f6

Optimistic Transactions · dc9d70de

由 agiardullo 提交于 5月 29, 2015

Summary: Optimistic transactions supporting begin/commit/rollback semantics. Currently relies on checking the memtable to determine if there are any collisions at commit time. Not yet implemented would be a way of enuring the memtable has some minimum amount of history so that we won't fail to commit when the memtable is empty. You should probably start with transaction.h to get an overview of what is currently supported.

Test Plan: Added a new test, but still need to look into stress testing.

Reviewers: yhchiang, igor, rven, sdong

Reviewed By: sdong

Subscribers: adamretter, MarkCallaghan, leveldb, dhruba

Differential Revision: https://reviews.facebook.net/D33435

dc9d70de

Fixed a compile warning in db_stress · d5a0c0e6

由 Yueh-Hsuan Chiang 提交于 5月 29, 2015

Summary:
Fixed the following compile warning in db_stress:
error: 'OnCompactionCompleted' overrides a member function but is not marked 'override' [-Werror,-Winconsistent-missing-override]

Test Plan: make db_stress

Reviewers: sdong, igor, anthony

Subscribers: dhruba, leveldb

Differential Revision: https://reviews.facebook.net/D39207

d5a0c0e6

Fixed a compile error in ROCKSDB_LITE · ebfdb3c7

由 Yueh-Hsuan Chiang 提交于 5月 29, 2015

Summary: Fixed a compile error in ROCKSDB_LITE

Test Plan: make db_stress OPT=-DROCKSDB_LITE -j32

Reviewers: sdong, igor, anthony

Subscribers: dhruba, leveldb

Differential Revision: https://reviews.facebook.net/D39201

ebfdb3c7

Include EventListener in stress test. · 9ffc8ba0

由 Yueh-Hsuan Chiang 提交于 5月 29, 2015

Summary: Include EventListener in stress test.

Test Plan: make blackbox_crash_test whitebox_crash_test

Reviewers: anthony, igor, rven, sdong

Reviewed By: sdong

Subscribers: dhruba, leveldb

Differential Revision: https://reviews.facebook.net/D39105

9ffc8ba0

Decrease number of jobs in make release · a3da5902

由 Igor Canadi 提交于 5月 29, 2015

Summary: as title

Test Plan: make release

Reviewers: MarkCallaghan, sdong

Reviewed By: sdong

Subscribers: sdong, dhruba, leveldb

Differential Revision: https://reviews.facebook.net/D38853

a3da5902

29 5月, 2015 4 次提交

R
WriteBatch.Merge w/ SliceParts support · a0635ba3
由 Reed Allman 提交于 5月 27, 2015
```
also hooked up WriteBatchInternal
```
a0635ba3

Support saving history in memtable_list · c8153510

由 agiardullo 提交于 5月 28, 2015

Summary:
For transactions, we are using the memtables to validate that there are no write conflicts. But after flushing, we don't have any memtables, and transactions could fail to commit. So we want to someone keep around some extra history to use for conflict checking. In addition, we want to provide a way to increase the size of this history if too many transactions fail to commit.

After chatting with people, it seems like everyone prefers just using Memtables to store this history (instead of a separate history structure). It seems like the best place for this is abstracted inside the memtable_list. I decide to create a separate list in MemtableListVersion as using the same list complicated the flush/installalflushresults logic too much.

This diff adds a new parameter to control how much memtable history to keep around after flushing. However, it sounds like people aren't too fond of adding new parameters. So I am making the default size of flushed+not-flushed memtables be set to max_write_buffers. This should not change the maximum amount of memory used, but make it more likely we're using closer the the limit. (We are now postponing deleting flushed memtables until the max_write_buffer limit is reached). So while we might use more memory on average, we are still obeying the limit set (and you could argue it's better to go ahead and use up memory now instead of waiting for a write stall to happen to test this limit).

However, if people are opposed to this default behavior, we can easily set it to 0 and require this parameter be set in order to use transactions.

Test Plan: Added a xfunc test to play around with setting different values of this parameter in all tests. Added testing in memtablelist_test and planning on adding more testing here.

Reviewers: sdong, rven, igor

Reviewed By: igor

Subscribers: dhruba, leveldb

Differential Revision: https://reviews.facebook.net/D37443

c8153510

Rename EventLoggerHelpers EventHelpers · ec4ff4e9

由 Yueh-Hsuan Chiang 提交于 5月 28, 2015

Summary:
Rename EventLoggerHelpers EventHelpers, as it's going to include
all event-related helper functions instead of EventLogger only stuffs.

Test Plan: make

Reviewers: sdong, rven, anthony

Reviewed By: anthony

Subscribers: dhruba, leveldb

Differential Revision: https://reviews.facebook.net/D39093

ec4ff4e9

[API Change] Move listeners from ColumnFamilyOptions to DBOptions · 672dda9b

由 Yueh-Hsuan Chiang 提交于 5月 28, 2015

Summary: Move listeners from ColumnFamilyOptions to DBOptions

Test Plan:
listener_test
compact_files_test

Reviewers: rven, anthony, sdong

Reviewed By: sdong

Subscribers: dhruba, leveldb

Differential Revision: https://reviews.facebook.net/D39087

672dda9b

27 5月, 2015 4 次提交

Compaction now conditionally boosts the size of deletion entries. · 3ab8ffd4

由 Yueh-Hsuan Chiang 提交于 5月 26, 2015

Summary:
Compaction now boosts the size of deletion entries of a file only when
the number of deletion entries is greater than the number of non-deletion
entries in the file.  The motivation here is that in a stable workload,
the number of deletion entries should be roughly equal to the number of
non-deletion entries.  If we compensate the size of deletion entries in a
stable workload, the deletion compensation logic might introduce unwanted
effet which changes the shape of LSM tree.

Test Plan: db_test --gtest_filter="*Deletion*"

Reviewers: sdong, igor

Reviewed By: igor

Subscribers: dhruba, leveldb

Differential Revision: https://reviews.facebook.net/D38703

3ab8ffd4

I
Merge pull request #615 from rdallman/master · a81ac241
由 Igor Canadi 提交于 5月 26, 2015
```
C: add more block based table stuff, some aux slice transform/merge ops
```
a81ac241

Fixed a bug in EventLoggerHelpers::LogTableFileCreation · 6d299b70

由 Yueh-Hsuan Chiang 提交于 5月 26, 2015

Summary:
Fixed a missing "}" at the end of the generated JSON Log
in EventLoggerHelpers::LogTableFileCreation.

Test Plan: db_bench

Reviewers: igor

Reviewed By: igor

Subscribers: dhruba, leveldb

Differential Revision: https://reviews.facebook.net/D38919

6d299b70

Removed an unused private variable in db_impl.h · a0580205

由 Yueh-Hsuan Chiang 提交于 5月 26, 2015

Summary: Removed an unused private variable in db_impl.h

Test Plan: make db_test

Reviewers: sdong, anthony, igor

Reviewed By: igor

Subscribers: dhruba, leveldb

Differential Revision: https://reviews.facebook.net/D38925

a0580205

23 5月, 2015 12 次提交

R

update an import path to fit in with the rest of the kids · 328ad902
由 Reed Allman 提交于 5月 22, 2015

328ad902
R

C: extra bbto / noop slice transform · 9c38ce1d
由 Reed Allman 提交于 5月 22, 2015

9c38ce1d
I
Merge pull request #614 from arschles/docker · 8d26799f
由 Igor Canadi 提交于 5月 22, 2015
```
adding docker build script and dockerfile for tools
```
8d26799f

fix typo in c_simple_example · 32198343

由 agiardullo 提交于 5月 22, 2015

Summary: fix typo

Test Plan: none

Reviewers: tfarina, igor

Reviewed By: tfarina, igor

Subscribers: dhruba, leveldb

Differential Revision: https://reviews.facebook.net/D37347

32198343

A

moving dockerfile to root · 6116ccc2
由 Aaron Schlesinger 提交于 5月 22, 2015

6116ccc2
A

adding docker build script and dockerfile · d90cee9f
由 Aaron Schlesinger 提交于 5月 22, 2015

d90cee9f

Don't skip last level when calculating compaction stats · ea6d3a8a

由 Igor Canadi 提交于 5月 22, 2015

Summary: We have a bug where we don't report the last level's files as being compacted. This fixes it.

Test Plan: See the fix in action here: https://phabricator.fb.com/P19845738

Reviewers: MarkCallaghan, sdong

Reviewed By: sdong

Subscribers: yhchiang, dhruba, leveldb

Differential Revision: https://reviews.facebook.net/D38727

ea6d3a8a

Fixed two bugs on logging file deletion. · 5c224d1b

由 Yueh-Hsuan Chiang 提交于 5月 22, 2015

Summary:
This patch fixes the following two bugs on logging file deletion.

1.  Previously, file deletion failure was only logged in INFO_LEVEL.
    This patch changes it to ERROR_LEVEL and does some code clean.

2.  EventLogger previously will always generate the same log on
    table file deletion even when file deletion is not successful.
    Now the resulting status of file deletion will also be logged.

Test Plan: make all check

Reviewers: sdong, igor

Reviewed By: igor

Subscribers: dhruba, leveldb

Differential Revision: https://reviews.facebook.net/D38817

5c224d1b

Change the log-level of DB summary and options from INFO_LEVEL to WARN_LEVEL · dc81efe4

由 Yueh-Hsuan Chiang 提交于 5月 22, 2015

Summary: Change the log-level of DB summary and options from INFO_LEVEL to WARN_LEVEL

Test Plan:
Use db_bench to verify the log level.

Sample output:
2015/05/22-00:20:39.778064 7fff75b41300 [WARN] RocksDB version: 3.11.0
2015/05/22-00:20:39.778095 7fff75b41300 [WARN] Git sha rocksdb_build_git_sha:7fee8775
2015/05/22-00:20:39.778099 7fff75b41300 [WARN] Compile date May 22 2015
2015/05/22-00:20:39.778101 7fff75b41300 [WARN] DB SUMMARY
2015/05/22-00:20:39.778145 7fff75b41300 [WARN] SST files in /tmp/rocksdbtest-691931916/dbbench dir, Total Num: 0, files:
2015/05/22-00:20:39.778148 7fff75b41300 [WARN] Write Ahead Log file in /tmp/rocksdbtest-691931916/dbbench:
2015/05/22-00:20:39.778150 7fff75b41300 [WARN] Options.error_if_exists: 0
2015/05/22-00:20:39.778152 7fff75b41300 [WARN] Options.create_if_missing: 1
2015/05/22-00:20:39.778153 7fff75b41300 [WARN] Options.paranoid_checks: 1

Reviewers: MarkCallaghan, igor, kradhakrishnan

Reviewed By: igor

Subscribers: sdong, dhruba, leveldb

Differential Revision: https://reviews.facebook.net/D38835

dc81efe4

Ensure ColumnFamilyOptions.num_levels >= 2 when level compaction is used. · 687214f8

由 Yueh-Hsuan Chiang 提交于 5月 22, 2015

Summary: Ensure ColumnFamilyOptions.num_levels >= 2 when level compaction is used.

Test Plan: Extend SanitizeOptions test in column_family_test

Reviewers: sdong, rven, anthony, krishnanm86, igor

Reviewed By: igor

Subscribers: dhruba, leveldb

Differential Revision: https://reviews.facebook.net/D38829

687214f8

Avoid logging under mutex in DBImpl::WriteLevel0TableForRecovery(). · 2abb5926

由 Yueh-Hsuan Chiang 提交于 5月 22, 2015

Summary: Avoid logging under mutex in DBImpl::WriteLevel0TableForRecovery().

Test Plan: make all check

Reviewers: igor, sdong

Reviewed By: sdong

Subscribers: dhruba, leveldb

Differential Revision: https://reviews.facebook.net/D38823

2abb5926

Run tests sequentally if J=1 · 309a9d07

由 Igor Canadi 提交于 5月 22, 2015

Summary: Sometimes we want to run tests sequentially. J=1 gives us that option

Test Plan:
make J=1 check -- sequential
make J=2 check -- parallel

Reviewers: sdong, yhchiang, meyering

Reviewed By: meyering

Subscribers: dhruba, leveldb

Differential Revision: https://reviews.facebook.net/D38805

309a9d07

22 5月, 2015 3 次提交

Allow EventLogger to directly log from a JSONWriter. · 7fee8775

由 Yueh-Hsuan Chiang 提交于 5月 21, 2015

Summary:
Allow EventLogger to directly log from a JSONWriter.  This allows
the JSONWriter to be shared by EventLogger and potentially EventListener,
which is an important step to integrate EventLogger and EventListener.

This patch also rewrites EventLoggerHelpers::LogTableFileCreation(),
which uses the new API to generate identical log.

Test Plan:
Run db_bench in debug mode and make sure the log is correct and no
assertions fail.

Reviewers: sdong, anthony, igor

Reviewed By: igor

Subscribers: dhruba, leveldb

Differential Revision: https://reviews.facebook.net/D38709

7fee8775

Don't artificially inflate L0 score · 7a357751

由 Igor Canadi 提交于 5月 21, 2015

Summary:
This turns out to be pretty bad because if we prioritize L0->L1 then L1 can grow artificially large, which makes L0->L1 more and more expensive. For example:
256MB @ L0 + 256MB @ L1 --> 512MB @ L1
256MB @ L0 + 512MB @ L1 --> 768MB @ L1
256MB @ L0 + 768MB @ L1 --> 1GB @ L1

....

256MB @ L0 + 10GB @ L1 --> 10.2GB @ L1

At some point we need to start compacting L1->L2 to speed up L0->L1.

Test Plan:
The performance improvement is massive for heavy write workload. This is the benchmark I ran: https://phabricator.fb.com/P19842671. Before this change, the benchmark took 47 minutes to complete. After, the benchmark finished in 2minutes. You can see full results here: https://phabricator.fb.com/P19842674

Also, we ran this diff on MongoDB on RocksDB on one replicaset. Before the change, our initial sync was so slow that it couldn't keep up with primary writes. After the change, the import finished without any issues

Reviewers: dynamike, MarkCallaghan, rven, yhchiang, sdong

Reviewed By: sdong

Subscribers: dhruba, leveldb

Differential Revision: https://reviews.facebook.net/D38637

7a357751

Set stats_dump_period_sec to 600 by default · 4cb4d546

由 Igor Canadi 提交于 5月 21, 2015

Summary: Having stats in our LOG more often will help a lot with perf debugging.

Test Plan: none

Reviewers: sdong, MarkCallaghan

Reviewed By: MarkCallaghan

Subscribers: dhruba, leveldb

Differential Revision: https://reviews.facebook.net/D38781

4cb4d546

kvdb / rocksdb 11 个月 前同步成功

kvdb / rocksdb
11 个月前同步成功