提交 · 16fb04434fede6ee368622dfa817b6cc2830833b · kvdb / rocksdb

02 11月, 2016 1 次提交

expose IngestExternalFile to c abi · 16fb0443

由 Jay Lee 提交于 11月 01, 2016

Summary:
IngestExternalFile is very useful when doing bulk load. This pr expose this API to c so many bindings can benefit from it too.
Closes https://github.com/facebook/rocksdb/pull/1454

Differential Revision: D4113420

Pulled By: yiwu-arbug

fbshipit-source-id: 307c6ae

16fb0443

13 9月, 2016 1 次提交
- A
  
  Fix C api memtable rep bugs. (#1328) · a10e8a05
  由 Adam Faulkner 提交于 9月 12, 2016
  
  a10e8a05
10 9月, 2016 1 次提交
- Z
  add C api for set wal_recovery_mode (#1327) · b06b1913
  由 zhangjinpeng1987 提交于 9月 10, 2016
```
* add C api for set wal recovery mode

* add test
```
  b06b1913
02 9月, 2016 1 次提交

Merge options source_compaction_factor, max_grandparent_overlap_bytes and... · 32149059

由 sdong 提交于 6月 16, 2016

Merge options source_compaction_factor, max_grandparent_overlap_bytes and expanded_compaction_factor into max_compaction_bytes

Summary: To reduce number of options, merge source_compaction_factor, max_grandparent_overlap_bytes and expanded_compaction_factor into max_compaction_bytes.

Test Plan: Add two new unit tests. Run all existing tests, including jtest.

Reviewers: yhchiang, igor, IslamAbdelRahman

Reviewed By: IslamAbdelRahman

Subscribers: leveldb, andrewkr, dhruba

Differential Revision: https://reviews.facebook.net/D59829

32149059

18 8月, 2016 1 次提交
- J
  c abi: allow compaction filter ignore snapshot (#1268) · 49d88be0
  由 Jay 提交于 8月 18, 2016
```
close #1262
```
  49d88be0
27 7月, 2016 1 次提交

Change options memtable_prefix_bloom_huge_page_tlb_size =>... · e5b5f12b

由 sdong 提交于 7月 26, 2016

Change options memtable_prefix_bloom_huge_page_tlb_size => memtable_huge_page_size and cover huge page to memtable too

Summary: Extend the option memtable_prefix_bloom_huge_page_tlb_size from just putting memtable bloom filter to huge page to memtable itself too.

Test Plan: Run all existing tests.

Reviewers: IslamAbdelRahman, yhchiang, andrewkr

Reviewed By: andrewkr

Subscribers: leveldb, andrewkr, dhruba

Differential Revision: https://reviews.facebook.net/D60513

e5b5f12b

21 7月, 2016 1 次提交

Introduce FullMergeV2 (eliminate memcpy from merge operators) · 68a8e6b8

由 Islam AbdelRahman 提交于 7月 20, 2016

Summary:
This diff update the code to pin the merge operator operands while the merge operation is done, so that we can eliminate the memcpy cost, to do that we need a new public API for FullMerge that replace the std::deque<std::string> with std::vector<Slice>

This diff is stacked on top of D56493 and D56511

In this diff we
- Update FullMergeV2 arguments to be encapsulated in MergeOperationInput and MergeOperationOutput which will make it easier to add new arguments in the future
- Replace std::deque<std::string> with std::vector<Slice> to pass operands
- Replace MergeContext std::deque with std::vector (based on a simple benchmark I ran https://gist.github.com/IslamAbdelRahman/78fc86c9ab9f52b1df791e58943fb187)
- Allow FullMergeV2 output to be an existing operand

```
[Everything in Memtable | 10K operands | 10 KB each | 1 operand per key]

DEBUG_LEVEL=0 make db_bench -j64 && ./db_bench --benchmarks="mergerandom,readseq,readseq,readseq,readseq,readseq" --merge_operator="max" --merge_keys=10000 --num=10000 --disable_auto_compactions --value_size=10240 --write_buffer_size=1000000000

[FullMergeV2]
readseq      :       0.607 micros/op 1648235 ops/sec; 16121.2 MB/s
readseq      :       0.478 micros/op 2091546 ops/sec; 20457.2 MB/s
readseq      :       0.252 micros/op 3972081 ops/sec; 38850.5 MB/s
readseq      :       0.237 micros/op 4218328 ops/sec; 41259.0 MB/s
readseq      :       0.247 micros/op 4043927 ops/sec; 39553.2 MB/s

[master]
readseq      :       3.935 micros/op 254140 ops/sec; 2485.7 MB/s
readseq      :       3.722 micros/op 268657 ops/sec; 2627.7 MB/s
readseq      :       3.149 micros/op 317605 ops/sec; 3106.5 MB/s
readseq      :       3.125 micros/op 320024 ops/sec; 3130.1 MB/s
readseq      :       4.075 micros/op 245374 ops/sec; 2400.0 MB/s
```

```
[Everything in Memtable | 10K operands | 10 KB each | 10 operand per key]

DEBUG_LEVEL=0 make db_bench -j64 && ./db_bench --benchmarks="mergerandom,readseq,readseq,readseq,readseq,readseq" --merge_operator="max" --merge_keys=1000 --num=10000 --disable_auto_compactions --value_size=10240 --write_buffer_size=1000000000

[FullMergeV2]
readseq      :       3.472 micros/op 288018 ops/sec; 2817.1 MB/s
readseq      :       2.304 micros/op 434027 ops/sec; 4245.2 MB/s
readseq      :       1.163 micros/op 859845 ops/sec; 8410.0 MB/s
readseq      :       1.192 micros/op 838926 ops/sec; 8205.4 MB/s
readseq      :       1.250 micros/op 800000 ops/sec; 7824.7 MB/s

[master]
readseq      :      24.025 micros/op 41623 ops/sec;  407.1 MB/s
readseq      :      18.489 micros/op 54086 ops/sec;  529.0 MB/s
readseq      :      18.693 micros/op 53495 ops/sec;  523.2 MB/s
readseq      :      23.621 micros/op 42335 ops/sec;  414.1 MB/s
readseq      :      18.775 micros/op 53262 ops/sec;  521.0 MB/s

```

```
[Everything in Block cache | 10K operands | 10 KB each | 1 operand per key]

[FullMergeV2]
$ DEBUG_LEVEL=0 make db_bench -j64 && ./db_bench --benchmarks="readseq,readseq,readseq,readseq,readseq" --merge_operator="max" --num=100000 --db="/dev/shm/merge-random-10K-10KB" --cache_size=1000000000 --use_existing_db --disable_auto_compactions
readseq      :      14.741 micros/op 67837 ops/sec;  663.5 MB/s
readseq      :       1.029 micros/op 971446 ops/sec; 9501.6 MB/s
readseq      :       0.974 micros/op 1026229 ops/sec; 10037.4 MB/s
readseq      :       0.965 micros/op 1036080 ops/sec; 10133.8 MB/s
readseq      :       0.943 micros/op 1060657 ops/sec; 10374.2 MB/s

[master]
readseq      :      16.735 micros/op 59755 ops/sec;  584.5 MB/s
readseq      :       3.029 micros/op 330151 ops/sec; 3229.2 MB/s
readseq      :       3.136 micros/op 318883 ops/sec; 3119.0 MB/s
readseq      :       3.065 micros/op 326245 ops/sec; 3191.0 MB/s
readseq      :       3.014 micros/op 331813 ops/sec; 3245.4 MB/s
```

```
[Everything in Block cache | 10K operands | 10 KB each | 10 operand per key]

DEBUG_LEVEL=0 make db_bench -j64 && ./db_bench --benchmarks="readseq,readseq,readseq,readseq,readseq" --merge_operator="max" --num=100000 --db="/dev/shm/merge-random-10-operands-10K-10KB" --cache_size=1000000000 --use_existing_db --disable_auto_compactions

[FullMergeV2]
readseq      :      24.325 micros/op 41109 ops/sec;  402.1 MB/s
readseq      :       1.470 micros/op 680272 ops/sec; 6653.7 MB/s
readseq      :       1.231 micros/op 812347 ops/sec; 7945.5 MB/s
readseq      :       1.091 micros/op 916590 ops/sec; 8965.1 MB/s
readseq      :       1.109 micros/op 901713 ops/sec; 8819.6 MB/s

[master]
readseq      :      27.257 micros/op 36687 ops/sec;  358.8 MB/s
readseq      :       4.443 micros/op 225073 ops/sec; 2201.4 MB/s
readseq      :       5.830 micros/op 171526 ops/sec; 1677.7 MB/s
readseq      :       4.173 micros/op 239635 ops/sec; 2343.8 MB/s
readseq      :       4.150 micros/op 240963 ops/sec; 2356.8 MB/s
```

Test Plan: COMPILE_WITH_ASAN=1 make check -j64

Reviewers: yhchiang, andrewkr, sdong

Reviewed By: sdong

Subscribers: lovro, andrewkr, dhruba

Differential Revision: https://reviews.facebook.net/D57075

68a8e6b8

18 6月, 2016 1 次提交

Deprectate filter_deletes · 7b79238b

由 sdong 提交于 6月 07, 2016

Summary: filter_deltes is not a frequently used feature. Remove it.

Test Plan: Run all test suites.

Reviewers: igor, yhchiang, IslamAbdelRahman

Reviewed By: IslamAbdelRahman

Subscribers: leveldb, andrewkr, dhruba

Differential Revision: https://reviews.facebook.net/D59427

7b79238b

11 6月, 2016 1 次提交

memtable_prefix_bloom_bits -> memtable_prefix_bloom_bits_ratio and deprecate... · 20699df8

由 sdong 提交于 6月 03, 2016

memtable_prefix_bloom_bits -> memtable_prefix_bloom_bits_ratio and deprecate memtable_prefix_bloom_probes

Summary:
memtable_prefix_bloom_probes is not a critical option. Remove it to reduce number of options.
It's easier for users to make mistakes with memtable_prefix_bloom_bits, turn it to memtable_prefix_bloom_bits_ratio

Test Plan: Run all existing tests

Reviewers: yhchiang, igor, IslamAbdelRahman

Reviewed By: IslamAbdelRahman

Subscribers: gunnarku, yoshinorim, MarkCallaghan, leveldb, andrewkr, dhruba

Differential Revision: https://reviews.facebook.net/D59199

20699df8

03 6月, 2016 1 次提交
- J
  
  allow updating block cache capacity from C (#1149) · 02ec8154
  由 Jan Doms 提交于 6月 03, 2016
  
  02ec8154
02 6月, 2016 1 次提交
- S
  
  add readahead size option (#1146) · 21c047ab
  由 siddontang 提交于 6月 02, 2016
  
  21c047ab
24 5月, 2016 1 次提交
- S
  
  Expose report_bg_io_stats option in the C API. (#1131) · def2f7bd
  由 Shen Li 提交于 5月 24, 2016
  
  def2f7bd
23 5月, 2016 1 次提交
- S
  
  C API: Expose DeleteFileInRange (#1132) · 8f121453
  由 siddontang 提交于 5月 23, 2016
  
  8f121453
28 4月, 2016 2 次提交

Fix compression dictionary clang errors · 54de13ab

由 Andrew Kryczka 提交于 4月 27, 2016

Summary: There were a few narrowing conversions that clang didn't like.

Test Plan:
  $ make clean && USE_CLANG=1 DISABLE_JEMALLOC=1 TEST_TMPDIR=/dev/shm/rocksdb OPT=-g make -j32 check

Reviewers: IslamAbdelRahman

Reviewed By: IslamAbdelRahman

Subscribers: andrewkr, dhruba, leveldb

Differential Revision: https://reviews.facebook.net/D57351

54de13ab

Shared dictionary compression using reference block · 843d2e31

由 Andrew Kryczka 提交于 4月 27, 2016

Summary:
This adds a new metablock containing a shared dictionary that is used
to compress all data blocks in the SST file. The size of the shared dictionary
is configurable in CompressionOptions and defaults to 0. It's currently only
used for zlib/lz4/lz4hc, but the block will be stored in the SST regardless of
the compression type if the user chooses a nonzero dictionary size.

During compaction, computes the dictionary by randomly sampling the first
output file in each subcompaction. It pre-computes the intervals to sample
by assuming the output file will have the maximum allowable length. In case
the file is smaller, some of the pre-computed sampling intervals can be beyond
end-of-file, in which case we skip over those samples and the dictionary will
be a bit smaller. After the dictionary is generated using the first file in a
subcompaction, it is loaded into the compression library before writing each
block in each subsequent file of that subcompaction.

On the read path, gets the dictionary from the metablock, if it exists. Then,
loads that dictionary into the compression library before reading each block.

Test Plan: new unit test

Reviewers: yhchiang, IslamAbdelRahman, cyan, sdong

Reviewed By: sdong

Subscribers: andrewkr, yoshinorim, kradhakrishnan, dhruba, leveldb

Differential Revision: https://reviews.facebook.net/D52287

843d2e31

23 4月, 2016 4 次提交
- N
  Merge pull request #1068 from daaku/c-purge-old-backups · 99a3bf8f
  由 Naitik Shah 提交于 4月 22, 2016
```
rocksdb_backup_engine_purge_old_backups for C libraries
```
  99a3bf8f
- N
  
  rocksdb_create_mem_env to allow C libraries to create mem env (#1066) · c146c9be
  由 Naitik Shah 提交于 4月 22, 2016
  
  c146c9be
- N
  
  expose more options in the c api (#1067) · 6da70c58
  由 Naitik Shah 提交于 4月 22, 2016
  
  6da70c58
- N
  
  C rocksdb_create_iterators to expose NewIterators (#1069) · 6f01687a
  由 Naitik Shah 提交于 4月 22, 2016
  
  6f01687a
02 4月, 2016 1 次提交

Adding pin_l0_filter_and_index_blocks_in_cache feature and related fixes. · 9b519875

由 Marton Trencseni 提交于 4月 01, 2016

Summary:
When a block based table file is opened, if prefetch_index_and_filter is true, it will prefetch the index and filter blocks, putting them into the block cache.
What this feature adds: when a L0 block based table file is opened, if pin_l0_filter_and_index_blocks_in_cache is true in the options (and prefetch_index_and_filter is true), then the filter and index blocks aren't released back to the block cache at the end of BlockBasedTableReader::Open(). Instead the table reader takes ownership of them, hence pinning them, ie. the LRU cache will never push them out. Meanwhile in the table reader, further accesses will not hit the block cache, thus avoiding lock contention.

Test Plan:
'export TEST_TMPDIR=/dev/shm/ && DISABLE_JEMALLOC=1 OPT=-g make all valgrind_check -j32' is OK.
I didn't run the Java tests, I don't have Java set up on my devserver.

Reviewers: sdong

Reviewed By: sdong

Subscribers: andrewkr, dhruba

Differential Revision: https://reviews.facebook.net/D56133

9b519875

22 3月, 2016 1 次提交
- S
  Revert "Adding pin_l0_filter_and_index_blocks_in_cache feature." · b1fafcac
  由 sdong 提交于 3月 21, 2016
```
This reverts commit 522de4f5.

It has bug of index block cleaning up.
```
  b1fafcac
18 3月, 2016 1 次提交

Adding pin_l0_filter_and_index_blocks_in_cache feature. · 522de4f5

由 Marton Trencseni 提交于 3月 17, 2016

Summary:
When a block based table file is opened, if prefetch_index_and_filter is true, it will prefetch the index and filter blocks, putting them into the block cache.
What this feature adds: when a L0 block based table file is opened, if pin_l0_filter_and_index_blocks_in_cache is true in the options (and prefetch_index_and_filter is true), then the filter and index blocks aren't released back to the block cache at the end of BlockBasedTableReader::Open(). Instead the table reader takes ownership of them, hence pinning them, ie. the LRU cache will never push them out. Meanwhile in the table reader, further accesses will not hit the block cache, thus avoiding lock contention.
When the table reader is destroyed, it releases the pinned blocks (if there were any). This has to happen before the cache is destroyed, so I had to introduce a TableReader::Close(), to guarantee the order of destruction.

Test Plan:
Added two unit tests for this. Existing unit tests run fine (default is pin_l0_filter_and_index_blocks_in_cache=false).

DISABLE_JEMALLOC=1 OPT=-g make all valgrind_check -j32
Mac: OK.
Linux: with D55287 patched in it's OK.

Reviewers: sdong

Reviewed By: sdong

Subscribers: andrewkr, leveldb, dhruba

Differential Revision: https://reviews.facebook.net/D54801

522de4f5

10 2月, 2016 1 次提交
- B
  
  Updated all copyright headers to the new format. · 21e95811
  由 Baraa Hamodi 提交于 2月 09, 2016
  
  21e95811
31 12月, 2015 2 次提交
- W
  
  expose memtable_prefix_bloom_huge_page_tlb_size option to C API · 0fde291a
  由 Warren Falk 提交于 12月 29, 2015
  
  0fde291a
- W
  
  Support creation of "full" format bloom filter from C API · 7e81dba5
  由 Warren Falk 提交于 12月 29, 2015
  
  7e81dba5
31 10月, 2015 1 次提交
- S
  
  Move skip_table_builder_flush to BlockBasedTableOption · ccc8c10c
  由 SherlockNoMad 提交于 10月 30, 2015
  
  ccc8c10c
19 10月, 2015 1 次提交
- S
  options: add recycle_log_file_num option · 543c12ab
  由 Sage Weil 提交于 10月 07, 2015
```
Signed-off-by: NSage Weil <sage@redhat.com>
```
  543c12ab
18 7月, 2015 2 次提交

Don't let flushes preempt compactions · 35ca5936

由 Igor Canadi 提交于 7月 17, 2015

Summary:
When we first started, max_background_flushes was 0 by default and compaction thread was executing flushes (since there was no flush thread). Then, we switched the default max_background_flushes to 1. However, we still support the case where there is no flush thread and flushes are done in compaction. This is making our code a bit more complicated. By not supporting this use-case we can make our code simpler.

We have a special case that when you set max_background_flushes to 0, we
schedule the flush to execute on the compaction thread.

Test Plan: make check (there might be some unit tests that depend on this behavior)

Reviewers: IslamAbdelRahman, yhchiang, sdong

Reviewed By: sdong

Subscribers: dhruba, leveldb

Differential Revision: https://reviews.facebook.net/D41931

35ca5936

Deprecate CompactionFilterV2 · a96fcd09

由 Igor Canadi 提交于 7月 17, 2015

Summary: It has been around for a while and it looks like it never found any uses in the wild. It's also complicating our compaction_job code quite a bit. We're deprecating it in 3.13, but will put it back in 3.14 if we actually find users that need this feature.

Test Plan: make check

Reviewers: noetzli, yhchiang, sdong

Reviewed By: sdong

Subscribers: dhruba, leveldb

Differential Revision: https://reviews.facebook.net/D42405

a96fcd09

16 7月, 2015 1 次提交

move convenience.h out of utilities · 81d07262

由 agiardullo 提交于 7月 15, 2015

Summary: Moved convenience.h out of utilities to remove a dependency on utilities in db.

Test Plan: unit tests.  Also compiled a link to the old location to verify the _Pragma works.

Reviewers: sdong, yhchiang, igor

Reviewed By: igor

Subscribers: dhruba, leveldb

Differential Revision: https://reviews.facebook.net/D42201

81d07262

14 7月, 2015 2 次提交

Deprecate purge_redundant_kvs_while_flush · a9c51095

由 Igor Canadi 提交于 7月 14, 2015

Summary: This option is guarding the feature implemented 2 and a half years ago: D8991. The feature was enabled by default back then and has been running without issues. There is no reason why any client would turn this feature off. I found no reference in fbcode.

Test Plan: none

Reviewers: sdong, yhchiang, anthony, dhruba

Reviewed By: dhruba

Subscribers: dhruba, leveldb

Differential Revision: https://reviews.facebook.net/D42063

a9c51095

"make format" against last 10 commits · f9728640

由 sdong 提交于 7月 13, 2015

Summary: This helps Windows port to format their changes, as discussed. Might have formatted some other codes too becasue last 10 commits include more.

Test Plan: Build it.

Reviewers: anthony, IslamAbdelRahman, kradhakrishnan, yhchiang, igor

Reviewed By: igor

Subscribers: leveldb, dhruba

Differential Revision: https://reviews.facebook.net/D41961

f9728640

08 7月, 2015 1 次提交
- D
  
  Commit both PR and internal code review changes · ef4b87f1
  由 Dmitri Smirnov 提交于 7月 07, 2015
  
  ef4b87f1
03 7月, 2015 1 次提交
- D
  
  Merge the latest changes from github/master · d2f0912b
  由 Dmitri Smirnov 提交于 7月 02, 2015
  
  d2f0912b
02 7月, 2015 1 次提交

Windows Port from Microsoft · 18285c1e

由 Dmitri Smirnov 提交于 7月 01, 2015

 Summary: Make RocksDb build and run on Windows to be functionally
 complete and performant. All existing test cases run with no
 regressions. Performance numbers are in the pull-request.

 Test plan: make all of the existing unit tests pass, obtain perf numbers.

 Co-authored-by: Praveen Rao praveensinghrao@outlook.com
 Co-authored-by: Sherlock Huang baihan.huang@gmail.com
 Co-authored-by: Alex Zinoviev alexander.zinoviev@me.com
 Co-authored-by: Dmitri Smirnov dmitrism@microsoft.com

18285c1e

18 6月, 2015 2 次提交

Use CompactRangeOptions for CompactRange · 12e030a9

由 Islam AbdelRahman 提交于 6月 17, 2015

Summary:
This diff update DB::CompactRange to use RangeCompactionOptions instead of using multiple parameters
Old CompactRange is still available but deprecated

Test Plan:
make all check
make rocksdbjava
USE_CLANG=1 make all
OPT=-DROCKSDB_LITE make release

Reviewers: sdong, yhchiang, igor

Reviewed By: igor

Subscribers: dhruba

Differential Revision: https://reviews.facebook.net/D40209

12e030a9

Block c_test in ROCKSDB_LITE · 1a08d0be

由 Yueh-Hsuan Chiang 提交于 6月 17, 2015

Summary: Block c_test in ROCKSDB_LITE as it's not supported in ROCKSDB_LITE.

Test Plan: c_test

Reviewers: sdong, rven, anthony, kradhakrishnan, IslamAbdelRahman, igor

Reviewed By: igor

Subscribers: dhruba, leveldb

Differential Revision: https://reviews.facebook.net/D40257

1a08d0be

10 6月, 2015 1 次提交
- R
  
  C: add WriteBatch.PutLogData support · 735df665
  由 Reed Allman 提交于 6月 10, 2015
  
  735df665
04 6月, 2015 2 次提交
- R
  
  C: add MultiGet support · 211a195d
  由 Reed Allman 提交于 6月 03, 2015
  
  211a195d
- R
  
  C: add support for WriteBatch SliceParts params · 5dc174e1
  由 Reed Allman 提交于 6月 03, 2015
  
  5dc174e1

kvdb / rocksdb 12 个月 前同步成功

kvdb / rocksdb
12 个月前同步成功