  1. 04 Jan 2017 (1 commit)
  2. 01 Jan 2017 (1 commit)
  3. 23 Dec 2016 (1 commit)
    • direct io write support · 972f96b3
      Committed by Aaron Gao
      Summary:
      RocksDB direct I/O write support.
      
      ```
      [gzh@dev11575.prn2 ~/rocksdb] ./db_bench -benchmarks=fillseq --num=1000000
      Initializing RocksDB Options from the specified file
      Initializing RocksDB Options from command-line flags
      RocksDB:    version 5.0
      Date:       Wed Nov 23 13:17:43 2016
      CPU:        40 * Intel(R) Xeon(R) CPU E5-2660 v2 @ 2.20GHz
      CPUCache:   25600 KB
      Keys:       16 bytes each
      Values:     100 bytes each (50 bytes after compression)
      Entries:    1000000
      Prefix:    0 bytes
      Keys per prefix:    0
      RawSize:    110.6 MB (estimated)
      FileSize:   62.9 MB (estimated)
      Write rate: 0 bytes/second
      Compression: Snappy
      Memtablerep: skip_list
      Perf Level: 1
      WARNING: Assertions are enabled; benchmarks unnecessarily slow
      ------------------------------------------------
      Initializing RocksDB Options from the specified file
      Initializing RocksDB Options from command-line flags
      DB path: [/tmp/rocksdbtest-112628/dbbench]
      fillseq      :       4.393 micros/op 227639 ops/sec;   25.2 MB/s
      
      [gzh@dev11575.prn2 ~/roc
      ```
      Closes https://github.com/facebook/rocksdb/pull/1564
      
      Differential Revision: D4241093
      
      Pulled By: lightmark
      
      fbshipit-source-id: 98c29e3
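
      As a rough illustration of what this change enables (not part of the original
      commit), the sketch below shows how direct I/O might be switched on through
      DBOptions. The flag names are an assumption about the option surface: around
      this release the write side was controlled by use_direct_writes, which later
      RocksDB versions replaced with use_direct_io_for_flush_and_compaction (used
      below), while use_direct_reads covers the read side.

      ```
      #include <cassert>

      #include "rocksdb/db.h"
      #include "rocksdb/options.h"

      int main() {
        rocksdb::Options options;
        options.create_if_missing = true;
        // Assumed flags: bypass the OS page cache for reads and for the
        // flush/compaction write path. In the RocksDB 5.0 era the write-side
        // switch was named use_direct_writes.
        options.use_direct_reads = true;
        options.use_direct_io_for_flush_and_compaction = true;

        rocksdb::DB* db = nullptr;
        rocksdb::Status s = rocksdb::DB::Open(options, "/tmp/direct_io_demo", &db);
        assert(s.ok());
        s = db->Put(rocksdb::WriteOptions(), "key", "value");
        assert(s.ok());
        delete db;
        return 0;
      }
      ```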
  4. 16 Dec 2016 (1 commit)
  5. 14 Dec 2016 (1 commit)
  6. 08 Dec 2016 (1 commit)
  7. 01 Dec 2016 (2 commits)
  8. 17 Nov 2016 (1 commit)
  9. 09 Nov 2016 (2 commits)
  10. 04 Nov 2016 (1 commit)
  11. 02 Nov 2016 (2 commits)
  12. 13 Sep 2016 (1 commit)
  13. 10 Sep 2016 (1 commit)
  14. 02 Sep 2016 (1 commit)
    • Merge options source_compaction_factor, max_grandparent_overlap_bytes and... · 32149059
      Committed by sdong
      Merge options source_compaction_factor, max_grandparent_overlap_bytes and expanded_compaction_factor into max_compaction_bytes
      
      Summary: To reduce the number of options, merge source_compaction_factor, max_grandparent_overlap_bytes and expanded_compaction_factor into max_compaction_bytes.
      
      Test Plan: Add two new unit tests. Run all existing tests, including jtest.
      
      Reviewers: yhchiang, igor, IslamAbdelRahman
      
      Reviewed By: IslamAbdelRahman
      
      Subscribers: leveldb, andrewkr, dhruba
      
      Differential Revision: https://reviews.facebook.net/D59829
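
      For illustration only (not from the commit), a minimal sketch of how the
      consolidated option might be set; the specific values are assumed examples,
      chosen to mirror the old "25 x target file size" style of limit.

      ```
      #include "rocksdb/options.h"

      rocksdb::Options MakeOptions() {
        rocksdb::Options options;
        // One knob instead of three: cap the total bytes a single compaction may
        // touch, a limit previously derived from source_compaction_factor,
        // max_grandparent_overlap_bytes and expanded_compaction_factor.
        options.target_file_size_base = 64 * 1024 * 1024;  // 64 MB target files
        options.max_compaction_bytes = 25 * options.target_file_size_base;
        return options;
      }
      ```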
  15. 18 Aug 2016 (1 commit)
  16. 27 Jul 2016 (1 commit)
    • Change options memtable_prefix_bloom_huge_page_tlb_size =>... · e5b5f12b
      Committed by sdong
      Change options memtable_prefix_bloom_huge_page_tlb_size => memtable_huge_page_size and cover huge page to memtable too
      
      Summary: Extend the option (renamed from memtable_prefix_bloom_huge_page_tlb_size to memtable_huge_page_size) so that huge pages are used not just for the memtable bloom filter but for the memtable itself too.
      
      Test Plan: Run all existing tests.
      
      Reviewers: IslamAbdelRahman, yhchiang, andrewkr
      
      Reviewed By: andrewkr
      
      Subscribers: leveldb, andrewkr, dhruba
      
      Differential Revision: https://reviews.facebook.net/D60513
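
      A hedged sketch (not from the commit) of how the renamed option might be used;
      the 2 MB value is an assumed typical huge-page size, and the prefix extractor
      is only needed if a memtable prefix bloom filter is wanted as well.

      ```
      #include "rocksdb/options.h"
      #include "rocksdb/slice_transform.h"

      rocksdb::Options MakeOptions() {
        rocksdb::Options options;
        // Renamed option: allocate the memtable (and its prefix bloom filter, if
        // any) from huge pages of this size when the platform supports it.
        options.memtable_huge_page_size = 2 * 1024 * 1024;  // assumed 2 MB pages
        // Optional: enable a memtable prefix bloom filter on 4-byte prefixes.
        options.prefix_extractor.reset(rocksdb::NewFixedPrefixTransform(4));
        return options;
      }
      ```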
  17. 21 Jul 2016 (1 commit)
    • Introduce FullMergeV2 (eliminate memcpy from merge operators) · 68a8e6b8
      Committed by Islam AbdelRahman
      Summary:
      This diff updates the code to pin the merge operator operands while the merge operation is being performed, so that we can eliminate the memcpy cost. To do that we need a new public API for FullMerge that replaces the std::deque<std::string> with std::vector<Slice>.
      
      This diff is stacked on top of D56493 and D56511
      
      In this diff we
      - Update FullMergeV2 arguments to be encapsulated in MergeOperationInput and MergeOperationOutput which will make it easier to add new arguments in the future
      - Replace std::deque<std::string> with std::vector<Slice> to pass operands
      - Replace MergeContext std::deque with std::vector (based on a simple benchmark I ran https://gist.github.com/IslamAbdelRahman/78fc86c9ab9f52b1df791e58943fb187)
      - Allow FullMergeV2 output to be an existing operand
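
      As a rough, hedged sketch of the new interface (not code from this diff), a
      "max" merge operator written against FullMergeV2 might look like the following;
      it relies on the MergeOperationInput/MergeOperationOutput structs described
      above and hands back an existing operand instead of copying it.

      ```
      #include "rocksdb/merge_operator.h"
      #include "rocksdb/slice.h"

      // Hypothetical "max" operator using the FullMergeV2 API.
      class MaxOperator : public rocksdb::MergeOperator {
       public:
        bool FullMergeV2(const MergeOperationInput& merge_in,
                         MergeOperationOutput* merge_out) const override {
          // Start from the existing value, if there is one.
          rocksdb::Slice max = merge_in.existing_value ? *merge_in.existing_value
                                                       : rocksdb::Slice();
          // Operands arrive as a std::vector<Slice>; nothing is copied here.
          for (const rocksdb::Slice& op : merge_in.operand_list) {
            if (max.compare(op) < 0) {
              max = op;
            }
          }
          // The output can simply point at an existing operand, avoiding memcpy.
          merge_out->existing_operand = max;
          return true;
        }

        const char* Name() const override { return "MaxOperator"; }
      };
      ```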
      
      ```
      [Everything in Memtable | 10K operands | 10 KB each | 1 operand per key]
      
      DEBUG_LEVEL=0 make db_bench -j64 && ./db_bench --benchmarks="mergerandom,readseq,readseq,readseq,readseq,readseq" --merge_operator="max" --merge_keys=10000 --num=10000 --disable_auto_compactions --value_size=10240 --write_buffer_size=1000000000
      
      [FullMergeV2]
      readseq      :       0.607 micros/op 1648235 ops/sec; 16121.2 MB/s
      readseq      :       0.478 micros/op 2091546 ops/sec; 20457.2 MB/s
      readseq      :       0.252 micros/op 3972081 ops/sec; 38850.5 MB/s
      readseq      :       0.237 micros/op 4218328 ops/sec; 41259.0 MB/s
      readseq      :       0.247 micros/op 4043927 ops/sec; 39553.2 MB/s
      
      [master]
      readseq      :       3.935 micros/op 254140 ops/sec; 2485.7 MB/s
      readseq      :       3.722 micros/op 268657 ops/sec; 2627.7 MB/s
      readseq      :       3.149 micros/op 317605 ops/sec; 3106.5 MB/s
      readseq      :       3.125 micros/op 320024 ops/sec; 3130.1 MB/s
      readseq      :       4.075 micros/op 245374 ops/sec; 2400.0 MB/s
      ```
      
      ```
      [Everything in Memtable | 10K operands | 10 KB each | 10 operand per key]
      
      DEBUG_LEVEL=0 make db_bench -j64 && ./db_bench --benchmarks="mergerandom,readseq,readseq,readseq,readseq,readseq" --merge_operator="max" --merge_keys=1000 --num=10000 --disable_auto_compactions --value_size=10240 --write_buffer_size=1000000000
      
      [FullMergeV2]
      readseq      :       3.472 micros/op 288018 ops/sec; 2817.1 MB/s
      readseq      :       2.304 micros/op 434027 ops/sec; 4245.2 MB/s
      readseq      :       1.163 micros/op 859845 ops/sec; 8410.0 MB/s
      readseq      :       1.192 micros/op 838926 ops/sec; 8205.4 MB/s
      readseq      :       1.250 micros/op 800000 ops/sec; 7824.7 MB/s
      
      [master]
      readseq      :      24.025 micros/op 41623 ops/sec;  407.1 MB/s
      readseq      :      18.489 micros/op 54086 ops/sec;  529.0 MB/s
      readseq      :      18.693 micros/op 53495 ops/sec;  523.2 MB/s
      readseq      :      23.621 micros/op 42335 ops/sec;  414.1 MB/s
      readseq      :      18.775 micros/op 53262 ops/sec;  521.0 MB/s
      
      ```
      
      ```
      [Everything in Block cache | 10K operands | 10 KB each | 1 operand per key]
      
      [FullMergeV2]
      $ DEBUG_LEVEL=0 make db_bench -j64 && ./db_bench --benchmarks="readseq,readseq,readseq,readseq,readseq" --merge_operator="max" --num=100000 --db="/dev/shm/merge-random-10K-10KB" --cache_size=1000000000 --use_existing_db --disable_auto_compactions
      readseq      :      14.741 micros/op 67837 ops/sec;  663.5 MB/s
      readseq      :       1.029 micros/op 971446 ops/sec; 9501.6 MB/s
      readseq      :       0.974 micros/op 1026229 ops/sec; 10037.4 MB/s
      readseq      :       0.965 micros/op 1036080 ops/sec; 10133.8 MB/s
      readseq      :       0.943 micros/op 1060657 ops/sec; 10374.2 MB/s
      
      [master]
      readseq      :      16.735 micros/op 59755 ops/sec;  584.5 MB/s
      readseq      :       3.029 micros/op 330151 ops/sec; 3229.2 MB/s
      readseq      :       3.136 micros/op 318883 ops/sec; 3119.0 MB/s
      readseq      :       3.065 micros/op 326245 ops/sec; 3191.0 MB/s
      readseq      :       3.014 micros/op 331813 ops/sec; 3245.4 MB/s
      ```
      
      ```
      [Everything in Block cache | 10K operands | 10 KB each | 10 operand per key]
      
      DEBUG_LEVEL=0 make db_bench -j64 && ./db_bench --benchmarks="readseq,readseq,readseq,readseq,readseq" --merge_operator="max" --num=100000 --db="/dev/shm/merge-random-10-operands-10K-10KB" --cache_size=1000000000 --use_existing_db --disable_auto_compactions
      
      [FullMergeV2]
      readseq      :      24.325 micros/op 41109 ops/sec;  402.1 MB/s
      readseq      :       1.470 micros/op 680272 ops/sec; 6653.7 MB/s
      readseq      :       1.231 micros/op 812347 ops/sec; 7945.5 MB/s
      readseq      :       1.091 micros/op 916590 ops/sec; 8965.1 MB/s
      readseq      :       1.109 micros/op 901713 ops/sec; 8819.6 MB/s
      
      [master]
      readseq      :      27.257 micros/op 36687 ops/sec;  358.8 MB/s
      readseq      :       4.443 micros/op 225073 ops/sec; 2201.4 MB/s
      readseq      :       5.830 micros/op 171526 ops/sec; 1677.7 MB/s
      readseq      :       4.173 micros/op 239635 ops/sec; 2343.8 MB/s
      readseq      :       4.150 micros/op 240963 ops/sec; 2356.8 MB/s
      ```
      
      Test Plan: COMPILE_WITH_ASAN=1 make check -j64
      
      Reviewers: yhchiang, andrewkr, sdong
      
      Reviewed By: sdong
      
      Subscribers: lovro, andrewkr, dhruba
      
      Differential Revision: https://reviews.facebook.net/D57075
  18. 18 Jun 2016 (1 commit)
    • Deprecate filter_deletes · 7b79238b
      Committed by sdong
      Summary: filter_deletes is not a frequently used feature. Remove it.
      
      Test Plan: Run all test suites.
      
      Reviewers: igor, yhchiang, IslamAbdelRahman
      
      Reviewed By: IslamAbdelRahman
      
      Subscribers: leveldb, andrewkr, dhruba
      
      Differential Revision: https://reviews.facebook.net/D59427
  19. 11 Jun 2016 (1 commit)
    • memtable_prefix_bloom_bits -> memtable_prefix_bloom_bits_ratio and deprecate... · 20699df8
      Committed by sdong
      memtable_prefix_bloom_bits -> memtable_prefix_bloom_bits_ratio and deprecate memtable_prefix_bloom_probes
      
      Summary:
      memtable_prefix_bloom_probes is not a critical option. Remove it to reduce the number of options.
      memtable_prefix_bloom_bits is easy for users to misconfigure, so turn it into memtable_prefix_bloom_bits_ratio.
      
      Test Plan: Run all existing tests
      
      Reviewers: yhchiang, igor, IslamAbdelRahman
      
      Reviewed By: IslamAbdelRahman
      
      Subscribers: gunnarku, yoshinorim, MarkCallaghan, leveldb, andrewkr, dhruba
      
      Differential Revision: https://reviews.facebook.net/D59199
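
      A hedged illustration (not from the commit): the bloom filter is now sized as a
      ratio rather than an absolute bit count. The field below uses the present-day
      name memtable_prefix_bloom_size_ratio (this commit introduced it as
      memtable_prefix_bloom_bits_ratio), and the 0.1 value is an assumed example.

      ```
      #include "rocksdb/options.h"
      #include "rocksdb/slice_transform.h"

      rocksdb::Options MakeOptions() {
        rocksdb::Options options;
        // A prefix extractor is required for the memtable prefix bloom filter.
        options.prefix_extractor.reset(rocksdb::NewFixedPrefixTransform(8));
        // Ratio-based sizing replaces the old absolute memtable_prefix_bloom_bits;
        // memtable_prefix_bloom_probes is deprecated and no longer set.
        options.memtable_prefix_bloom_size_ratio = 0.1;  // assumed example value
        return options;
      }
      ```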
  20. 03 Jun 2016 (1 commit)
  21. 02 Jun 2016 (1 commit)
  22. 24 May 2016 (1 commit)
  23. 23 May 2016 (1 commit)
  24. 28 Apr 2016 (2 commits)
    • Fix compression dictionary clang errors · 54de13ab
      Committed by Andrew Kryczka
      Summary: There were a few narrowing conversions that clang didn't like.
      
      Test Plan:
        $ make clean && USE_CLANG=1 DISABLE_JEMALLOC=1 TEST_TMPDIR=/dev/shm/rocksdb OPT=-g make -j32 check
      
      Reviewers: IslamAbdelRahman
      
      Reviewed By: IslamAbdelRahman
      
      Subscribers: andrewkr, dhruba, leveldb
      
      Differential Revision: https://reviews.facebook.net/D57351
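
      For context (not code from this change), a made-up example of the kind of
      narrowing conversion in brace initialization that clang flags.

      ```
      #include <cstddef>
      #include <cstdint>

      void Example(std::size_t sample_len) {
        // Under clang (C++11 and later), brace initialization that narrows is
        // rejected:
        //   std::uint32_t len{sample_len};  // error: non-constant-expression
        //                                   //        cannot be narrowed
        // An explicit cast states the intent and fixes the build.
        std::uint32_t len = static_cast<std::uint32_t>(sample_len);
        (void)len;  // unused in this sketch
      }
      ```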
    • Shared dictionary compression using reference block · 843d2e31
      Committed by Andrew Kryczka
      Summary:
      This adds a new metablock containing a shared dictionary that is used
      to compress all data blocks in the SST file. The size of the shared dictionary
      is configurable in CompressionOptions and defaults to 0. It's currently only
      used for zlib/lz4/lz4hc, but the block will be stored in the SST regardless of
      the compression type if the user chooses a nonzero dictionary size.
      
      During compaction, the dictionary is computed by randomly sampling the first
      output file in each subcompaction. The intervals to sample are pre-computed
      by assuming the output file will have the maximum allowable length. In case
      the file is smaller, some of the pre-computed sampling intervals can be beyond
      end-of-file, in which case we skip over those samples and the dictionary will
      be a bit smaller. After the dictionary is generated using the first file in a
      subcompaction, it is loaded into the compression library before writing each
      block in each subsequent file of that subcompaction.
      
      On the read path, the dictionary is read from the metablock, if it exists. Then
      it is loaded into the compression library before each block is read.
      
      Test Plan: new unit test
      
      Reviewers: yhchiang, IslamAbdelRahman, cyan, sdong
      
      Reviewed By: sdong
      
      Subscribers: andrewkr, yoshinorim, kradhakrishnan, dhruba, leveldb
      
      Differential Revision: https://reviews.facebook.net/D52287
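
      A hedged sketch (not part of the commit) of enabling the shared dictionary; the
      max_dict_bytes field and the 16 KB value reflect my reading of the
      CompressionOptions knob this change adds, so treat them as assumptions.

      ```
      #include "rocksdb/options.h"

      rocksdb::Options MakeOptions() {
        rocksdb::Options options;
        // Dictionaries are only consulted for zlib/lz4/lz4hc at this point.
        options.compression = rocksdb::kLZ4Compression;
        // A nonzero size enables the shared-dictionary metablock; 0 (the default)
        // keeps the old behavior. 16 KB is just an assumed example.
        options.compression_opts.max_dict_bytes = 16 * 1024;
        return options;
      }
      ```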
  25. 23 Apr 2016 (4 commits)
  26. 02 Apr 2016 (1 commit)
    • Adding pin_l0_filter_and_index_blocks_in_cache feature and related fixes. · 9b519875
      Committed by Marton Trencseni
      Summary:
      When a block based table file is opened, if prefetch_index_and_filter is true, it will prefetch the index and filter blocks, putting them into the block cache.
      What this feature adds: when an L0 block based table file is opened, if pin_l0_filter_and_index_blocks_in_cache is true in the options (and prefetch_index_and_filter is true), then the filter and index blocks aren't released back to the block cache at the end of BlockBasedTableReader::Open(). Instead the table reader takes ownership of them, hence pinning them, i.e. the LRU cache will never push them out. Meanwhile in the table reader, further accesses will not hit the block cache, thus avoiding lock contention.
      
      Test Plan:
      'export TEST_TMPDIR=/dev/shm/ && DISABLE_JEMALLOC=1 OPT=-g make all valgrind_check -j32' is OK.
      I didn't run the Java tests; I don't have Java set up on my devserver.
      
      Reviewers: sdong
      
      Reviewed By: sdong
      
      Subscribers: andrewkr, dhruba
      
      Differential Revision: https://reviews.facebook.net/D56133
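
      A hedged sketch (not from the commit) of turning the feature on; it assumes the
      BlockBasedTableOptions fields named below and that index/filter blocks must be
      routed through the block cache for pinning to apply.

      ```
      #include "rocksdb/options.h"
      #include "rocksdb/table.h"

      rocksdb::Options MakeOptions() {
        rocksdb::BlockBasedTableOptions table_options;
        // Serve index and filter blocks from the block cache...
        table_options.cache_index_and_filter_blocks = true;
        // ...and for L0 files let the table reader hold a reference, so the LRU
        // cache never evicts them and later reads skip the cache lookup.
        table_options.pin_l0_filter_and_index_blocks_in_cache = true;

        rocksdb::Options options;
        options.table_factory.reset(
            rocksdb::NewBlockBasedTableFactory(table_options));
        return options;
      }
      ```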
  27. 22 Mar 2016 (1 commit)
  28. 18 Mar 2016 (1 commit)
    • Adding pin_l0_filter_and_index_blocks_in_cache feature. · 522de4f5
      Committed by Marton Trencseni
      Summary:
      When a block based table file is opened, if prefetch_index_and_filter is true, it will prefetch the index and filter blocks, putting them into the block cache.
      What this feature adds: when an L0 block based table file is opened, if pin_l0_filter_and_index_blocks_in_cache is true in the options (and prefetch_index_and_filter is true), then the filter and index blocks aren't released back to the block cache at the end of BlockBasedTableReader::Open(). Instead the table reader takes ownership of them, hence pinning them, i.e. the LRU cache will never push them out. Meanwhile in the table reader, further accesses will not hit the block cache, thus avoiding lock contention.
      When the table reader is destroyed, it releases the pinned blocks (if there were any). This has to happen before the cache is destroyed, so I had to introduce a TableReader::Close(), to guarantee the order of destruction.
      
      Test Plan:
      Added two unit tests for this. Existing unit tests run fine (default is pin_l0_filter_and_index_blocks_in_cache=false).
      
      DISABLE_JEMALLOC=1 OPT=-g make all valgrind_check -j32
        Mac: OK.
        Linux: OK with D55287 patched in.
      
      Reviewers: sdong
      
      Reviewed By: sdong
      
      Subscribers: andrewkr, leveldb, dhruba
      
      Differential Revision: https://reviews.facebook.net/D54801
  29. 10 Feb 2016 (1 commit)
  30. 31 Dec 2015 (2 commits)
  31. 31 Oct 2015 (1 commit)
  32. 19 Oct 2015 (1 commit)