File: README.md

package info (click to toggle)
badger 4.8.0-2
  • links: PTS, VCS
  • area: main
  • in suites: forky, sid
  • size: 1,672 kB
  • sloc: sh: 132; makefile: 61
file content (121 lines) | stat: -rw-r--r-- 3,886 bytes parent folder | download
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
Size of table is 123,217,667 bytes for all benchmarks.

# BenchmarkRead

```sh
$ go test -bench ^BenchmarkRead$ -run ^$ -count 3
goos: linux
goarch: amd64
pkg: github.com/dgraph-io/badger/table
BenchmarkRead-16              10       154074944 ns/op
BenchmarkRead-16              10       154340411 ns/op
BenchmarkRead-16              10       151914489 ns/op
PASS
ok       github.com/dgraph-io/badger/table       22.467s
```

Size of table is 123,217,667 bytes, which is ~118MB.

The rate is ~762MB/s using LoadToRAM (when table is in RAM).

To read a 64MB table, this would take ~0.084s, which is negligible.

# BenchmarkReadAndBuild

```sh
$ go test -bench BenchmarkReadAndBuild -run ^$ -count 3
goos: linux
goarch: amd64
pkg: github.com/dgraph-io/badger/table
BenchmarkReadAndBuild-16              1       1026755231 ns/op
BenchmarkReadAndBuild-16              1       1009543316 ns/op
BenchmarkReadAndBuild-16              1       1039920546 ns/op
PASS
ok       github.com/dgraph-io/badger/table       12.081s
```

The rate is ~123MB/s. To build a 64MB table, this would take ~0.56s. Note that this does NOT include
the flushing of the table to disk. All we are doing above is reading one table (which is in RAM) and
write one table in memory.

The table building takes 0.56-0.084s ~ 0.4823s.

# BenchmarkReadMerged

Below, we merge 5 tables. The total size remains unchanged at ~122M.

```sh
$ go test -bench ReadMerged -run ^$ -count 3
goos: linux
goarch: amd64
pkg: github.com/dgraph-io/badger/table
BenchmarkReadMerged-16              2       977588975 ns/op
BenchmarkReadMerged-16              2       982140738 ns/op
BenchmarkReadMerged-16              2       962046017 ns/op
PASS
ok       github.com/dgraph-io/badger/table       27.433s
```

The rate is ~120MB/s. To read a 64MB table using merge iterator, this would take ~0.53s.

# BenchmarkRandomRead

```sh
go test -bench BenchmarkRandomRead$ -run ^$ -count 3
goos: linux
goarch: amd64
pkg: github.com/dgraph-io/badger/table
BenchmarkRandomRead-16              500000       2645 ns/op
BenchmarkRandomRead-16              500000       2648 ns/op
BenchmarkRandomRead-16              500000       2614 ns/op
PASS
ok       github.com/dgraph-io/badger/table       50.850s
```

For random read benchmarking, we are randomly reading a key and verifying its value.

# DB Open benchmark

1. Create badger DB with 2 billion key-value pairs (about 380GB of data)

   ```sh
   badger fill -m 2000 --dir="/tmp/data" --sorted
   ```

2. Clear buffers and swap memory

   ```sh
   free -mh && sync && echo 3 | sudo tee /proc/sys/vm/drop_caches && sudo swapoff -a && sudo swapon -a && free -mh
   ```

   Also flush disk buffers

   ```sh
   blockdev --flushbufs /dev/nvme0n1p4
   ```

3. Run the benchmark

   ```sh
   go test -run=^$ github.com/dgraph-io/badger -bench ^BenchmarkDBOpen$ -benchdir="/tmp/data" -v

   badger 2019/06/04 17:15:56 INFO: 126 tables out of 1028 opened in 3.017s
   badger 2019/06/04 17:15:59 INFO: 257 tables out of 1028 opened in 6.014s
   badger 2019/06/04 17:16:02 INFO: 387 tables out of 1028 opened in 9.017s
   badger 2019/06/04 17:16:05 INFO: 516 tables out of 1028 opened in 12.025s
   badger 2019/06/04 17:16:08 INFO: 645 tables out of 1028 opened in 15.013s
   badger 2019/06/04 17:16:11 INFO: 775 tables out of 1028 opened in 18.008s
   badger 2019/06/04 17:16:14 INFO: 906 tables out of 1028 opened in 21.003s
   badger 2019/06/04 17:16:17 INFO: All 1028 tables opened in 23.851s
   badger 2019/06/04 17:16:17 INFO: Replaying file id: 1998 at offset: 332000
   badger 2019/06/04 17:16:17 INFO: Replay took: 9.81µs
   goos: linux
   goarch: amd64
   pkg: github.com/dgraph-io/badger
   BenchmarkDBOpen-16              1       23930082140 ns/op
   PASS
   ok       github.com/dgraph-io/badger       24.076s

   ```

   It takes about 23.851s to open a DB with 2 billion sorted key-value entries.