File: README.md

package info (click to toggle)
golang-github-vividcortex-gohistogram 1.0.0-2
  • links: PTS, VCS
  • area: main
  • in suites: bookworm, forky, sid, trixie
  • size: 172 kB
  • sloc: makefile: 2
file content (80 lines) | stat: -rw-r--r-- 2,565 bytes parent folder | download
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
# gohistogram - Histograms in Go

![build status](https://circleci.com/gh/VividCortex/gohistogram.png?circle-token=d37ec652ea117165cd1b342400a801438f575209)

This package provides [Streaming Approximate Histograms](https://vividcortex.com/blog/2013/07/08/streaming-approximate-histograms/)
for efficient quantile approximations.

The histograms in this package are based on the algorithms found in
Ben-Haim & Yom-Tov's *A Streaming Parallel Decision Tree Algorithm*
([PDF](http://jmlr.org/papers/volume11/ben-haim10a/ben-haim10a.pdf)).
Histogram bins do not have a preset size. As values stream into
the histogram, bins are dynamically added and merged.

Another implementation can be found in the Apache Hive project (see
[NumericHistogram](http://hive.apache.org/docs/r0.11.0/api/org/apache/hadoop/hive/ql/udf/generic/NumericHistogram.html)).

An example:

![histogram](http://i.imgur.com/5OplaRs.png)

The accurate method of calculating quantiles (like percentiles) requires
data to be sorted. Streaming histograms make it possible to approximate
quantiles without sorting (or even individually storing) values.

NumericHistogram is the more basic implementation of a streaming
histogram. WeightedHistogram implements bin values as exponentially-weighted
moving averages.

A maximum bin size is passed as an argument to the constructor methods. A
larger bin size yields more accurate approximations at the cost of increased
memory utilization and performance.

A picture of kittens:

![stack of kittens](http://i.imgur.com/QxRTWAE.jpg)

## Getting started

### Using in your own code

    $ go get github.com/VividCortex/gohistogram
    
```go
import "github.com/VividCortex/gohistogram"
```

### Running tests and making modifications

Get the code into your workspace:

    $ cd $GOPATH
    $ git clone git@github.com:VividCortex/gohistogram.git ./src/github.com/VividCortex/gohistogram

You can run the tests now:

    $ cd src/github.com/VividCortex/gohistogram
    $ go test .

## API Documentation

Full source documentation can be found [here][godoc].

[godoc]: http://godoc.org/github.com/VividCortex/gohistogram

## Contributing

We only accept pull requests for minor fixes or improvements. This includes:

* Small bug fixes
* Typos
* Documentation or comments

Please open issues to discuss new features. Pull requests for new features will be rejected,
so we recommend forking the repository and making changes in your fork for your use case.

## License

Copyright (c) 2013 VividCortex

Released under MIT License. Check `LICENSE` file for details.