File: hadoop.asciidoc

[[modules-gateway-hadoop]]
=== Hadoop Gateway

*The hadoop gateway is deprecated and will be removed in a future
version. Please use the
<<modules-gateway-local,local gateway>>
instead.*

The hadoop (HDFS) based gateway stores the cluster metadata and index
data in HDFS. Hadoop support is provided as a plugin. Installation is
explained https://github.com/elasticsearch/elasticsearch-hadoop[here];
alternatively, download the hadoop plugin and place it under the
`plugins` directory. Here is an example config to enable it:

[source,yaml]
--------------------------------------------------
gateway:
    type: hdfs
    hdfs:
        uri: hdfs://myhost:8022
--------------------------------------------------

[float]
==== Settings

The hadoop gateway requires two settings. The `gateway.hdfs.uri` setting
controls the URI used to connect to the hadoop cluster, for example
`hdfs://myhost:8022`. The `gateway.hdfs.path` setting controls the path
under which the gateway stores its data.
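
As a minimal sketch, both settings could be combined as follows. The host
and port repeat the example URI above; the path value is a hypothetical
placeholder, not a default:

[source,yaml]
--------------------------------------------------
gateway:
    type: hdfs
    hdfs:
        # URI of the hadoop cluster to connect to
        uri: hdfs://myhost:8022
        # hypothetical location; choose a path writable by the nodes
        path: /elasticsearch/gateway
--------------------------------------------------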

[float]
==== concurrent_streams

The `gateway.hdfs.concurrent_streams` setting throttles the number of
streams (per node) opened against the shared gateway while performing
the snapshot operation. It defaults to `5`.
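
For illustration, a node could lower the limit like this. The value `2`
is an arbitrary assumption chosen for the example, not a recommendation:

[source,yaml]
--------------------------------------------------
gateway:
    type: hdfs
    hdfs:
        uri: hdfs://myhost:8022
        # allow at most 2 concurrent snapshot streams per node (default: 5)
        concurrent_streams: 2
--------------------------------------------------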