1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60 61 62 63 64 65 66 67 68 69 70 71 72 73 74 75 76 77 78 79 80 81 82 83 84 85 86 87 88 89 90 91 92 93 94 95 96 97 98 99 100 101 102 103 104 105 106 107 108 109 110 111 112 113 114 115 116 117 118 119 120 121 122 123 124 125 126 127 128 129 130 131 132 133 134 135 136 137 138 139 140 141
|
Miscellaneous
=============
Client configuring from URL
---------------------------
*New in version 0.1.1.*
Client can be configured from the given URL:
.. code-block:: python
>>> from clickhouse_driver import Client
>>> client = Client.from_url(
... 'clickhouse://login:password@host:port/database'
... )
Port 9000 is default for schema ``clickhouse``, port 9440 is default for schema ``clickhouses``.
Connection to default database:
.. code-block:: python
>>> client = Client.from_url('clickhouse://localhost')
Querystring arguments will be passed along to the :meth:`~clickhouse_driver.connection.Connection` class’s initializer:
.. code-block:: python
>>> client = Client.from_url(
... 'clickhouse://localhost/database?send_logs_level=trace&'
... 'client_name=myclient&'
... 'compression=lz4'
... )
If parameter doesn't match Connection's init signature will be treated as settings parameter.
.. _insert-from-csv-file:
Inserting data from CSV file
----------------------------
Let's assume you have following data in CSV file.
.. code-block:: shell
$ cat /tmp/data.csv
time,order,qty
2019-08-01 15:23:14,New order1,5
2019-08-05 09:14:45,New order2,3
2019-08-13 12:20:32,New order3,7
Data can be inserted into ClickHouse in the following way:
.. code-block:: python
>>> from csv import DictReader
>>> from datetime import datetime
>>>
>>> from clickhouse_driver import Client
>>>
>>>
>>> def iter_csv(filename):
... converters = {
... 'qty': int,
... 'time': lambda x: datetime.strptime(x, '%Y-%m-%d %H:%M:%S')
... }
...
... with open(filename, 'r') as f:
... reader = DictReader(f)
... for line in reader:
... yield {k: (converters[k](v) if k in converters else v) for k, v in line.items()}
...
>>> client = Client('localhost')
>>>
>>> client.execute(
... 'CREATE TABLE IF NOT EXISTS data_csv '
... '('
... 'time DateTime, '
... 'order String, '
... 'qty Int32'
... ') Engine = Memory'
... )
>>> []
>>> client.execute('INSERT INTO data_csv VALUES', iter_csv('/tmp/data.csv'))
3
Table can be populated with json file in the similar way.
Adding missed settings
----------------------
It's hard to keep package settings in consistent state with ClickHouse
server's. Some settings can be missed if your server is old. But, if setting
is *supported by your server* and missed in the package it can be added by
simple monkey pathing. Just look into ClickHouse server source and pick
corresponding setting type from package or write your own type.
.. code-block:: python
>>> from clickhouse_driver.settings.available import settings as available_settings, SettingBool
>>> from clickhouse_driver import Client
>>>
>>> available_settings['allow_suspicious_low_cardinality_types'] = SettingBool
>>>
>>> client = Client('localhost', settings={'allow_suspicious_low_cardinality_types': True})
>>> client.execute('CREATE TABLE test (x LowCardinality(Int32)) Engine = Null')
[]
*New in version 0.1.5.*
Modern ClickHouse servers (20.*+) use text serialization for settings instead of
binary serialization. You don't have to add missed settings manually into
available. Just specify new settings and it will work.
.. code-block:: python
>>> client = Client('localhost', settings={'brand_new_setting': 42})
>>> client.execute('SELECT 1')
Inserting NULL into NOT NULL columns
------------------------------------
*New in version 0.2.4.*
Client option ``input_format_null_as_default`` does the same thing as in
``clickhouse-client``. But in this package it's disabled by default. You should
enable it if you want cast ``None`` value into default value for current type:
.. code-block:: python
>>> settings = {'input_format_null_as_default': True}
>>> client = Client('localhost', settings=settings)
|