1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60 61 62 63 64 65 66 67 68 69 70 71 72 73 74 75 76 77 78 79 80 81 82 83 84 85 86 87 88 89 90 91 92 93 94 95 96 97 98 99 100 101 102 103 104 105 106 107 108 109 110 111 112 113 114 115 116 117 118 119 120 121 122 123 124 125 126 127 128 129 130 131 132 133 134 135 136 137 138 139 140 141 142 143 144 145 146 147 148 149 150 151 152 153 154 155 156 157 158 159 160 161 162 163 164 165 166 167 168 169 170 171 172 173 174 175 176 177 178 179 180 181 182 183 184 185 186 187 188 189 190 191 192 193 194 195 196 197 198 199 200 201 202 203 204 205 206 207 208 209 210 211 212 213 214 215 216 217 218 219 220 221 222 223 224 225 226 227 228 229 230 231 232 233 234 235 236 237 238 239 240 241 242 243 244 245 246 247 248 249 250 251 252 253 254 255 256 257 258 259 260 261 262 263 264 265 266 267 268 269 270 271 272 273 274 275 276 277 278 279 280 281 282 283 284 285 286 287 288 289 290 291 292 293 294 295 296 297 298 299 300 301 302 303 304 305 306 307 308 309 310 311 312 313 314 315 316 317 318 319 320 321 322 323 324 325 326 327 328 329 330 331 332 333 334 335 336 337 338 339 340 341 342 343 344 345 346 347 348 349 350 351 352 353 354 355 356 357 358 359 360 361 362 363 364 365 366 367 368 369 370 371 372 373 374 375 376 377 378 379 380 381 382 383 384 385 386 387 388 389 390 391 392 393 394 395 396 397 398 399 400 401 402 403 404 405 406 407 408 409 410 411 412 413 414 415 416 417 418 419 420 421 422 423 424 425 426 427 428 429 430 431 432 433 434 435 436 437 438 439 440 441 442 443 444 445 446 447 448 449 450 451 452
|
.. include:: ./snippets/postgres_doc_links.rst
.. warning::
Table partitioning is a relatively new and advanded PostgreSQL feature. It has plenty of ways to shoot yourself in the foot with.
We HIGHLY RECOMMEND you only use this feature if you're already deeply familiar with table partitioning and aware of its advantages and disadvantages.
Do study the PostgreSQL documentation carefully.
.. _table_partitioning_page:
Table partitioning
==================
:class:`~psqlextra.models.PostgresPartitionedModel` adds support for `PostgreSQL Declarative Table Partitioning`_.
The following partitioning methods are available:
* ``PARTITION BY RANGE``
* ``PARTITION BY LIST``
* ``PARTITION BY HASH``
.. note::
Although table partitioning is available in PostgreSQL 10.x, it is highly recommended you use PostgresSQL 11.x. Table partitioning got a major upgrade in PostgreSQL 11.x.
PostgreSQL 10.x does not support creating foreign keys to/from partitioned tables and does not automatically create an index across all partitions.
Creating partitioned tables
---------------------------
Partitioned tables are declared like regular Django models with a special base class and two extra options to set the partitioning method and key. Once declared, they behave like regular Django models.
Declaring the model
*******************
Inherit your model from :class:`psqlextra.models.PostgresPartitionedModel` and declare a child class named ``PartitioningMeta``. On the meta class, specify the partitioning method and key.
* Use :attr:`psqlextra.types.PostgresPartitioningMethod.RANGE` to ``PARTITION BY RANGE``
* Use :attr:`psqlextra.types.PostgresPartitioningMethod.LIST` to ``PARTITION BY LIST``
* Use :attr:`psqlextra.types.PostgresPartitioningMethod.HASH` to ``PARTITION BY HASH``
.. code-block:: python
from django.db import models
from psqlextra.types import PostgresPartitioningMethod
from psqlextra.models import PostgresPartitionedModel
class MyModel(PostgresPartitionedModel):
class PartitioningMeta:
method = PostgresPartitioningMethod.RANGE
key = ["timestamp"]
name = models.TextField()
timestamp = models.DateTimeField()
Generating a migration
**********************
Run the following command to automatically generate a migration:
.. code-block:: bash
python manage.py pgmakemigrations
This will generate a migration that creates the partitioned table with a default partition.
.. warning::
Always use ``python manage.py pgmakemigrations`` for partitioned models.
The model must be created by the :class:`~psqlextra.backend.migrations.operations.PostgresCreatePartitionedModel` operation.
Do not use the standard ``python manage.py makemigrations`` command for partitioned models. Django will issue a standard :class:`~django:django.db.migrations.operations.CreateModel` operation. Doing this will not create a partitioned table and all subsequent operations will fail.
Automatically managing partitions
---------------------------------
The ``python manage.py pgpartition`` command can help you automatically create new partitions ahead of time and delete old ones for time-based partitioning.
You can run this command manually as needed, schedule to run it periodically or run it every time you release a new version of your app.
.. warning::
We DO NOT recommend that you set up this command to automatically delete partitions without manual review.
Specify ``--skip-delete`` to not delete partitions automatically. Run the command manually periodically without the ``--yes`` flag to review partitions to be deleted.
Command-line options
********************
==================== ============= ================ ==================================================================================================== === === === === === ===
Long flag Short flag Default Description
==================== ============= ================ ==================================================================================================== === === === === === ===
``--yes`` ``-y`` ``False`` Specifies yes to all questions. You will NOT be asked for confirmation before partition deletion.
``--using`` ``-u`` ``'default'`` Optional name of the database connection to use.
``--skip-create`` ``False`` Whether to skip creating partitions.
``--skip-delete`` ``False`` Whether to skip deleting partitions.
==================== ============= ================ ==================================================================================================== === === === === === ===
Configuration
*************
In order to use the command, you have to declare an instance of :class:`psqlextra.partitioning.PostgresPartitioningManager` and set ``PSQLEXTRA_PARTITIONING_MANAGER`` to a string with the import path to your instance of :class:`psqlextra.partitioning.PostgresPartitioningManager`.
For example:
.. code-block:: python
# myapp/partitioning.py
from psqlextra.partitioning import PostgresPartitioningManager
manager = PostgresPartitioningManager(...)
# myapp/settings.py
PSQLEXTRA_PARTITIONING_MANAGER = 'myapp.partitioning.manager'
Time-based partitioning
~~~~~~~~~~~~~~~~~~~~~~~
.. code-block:: python
from dateutil.relativedelta import relativedelta
from psqlextra.partitioning import (
PostgresPartitioningManager,
PostgresCurrentTimePartitioningStrategy,
PostgresTimePartitionSize,
partition_by_current_time,
)
from psqlextra.partitioning.config import PostgresPartitioningConfig
manager = PostgresPartitioningManager([
# 3 partitions ahead, each partition is one month
# delete partitions older than 6 months
# partitions will be named `[table_name]_[year]_[3-letter month name]`.
PostgresPartitioningConfig(
model=MyPartitionedModel,
strategy=PostgresCurrentTimePartitioningStrategy(
size=PostgresTimePartitionSize(months=1),
count=3,
max_age=relativedelta(months=6),
),
),
# 6 partitions ahead, each partition is two weeks
# delete partitions older than 8 months
# partitions will be named `[table_name]_[year]_week_[week number]`.
PostgresPartitioningConfig(
model=MyPartitionedModel,
strategy=PostgresCurrentTimePartitioningStrategy(
size=PostgresTimePartitionSize(weeks=2),
count=6,
max_age=relativedelta(months=8),
),
),
# 12 partitions ahead, each partition is 5 days
# old partitions are never deleted, `max_age` is not set
# partitions will be named `[table_name]_[year]_[month]_[month day number]`.
PostgresPartitioningConfig(
model=MyPartitionedModel,
strategy=PostgresCurrentTimePartitioningStrategy(
size=PostgresTimePartitionSize(days=5),
count=12,
),
),
])
Changing a time partitioning strategy
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
When switching partitioning strategies, you might encounter the problem that partitions for part of a particular range already exist.
In order to combat this, you can use the :class:`psqlextra.partitioning.PostgresTimePartitioningStrategy` and specify the `start_datetime` parameter. As a result, no partitions will be created before the given date/time.
Custom strategy
~~~~~~~~~~~~~~~
You can create a custom partitioning strategy by implementing the :class:`psqlextra.partitioning.PostgresPartitioningStrategy` interface.
You can look at :class:`psqlextra.partitioning.PostgresCurrentTimePartitioningStrategy` as an example.
Manually managing partitions
----------------------------
If you are using list or hash partitioning, you most likely have a fixed amount of partitions that can be created up front using migrations or using the schema editor.
Using migration operations
**************************
Adding a range partition
~~~~~~~~~~~~~~~~~~~~~~~~
Use the :class:`~psqlextra.backend.migrations.operations.PostgresAddRangePartition` operation to add a new range partition. Only use this operation when your partitioned model uses :attr:`psqlextra.types.PostgresPartitioningMethod.RANGE`.
.. code-block:: python
from django.db import migrations, models
from psqlextra.backend.migrations.operations import PostgresAddRangePartition
class Migration(migrations.Migration):
operations = [
PostgresAddRangePartition(
model_name="mypartitionedmodel",
name="pt1",
from_values="2019-01-01",
to_values="2019-02-01",
),
]
Adding a list partition
~~~~~~~~~~~~~~~~~~~~~~~
Use the :class:`~psqlextra.backend.migrations.operations.PostgresAddListPartition` operation to add a new list partition. Only use this operation when your partitioned model uses :attr:`psqlextra.types.PostgresPartitioningMethod.LIST`.
.. code-block:: python
from django.db import migrations, models
from psqlextra.backend.migrations.operations import PostgresAddListPartition
class Migration(migrations.Migration):
operations = [
PostgresAddListPartition(
model_name="mypartitionedmodel",
name="pt1",
values=["car", "boat"],
),
]
Adding a hash partition
~~~~~~~~~~~~~~~~~~~~~~~
Use the :class:`~psqlextra.backend.migrations.operations.PostgresAddHashPartition` operation to add a new list partition. Only use this operation when your partitioned model uses :attr:`psqlextra.types.PostgresPartitioningMethod.HASH`.
.. code-block:: python
from django.db import migrations, models
from psqlextra.backend.migrations.operations import PostgresAddHashPartition
class Migration(migrations.Migration):
operations = [
PostgresAddHashPartition(
model_name="mypartitionedmodel",
name="pt1",
modulus=3,
remainder=1,
),
]
Adding a default partition
~~~~~~~~~~~~~~~~~~~~~~~~~~
Use the :class:`~psqlextra.backend.migrations.operations.PostgresAddDefaultPartition` operation to add a new list partition.
Note that you can only have one default partition per partitioned table/model. An error will be thrown if you try to create a second default partition.
If you used ``python manage.py pgmakemigrations`` to generate a migration for your newly created partitioned model, you do not need this operation. This operation is added automatically when you create a new partitioned model.
.. code-block:: python
from django.db import migrations, models
from psqlextra.backend.migrations.operations import PostgresAddDefaultPartition
class Migration(migrations.Migration):
operations = [
PostgresAddDefaultPartition(
model_name="mypartitionedmodel",
name="default",
),
]
Deleting a default partition
~~~~~~~~~~~~~~~~~~~~~~~~~~~~
Use the :class:`~psqlextra.backend.migrations.operations.PostgresDeleteDefaultPartition` operation to delete an existing default partition.
.. warning::
Deleting the default partition and leaving your model without a default partition can be dangerous. Rows that do not fit in any other partition will fail to be inserted.
.. code-block:: python
from django.db import migrations, models
from psqlextra.backend.migrations.operations import PostgresDeleteDefaultPartition
class Migration(migrations.Migration):
operations = [
PostgresDeleteDefaultPartition(
model_name="mypartitionedmodel",
name="pt1",
),
]
Deleting a range partition
~~~~~~~~~~~~~~~~~~~~~~~~~~
Use the :class:`psqlextra.backend.migrations.operations.PostgresDeleteRangePartition` operation to delete an existing range partition. Only use this operation when your partitioned model uses :attr:`psqlextra.types.PostgresPartitioningMethod.RANGE`.
.. code-block:: python
from django.db import migrations, models
from psqlextra.backend.migrations.operations import PostgresDeleteRangePartition
class Migration(migrations.Migration):
operations = [
PostgresDeleteRangePartition(
model_name="mypartitionedmodel",
name="pt1",
),
]
Deleting a list partition
~~~~~~~~~~~~~~~~~~~~~~~~~
Use the :class:`psqlextra.backend.migrations.operations.PostgresDeleteListPartition` operation to delete an existing range partition. Only use this operation when your partitioned model uses :attr:`psqlextra.types.PostgresPartitioningMethod.LIST`.
.. code-block:: python
from django.db import migrations, models
from psqlextra.backend.migrations.operations import PostgresDeleteListPartition
class Migration(migrations.Migration):
operations = [
PostgresDeleteListPartition(
model_name="mypartitionedmodel",
name="pt1",
),
]
Deleting a hash partition
~~~~~~~~~~~~~~~~~~~~~~~~~
Use the :class:`psqlextra.backend.migrations.operations.PostgresDeleteHashPartition` operation to delete an existing range partition. Only use this operation when your partitioned model uses :attr:`psqlextra.types.PostgresPartitioningMethod.HASH`.
.. code-block:: python
from django.db import migrations, models
from psqlextra.backend.migrations.operations import PostgresDeleteHashPartition
class Migration(migrations.Migration):
operations = [
PostgresDeleteHashPartition(
model_name="mypartitionedmodel",
name="pt1",
),
]
Using the schema editor
***********************
Use the :class:`psqlextra.backend.PostgresSchemaEditor` to manage partitions directly in a more imperative fashion. The schema editor is used by the migration operations described above.
Adding a range partition
~~~~~~~~~~~~~~~~~~~~~~~~
.. code-block:: python
from django.db import connection
connection.schema_editor().add_range_partition(
model=MyPartitionedModel,
name="pt1",
from_values="2019-01-01",
to_values="2019-02-01",
)
Adding a list partition
~~~~~~~~~~~~~~~~~~~~~~~
.. code-block:: python
from django.db import connection
connection.schema_editor().add_list_partition(
model=MyPartitionedModel,
name="pt1",
values=["car", "boat"],
)
Adding a hash partition
~~~~~~~~~~~~~~~~~~~~~~~
.. code-block:: python
from django.db import connection
connection.schema_editor().add_hash_partition(
model=MyPartitionedModel,
name="pt1",
modulus=3,
remainder=1,
)
Adding a default partition
~~~~~~~~~~~~~~~~~~~~~~~~~~
.. code-block:: python
from django.db import connection
connection.schema_editor().add_default_partition(
model=MyPartitionedModel,
name="default",
)
Deleting a partition
~~~~~~~~~~~~~~~~~~~~
.. code-block:: python
from django.db import connection
connection.schema_editor().delete_partition(
model=MyPartitionedModel,
name="default",
)
|