1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60 61 62 63 64 65 66 67 68 69 70 71 72 73 74 75 76 77 78 79 80 81 82 83 84 85 86 87 88 89 90 91 92 93 94 95 96 97 98 99 100 101 102 103 104 105 106 107 108 109 110 111 112 113 114 115 116 117 118 119 120 121 122 123 124 125 126 127 128 129 130 131 132 133 134 135 136 137 138 139 140 141 142 143 144 145 146 147 148 149 150 151 152 153 154 155 156 157 158 159 160 161 162 163 164 165
|
# Processors
The true power of *structlog* lies in its *combinable log processors*.
A log processor is a regular callable or in other words:
A function or an instance of a class with a `__call__()` method.
(chains)=
## Chains
The *processor chain* is a list of processors.
Each processors receives three positional arguments:
**logger**
: Your wrapped logger object.
For example {class}`logging.Logger` or {class}`structlog.typing.FilteringBoundLogger` (default).
**method_name**
: The name of the wrapped method.
If you called `log.warning("foo")`, it will be `"warning"`.
**event_dict**
: Current context together with the current event.
If the context was `{"a": 42}` and the event is `"foo"`, the initial `event_dict` will be `{"a":42, "event": "foo"}`.
The return value of each processor is passed on to the next one as `event_dict` until finally the return value of the last processor gets passed into the wrapped logging method.
:::{note}
*structlog* only looks at the return value of the **last** processor.
That means that as long as you control the next processor in the chain (the processor that will get your return value passed as an argument), you can return whatever you want.
Returning a modified event dictionary from your processors is just a convention to make processors composable.
:::
### Examples
If you set up your logger like:
```python
structlog.configure(processors=[f1, f2, f3])
log = structlog.get_logger().bind(x=42)
```
and call `log.info("some_event", y=23)`, it results in the following call chain:
```python
wrapped_logger.info(
f3(wrapped_logger, "info",
f2(wrapped_logger, "info",
f1(wrapped_logger, "info", {"event": "some_event", "x": 42, "y": 23})
)
)
)
```
In this case, `f3` has to make sure it returns something `wrapped_logger.info` can handle (see {ref}`adapting`).
For the example with `PrintLogger` above, this means `f3` must return a string.
The simplest modification a processor can make is adding new values to the `event_dict`.
Parsing human-readable timestamps is tedious, not so [UNIX timestamps](https://en.wikipedia.org/wiki/UNIX_time) -- let's add one to each log entry:
```python
import calendar
import time
def timestamper(logger, log_method, event_dict):
event_dict["timestamp"] = calendar.timegm(time.gmtime())
return event_dict
```
:::{important}
You're explicitly allowed to modify the `event_dict` parameter, because a copy has been created before calling the first processor.
:::
Please note that *structlog* comes with such a processor built in: {class}`~structlog.processors.TimeStamper`.
## Filtering
If a processor raises {class}`structlog.DropEvent`, the event is silently dropped.
Therefore, the following processor drops every entry:
```python
from structlog import DropEvent
def dropper(logger, method_name, event_dict):
raise DropEvent
```
But we can do better than that!
(cond-drop)=
How about dropping only log entries that are marked as coming from a certain peer (for example, monitoring)?
```python
class ConditionalDropper:
def __init__(self, peer_to_ignore):
self._peer_to_ignore = peer_to_ignore
def __call__(self, logger, method_name, event_dict):
"""
>>> cd = ConditionalDropper("127.0.0.1")
>>> cd(None, "", {"event": "foo", "peer": "10.0.0.1"})
{'peer': '10.0.0.1', 'event': 'foo'}
>>> cd(None, "", {"event": "foo", "peer": "127.0.0.1"})
Traceback (most recent call last):
...
DropEvent
"""
if event_dict.get("peer") == self._peer_to_ignore:
raise DropEvent
return event_dict
```
Since it's so common to filter by the log level, *structlog* comes with {func}`structlog.make_filtering_bound_logger` that filters log entries before they even enter the processor chain.
It does **not** use the standard library, but it does use its names and order of log levels.
(adapting)=
## Adapting and rendering
An important role is played by the *last* processor because its duty is to adapt the `event_dict` into something the logging methods of the *wrapped logger* understand.
With that, it's also the *only* processor that needs to know anything about the underlying system.
It can return one of three types:
- An Unicode string ({any}`str`), a bytes string ({any}`bytes`), or a {any}`bytearray` that is passed as the first (and only) positional argument to the underlying logger.
- A tuple of `(args, kwargs)` that are passed as `log_method(*args, **kwargs)`.
- A dictionary which is passed as `log_method(**kwargs)`.
Therefore `return "hello world"` is a shortcut for `return (("hello world",), {})` (the example in {ref}`chains` assumes this shortcut has been taken).
This should give you enough power to use *structlog* with any logging system while writing agnostic processors that operate on dictionaries.
:::{versionchanged} 14.0.0 Allow final processor to return a {any}`dict`.
:::
:::{versionchanged} 20.2.0 Allow final processor to return a {any}`bytes`.
:::
:::{versionchanged} 21.2.0 Allow final processor to return a {any}`bytearray`.
:::
### Examples
The probably most useful formatter for string based loggers is {class}`structlog.processors.JSONRenderer`.
Advanced log aggregation and analysis tools like [*Logstash*](https://www.elastic.co/logstash) offer features like telling them "this is JSON, deal with it" instead of fiddling with regular expressions.
For a list of shipped processors, check out the {ref}`API documentation <procs>`.
## Third-Party packages
*structlog* was specifically designed to be as composable and reusable as possible, so whatever you're missing:
chances are, you can solve it with a processor!
Since processors are self-contained callables, it's easy to write your own and to share them in separate packages.
We collect those packages in our [GitHub Wiki](https://github.com/hynek/structlog/wiki/Third-Party-Extensions) and encourage you to add your package too!
|