File: detection_algorithms.rst

package info (click to toggle)
thumbor 7.7.7-5
  • links: PTS, VCS
  • area: main
  • in suites: forky, sid
  • size: 40,556 kB
  • sloc: xml: 295,435; python: 18,673; ansic: 1,479; makefile: 360; sh: 27
file content (179 lines) | stat: -rw-r--r-- 6,143 bytes parent folder | download
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
Detection Algorithms
====================

If the smart mode of thumbor has been specified in the uri (by the
/smart portion of it), thumbor will use it's smart detectors to find
focal points.

thumbor comes pre-packaged with two focal-point detection algorithms:
facial and feature. First it tries to identify faces and if it can't
find any, it tries to identify features (more on that below).

Facial Detection
----------------

For instructions on how to get facial detection coordinates see
:ref:`usage-metadata-endpoint` .

.. note::
    thumbor uses OpenCV (http://opencv.org) to detect faces.
    OpenCV returns the rectangle coordinates for the faces it identifies.
    You can specify the HAAR file Thumbor should use for identification.

Original image
~~~~~~~~~~~~~~

.. image:: images/tom_before_brightness.jpg
    :alt: Original image

Image after detection
~~~~~~~~~~~~~~~~~~~~~

Notice how white rectangle show the area identified as face:

.. image:: images/face_detection_alt.jpg
    :alt: White rectangle is the area identified as face

After retrieving these squares from OpenCV, thumbor calculates the
*center of mass* of the image using *weighted average*.

Consider that OpenCV returned 3 squares at *(10, 10, 100, 100)*, *(150,
100, 100, 100)*, *(300, 300, 80, 50)*, being *(x, y, width, height)*, as
such:

.. image:: images/faces_found_example.png
    :alt: Faces at 10x10, 150x100, 300x300

In order to find the *center of mass* for all the faces, we must first
find the center and weight of each rectangle. We define weight in this
scenario as the area of the rectangle.

So, for the faces in our example (***x***,\ ***y*** being the
coordinates of the rectangle's center and ***z*** the rectangle weight):

**Face 1**:

- :math:`x = (10 + 100) ÷ 2 = 55`
- :math:`y = (10 + 100) ÷ 2 = 55`
- :math:`z = 100 * 100 = 10000`

**Face 2**:

- :math:`x = (150 + 100) ÷ 2 = 125`
- :math:`y = (100 + 100) ÷ 2 = 100`
- :math:`z = 100 * 100 = 10000`

**Face 3**:

- :math:`x = (300 + 80) ÷ 2 = 190`
- :math:`y = (300 + 50) ÷ 2 = 175`
- :math:`z = 80 * 50 = 4000`

In order to find the *center of mass* we'll do a *weighted average* of
the X and Y coordinates of the faces using:

**Horizontal Axis - X**: :math:`((55 * 10000) + (125 * 10000) + (190 * 4000)) ÷ 24000 = 106`

**Vertical Axis - Y**: :math:`((55 * 10000) + (100 * 10000) + (175 * 4000)) ÷ 24000 = 93`

So for the faces found by OpenCV in that image we have the *center of
mass* of the picture being :math:`106x93`.

Using Focal Points for Cropping
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~

After finding the center of mass we can use it as the focal point for
cropping. Given an image of :math:`800x600` and a focal point at :math:`106x93`, we
need to determine the percentage that needs to be cropped from the top,
bottom, left and right sides of the image.

To determine the percentage we use simple math to figure how far from
the top and the left side the *center of mass* is:

**From the left** - :math:`left = 106 ÷ 800 * 100 = 13.25\%`

**From the top** - :math:`top = 93 ÷ 600 * 100 = 15.50\%`

Using the same example from the :doc:`crop_and_resize_algorithms` page, we
need to crop :math:`300px` out of the height of the image. In possession of
the percentages of crop above, we can calculate how much we need to crop
out of the top and bottom with:

**Top** - :math:`top = 300 * 0.155 ~= 46`

**Bottom** - just subtract top :math:`46px` from the amount of crop (``300px``): :math:`bottom = 300 - 46 = 254`

So, now we now we have to remove :math:`46px` out of the top of the picture
and :math:`254px` out of the bottom of the picture. In an :math:`800x600` picture,
that means cropping from *(0, 46)* to *(800, 346)*, resulting in an
:math:`800x300px` image.

Assuming we would crop :math:`300px` horizontally, the cropping would be:

**Left** - :math:`left = 300 * 0.135 ~= 40`

**Right** - just subtract left :math:`40px` from the amount of crop
(``300px``): :math:`right = 300 - 40 = 260`

In an image of :math:`800x600`, that means cropping from *(40, 0)* to *(540,
600)*, resulting in a :math:`500x600px` image. This would not be the case for
this image, though.

Feature Detection
-----------------

If no faces are found in the picture, we still try to find important
features in the image, provided by the Good Features to Track Algorithm
in OpenCV (http://bit.ly/evAU95).

According to OpenCV documentation, this algorithm finds *"important"*
corners in the image. It then returns a list of *(x, y)* values.

We can see the detection taking place in the following images:

.. image:: images/dice_transparent_background.png
    :alt: Original image

The points identified by the good features algorithm:

.. image:: images/feature_detection.jpg
    :alt: The points identified by the good features algorithm

The cropping based in these features is analogous to the face one,
except that all points have a weight of ***1.0*** and are already their
centers.

Let's consider that we found **3** feature points: *10x15*, *30x40*,
*25x60*. To find the center of mass we would do ((10 + 30 + 25) / 3 ~=
22) to find the horizontal component and ((15 + 40 + 60) / 3 ~= 39) for
the vertical one. This means that our center of mass in this scenario is
***22x39***.

Given an image of *800x600* and a center of mass of *22x39*, let's find
the left and top percentages:

**From the left** - :math:`22 / 800 * 100 = 2.75\%`

**From the top** - :math:`93 / 600 * 100 = 6.50\%`

Assuming we are cropping *300px* of the height, we'll crop top and
bottom according to:

**Top** - :math:`300 * 0.0275 ~= 9`

**Bottom** - just subtract top (*9px*) from the amount of crop (*300px*)
- :math:`300 - 9 = 291`

In an image of *800x600*, that means cropping from *(0, 9)* to *(800,
309)*, resulting in a *800x300px* image.

If we were cropping *300px* of the width instead, we would crop left and
right according to:

**Left** - :math:`300 * 0.065 ~= 20`

**Right** - just subtract left (*20px*) from the amount of crop
(*300px*) - :math:`300 - 20 = 280`

In an image of *800x600*, that means cropping from *(20, 0)* to *(520,
600)*, resulting in a *500x600px* image.