File: VPL_prg_transcoding.rst

package info (click to toggle)
libvpl 1%3A2.15.0-1
  • links: PTS, VCS
  • area: main
  • in suites: forky, sid
  • size: 21,584 kB
  • sloc: cpp: 92,593; ansic: 6,143; python: 4,312; sh: 323; makefile: 7
file content (150 lines) | stat: -rw-r--r-- 6,315 bytes parent folder | download | duplicates (2)
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
.. SPDX-FileCopyrightText: 2019-2020 Intel Corporation
..
.. SPDX-License-Identifier: CC-BY-4.0

======================
Transcoding Procedures
======================

The application can use |vpl_short_name| encoding, decoding, and video processing
functions together for transcoding operations. This section describes the key
aspects of connecting two or more |vpl_short_name| functions together.

---------------------
Asynchronous Pipeline
---------------------

The application passes the output of an upstream |vpl_short_name| function to the input of
the downstream |vpl_short_name| function to construct an asynchronous pipeline. Pipeline
construction is done at runtime and can be dynamically changed, as shown in the
following example:

.. literalinclude:: ../snippets/prg_transcoding.c
   :language: c++
   :start-after: /*beg1*/
   :end-before: /*end1*/
   :lineno-start: 1

|vpl_short_name| simplifies the requirements for asynchronous pipeline synchronization.
The application only needs to synchronize after the last |vpl_short_name| function. Explicit
synchronization of intermediate results is not required and may slow performance.

|vpl_short_name| tracks dynamic pipeline construction and verifies dependency on input
and output parameters to ensure the execution order of the pipeline function.
In the previous example, |vpl_short_name| will ensure :cpp:func:`MFXVideoENCODE_EncodeFrameAsync`
does not begin its operation until :cpp:func:`MFXVideoDECODE_DecodeFrameAsync` or
:cpp:func:`MFXVideoVPP_RunFrameVPPAsync` has finished.

During the execution of an asynchronous pipeline, the application must consider
the input data as "in use" and must not change it until the execution has completed.
The application must also consider output data unavailable until the execution
has finished. In addition, for encoders, the application must consider extended
and payload buffers as "in use" while the input surface is locked.

|vpl_short_name| checks dependencies by comparing the input and output parameters of each
|vpl_short_name| function in the pipeline. Do not modify the contents of input and output
parameters before the previous asynchronous operation finishes. Doing so will
break the dependency check and can result in undefined behavior. An exception
occurs when the input and output parameters are structures, in which case
overwriting fields in the structures is allowed.

.. note:: The dependency check works on the pointers to the structures only.

There are two exceptions with respect to intermediate synchronization:

- If the input is from any asynchronous operation, the application must
  synchronize any input before calling the |vpl_short_name| :cpp:func:`MFXVideoDECODE_DecodeFrameAsync`
  function.
- When the application calls an asynchronous function to generate an output
  surface in video memory and passes that surface to a non-|vpl_short_name| component, it must
  explicitly synchronize the operation before passing the surface to the non-|vpl_short_name|
  component.

.. _surface_pool_alloc:

-----------------------
Surface Pool Allocation
-----------------------

When connecting API function **A** to API function **B**, the application must
take into account the requirements of both functions to calculate the number of
frame surfaces in the surface pool. Typically, the application can use the formula
**Na+Nb**, where **Na** is the frame surface requirements for |vpl_short_name| function **A**
output, and **Nb** is the frame surface requirements for |vpl_short_name| function **B** input.

For performance considerations, the application must submit multiple operations
and delay synchronization as much as possible, which gives |vpl_short_name| flexibility
to organize internal pipelining. For example, compare the following two operation
sequences, where the first sequence is the recommended order:

.. graphviz::
   :caption: Recommended operation sequence

   digraph {
      rankdir=LR;
      labelloc="t";
      label="Operation sequence 1";
      f1 [shape=record label="ENCODE(F1)" ];
      f2 [shape=record label="ENCODE(F2)" ];
      f3 [shape=record label="SYNC(F1)" ];
      f4 [shape=record label="SYNC(F2)" ];
      f1->f2->f3->f4;
   }


.. graphviz::
   :caption: Operation sequence - not recommended

   digraph {
      rankdir=LR;
      labelloc="t";
      label="Operation sequence 2";
      f1 [shape=record label="ENCODE(F1)" ];
      f2 [shape=record label="ENCODE(F2)" ];
      f3 [shape=record label="SYNC(F1)" ];
      f4 [shape=record label="SYNC(F2)" ];
      f1->f3->f2->f4;
   }



In this example, the surface pool needs additional surfaces to take into account
multiple asynchronous operations before synchronization. The application can use
the :cpp:member:`mfxVideoParam::AsyncDepth` field to inform a |vpl_short_name| function of
the number of asynchronous operations the application plans to perform before
synchronization. The corresponding |vpl_short_name| **QueryIOSurf** function will reflect
this number in the :cpp:member:`mfxFrameAllocRequest::NumFrameSuggested`
value. The following example shows a way of calculating the surface needs based
on :cpp:member:`mfxFrameAllocRequest::NumFrameSuggested` values:

.. literalinclude:: ../snippets/prg_transcoding.c
   :language: c++
   :start-after: /*beg2*/
   :end-before: /*end2*/
   :lineno-start: 1

------------------------
Pipeline Error Reporting
------------------------

During asynchronous pipeline construction, each pipeline stage function will
return a synchronization point (sync point). These synchronization points are
useful in tracking errors during the asynchronous pipeline operation.

For example, assume the following pipeline:

.. graphviz::

   digraph {
      rankdir=LR;
      A->B->C;
   }

The application synchronizes on sync point **C**. If the error occurs in
function **C**, then the synchronization returns the exact error code. If the
error occurs before function **C**, then the synchronization returns
:cpp:enumerator:`mfxStatus::MFX_ERR_ABORTED`. The application can then try to
synchronize on sync point **B**. Similarly, if the error occurs in function **B**,
the synchronization returns the exact error code, or else
:cpp:enumerator:`mfxStatus:: MFX_ERR_ABORTED`. The same logic applies if the
error occurs in function **A**.