# Augmentation

This example demonstrates how to use torchaudio's I/O features and augmentations in a C++ application.

**NOTE**
This example uses the `"sox_io"` backend, so it does not work on Windows.

## Steps
### 1. Create the augmentation pipeline TorchScript file

First, we implement our data processing pipeline in regular Python and save it as a TorchScript object.
We will load and execute it in our C++ application. The C++ code is found in [`main.cpp`](./main.cpp).

```bash
python create_jittable_pipeline.py \
    --rir-path "../data/rir.wav" \
    --output-path "./pipeline.zip"
```
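
For orientation, the following is a minimal sketch of what "implement in Python, save as TorchScript" can look like. It is not the contents of `create_jittable_pipeline.py`: the `Augment` module, the `export_and_reload` helper, and the random placeholder impulse response are all hypothetical, and the sketch uses only core `torch` calls (`torch.jit.script`, `save`, `torch.jit.load`) rather than the sox-based effects of the real script.

```python
import os
import tempfile

import torch


class Augment(torch.nn.Module):
    """Hypothetical pipeline: convolve a waveform with a room impulse response."""

    def __init__(self, rir: torch.Tensor):
        super().__init__()
        # Normalize the RIR energy, then flip it so that conv1d
        # (which computes cross-correlation) performs a true convolution.
        rir = rir / torch.norm(rir, p=2)
        self.register_buffer("rir", torch.flip(rir, [0]).view(1, 1, -1))

    def forward(self, waveform: torch.Tensor) -> torch.Tensor:
        # waveform has shape (channels, time). Left-pad so that the
        # output keeps the same number of samples as the input.
        pad = self.rir.shape[-1] - 1
        x = torch.nn.functional.pad(waveform.unsqueeze(1), [pad, 0])
        return torch.nn.functional.conv1d(x, self.rir).squeeze(1)


def export_and_reload(rir: torch.Tensor, path: str) -> torch.jit.ScriptModule:
    """Script the module, save it to `path`, and load it back."""
    pipeline = torch.jit.script(Augment(rir))
    pipeline.save(path)
    return torch.jit.load(path)


if __name__ == "__main__":
    # Placeholder impulse response; a real one would be read from a file.
    rir = torch.rand(8)
    with tempfile.TemporaryDirectory() as tmp:
        restored = export_and_reload(rir, os.path.join(tmp, "pipeline_sketch.zip"))
    waveform = torch.rand(1, 16000)
    print(restored(waveform).shape)
```

The saved archive is self-contained, which is what lets a C++ program load and run it without a Python interpreter.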

### 2. Build the application

Please refer to [the top-level README.md](../README.md).

### 3. Run the application

Now we run the C++ application `augment` with the TorchScript object we created in Step 1 and an input audio file.

From [the top-level directory](../), run:

```bash
input_audio_file="./data/input.wav"
./build/augmentation/augment ./augmentation/pipeline.zip "${input_audio_file}" "output.wav"
```

When you give it a clean speech file, the output audio sounds like a phone conversation.