File: massage_html.awk

package info (click to toggle)
texlive-lang 2022.20230122-1
  • links: PTS, VCS
  • area: main
  • in suites: bookworm
  • size: 1,447,264 kB
  • sloc: perl: 61,377; xml: 53,781; makefile: 4,525; sh: 4,338; ansic: 2,892; python: 2,861; ruby: 1,031; lisp: 750; awk: 649; java: 159; sed: 142; csh: 25
file content (27 lines) | stat: -rw-r--r-- 634 bytes parent folder | download | duplicates (4)
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
# massage_html.awk
# Part of latex-doc-ptr
#   make4ht loses the subsection run-in headers.  It puts the resulting material
# on two lines.  So this small state machine passes through the .html file
# and creates subsection classes from paragraphs.
#
# 2020-Dec-31 Jim Hefferon

BEGIN { lastLine = ""}

# Look for second line of two-line patterns for subsections
/^class="Spectral-Bold-lf-t-1x-x-109">/ {
  if(lastLine == "<span ") {
      printf("<p class=\"subsection\">%s\n", lastLine)
  } else {
      print lastLine
  }
}

!/^class="Spectral-Bold-lf-t-1x-x-109">/ {
    print lastLine
}

{ lastLine = $0 }

END {print lastLine}