File: table_ignore.html

python-html2text 2025.4.15-1
<!DOCTYPE html> <html>
    <head lang="en"> <meta charset="UTF-8"> <title></title> </head>
    <body> <h1>This is a test document</h1> With some text, <code>code</code>, <b>bolds</b> and <i>italics</i>.  <h2>This is second header</h2> <p style="display: none">Displaynone text</p> 
    <table>
        <tr> <th>Header 1</th> <th>Header 2</th> <th>Header 3</th> </tr>
        <tr> <td>Content 1</td> <td>2</td> <td><img src="http://lorempixel.com/200/200" alt="200"/> Image!</td> </tr>
        <tr> <td>Content 1 longer</td> <td>Content 2</td> <td>blah</td> </tr>
        <tr> <td>Content </td> <td>Content 2</td> <td>blah</td> </tr>
        <tr> <td>t </td> <td>Content 2</td> <td>blah blah blah</td> </tr>
    </table>


    <table> <tr> <th>H1</th> <th>H2</th> <th>H3</th> </tr>
        <tr> <td>C1</td> <td>Content 2</td> <td>x</td> </tr>
        <tr> <td>C123</td> <td>Content 2</td> <td>xyz</td> </tr>
    </table>

some content between the tables<br>

    <table> <tr> <th>Header 1</th> <th>Header 2</th> <th>Header 3</th> </tr>
        <tr> <td>Content 1</td> <td>Content 2</td> <td><img src="http://lorempixel.com/200/200" alt="200"/> Image!</td> </tr>
        <tr> <td>Content 1</td> <td>Content 2 longer</td> <td><img src="http://lorempixel.com/200/200" alt="200"/> Image!</td> </tr>
    </table>

something else entirely
</body> </html>