File: linkextractor_no_href.html

package info (click to toggle)
python-scrapy 2.13.3-1
  • links: PTS, VCS
  • area: main
  • in suites: forky
  • size: 5,664 kB
  • sloc: python: 52,028; xml: 199; makefile: 25; sh: 7
file content (26 lines) | stat: -rw-r--r-- 740 bytes parent folder | download | duplicates (3)
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
<!DOCTYPE html>

<html>
  <head>
    <base href='http://example.com' />
    <title>Sample page with anchor tags containing no href attribute, to test the TextResponse.follow_all method</title>
  </head>

<body>
  <div class="quote">
    <span class="text">“The world as we have created it is a process of our
      thinking. It cannot be changed without changing our thinking.”</span>
    <span>
      by <small class="author">Albert Einstein</small>
      <a href="/author/Albert-Einstein">(about)</a>
    </span>
    <div id="pagination" class="pagination">
      Tags:
      <a href="/page/1/">Page 1</a>
      <a>Current</a>
      <a href="/page/3/">Page 3</a>
      <a href="/page/4/">Page 4</a>
    </div>
  </div>
</body>
</html>