File: control

boilerpipe 1.2.0-1
  • links: PTS, VCS
  • area: main
  • in suites: bullseye, buster, jessie, jessie-kfreebsd, sid, stretch
  • size: 524 kB
  • sloc: java: 4,298; xml: 187; makefile: 18
Source: boilerpipe
Section: java
Priority: optional
Maintainer: Debian Java Maintainers <pkg-java-maintainers@lists.alioth.debian.org>
Uploaders: Emmanuel Bourg <ebourg@apache.org>
Build-Depends: debhelper (>= 8), cdbs, default-jdk
Build-Depends-Indep: maven-repo-helper, ant (>= 1.6.5), javahelper, libnekohtml-java, libxerces2-java
Standards-Version: 3.9.4
Vcs-Svn: svn://anonscm.debian.org/pkg-java/trunk/boilerpipe
Vcs-Browser: http://anonscm.debian.org/viewvc/pkg-java/trunk/boilerpipe
Homepage: http://code.google.com/p/boilerpipe

Package: libboilerpipe-java
Architecture: all
Depends: ${misc:Depends}, libnekohtml-java, libxerces2-java
Description: Boilerplate removal and fulltext extraction from HTML pages
 The boilerpipe library provides algorithms to detect and remove the surplus
 "clutter" (boilerplate, templates) around the main textual content of a web
 page.
 .
 The library already provides specific strategies for common tasks (for example,
 news article extraction) and can easily be extended for individual problem
 settings.
 .
 Extraction is very fast (milliseconds), needs only the input document
 (no global or site-level information required) and is usually quite accurate.