p5-HTML-ExtractMain 0.62_1
Perl extension to extract main content of a web page
HTML::ExtractMain is a module that takes HTML content and uses the Readability algorithm to detect the main body of the page, usually skipping headers, footers, navigation, and similar elements.
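A minimal usage sketch of the behavior described above, based on the module's exported `extract_main_html` function. The sample markup is made up for illustration, and the assumption that the function returns undef when no main content is found should be checked against the module's own documentation.

```perl
use strict;
use warnings;
use HTML::ExtractMain qw( extract_main_html );

# Hypothetical sample page: header, navigation, body, and footer blocks.
my $html = <<'END';
<div id="header">Header</div>
<div id="nav"><a href="/">Home</a></div>
<div id="body">
    <p>Foo</p>
    <p>Baz</p>
</div>
<div id="footer">Footer</div>
END

# Ask the module for the markup it judges to be the main content.
# Assumption: the result is an HTML string, or undef if nothing is found.
my $main_html = extract_main_html($html);

if ( defined $main_html ) {
    print $main_html, "\n";
}
else {
    warn "No main content detected\n";
}
```

Because the detection is heuristic, real pages may yield more or less markup than expected, so callers should handle the undefined case explicitly.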
Origin: www/p5-HTML-ExtractMain
Category: www
Size: 14.7KiB
License: ART10, GPLv1+
Maintainer: jnlin@freebsd.cs.nctu.edu.tw
Dependencies: 3 packages
Required by: 0 packages
$ pkg install p5-HTML-ExtractMain
Dependencies (3)
More in www
py311-requests 2.32.5: Python HTTP for Humans
p5-libwww 6.81: Perl5 library for WWW access
p5-HTML-Parser 3.83: Perl5 module for parsing HTML documents
php84-session 8.4.16: The session shared extension for php
p5-Catalyst-Runtime 5.90132_1: Elegant MVC Web Application Framework (Runtime)
py311-django42 4.2.29: High-level Python Web Framework
apache24 2.4.66: Version 2.4.x of Apache web server
p5-HTTP-Message 7.01: Representation of HTTP style messages
p5-Template-Toolkit 3.102: Extensible template processing system
p5-Plack 1.0051: Perl extension of PSGI reference implementation and utilities