p5-Text-Language-Guess
0.02_1Trained module to guess a document's language
Text::Language::Guess guesses a document's language. Its implementation is simple: Using "Text::ExtractWords" and "Lingua::StopWords" from CPAN, it determines how many of the known stopwords the document contains for each language supported by "Lingua::StopWords". Each word in the document recognized as stopword of a particular language scores one point for this language. The "language_guess()" function takes a document as a parameter and returns the abbreviation of the language that it is most likely written in.
Origin: textproc/p5-Text-Language-Guess
Category: textproc
Size: 12.3KiB
License: not specified
Maintainer: markun@onohara.to
Dependencies: 4 packages
Required by: 0 packages
$
pkg install p5-Text-Language-GuessDependencies (4)
More in textproc
libxml22.15.2
XML parser library for GNOMEexpat2.7.4
XML 1.0 parser written in Cqt5-xml5.15.18p109
Qt SAX and DOM implementations (KDE patched)kf6-kcodecs6.22.0
String encoding librarylibxslt1.1.45
XML stylesheet transformation libraryrubygem-nokogiri1.19.1
HTML, XML, SAX, and Reader parseraspell0.60.8.1_1,1
Spelling checker with better suggestion logic than ispellphp84-xml8.4.16
The xml shared extension for phpkf6-sonnet6.22.0
Multi-language spell checkerp5-XML-LibXML2.0210_1,1
Interface to Gnome libxml2 library