FreeBSD.software

p5-Lingua-ZH-WordSegmenter

0.01_1 chinese

Simplified Chinese Word Segmentation

This is a Perl implementation of Simplified Chinese word segmentation. At each point, the segmenter searches for the longest word from both the left and the right, and chooses the candidate with the higher frequency product. The original program is the CPAN module Lingua::ZH::WordSegment (https://metacpan.org/author/CHENYR). I made the following changes: 1) made the interface object-oriented; 2) converted the internal strings to UTF-8; 3) used Sogou's dictionary (http://www.sogou.com/labs/dl/w.html) as the default dictionary.
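The matching strategy described above can be illustrated with a small sketch. It is written in Python for brevity and is not the module's code: the dictionary, its frequency values, the UNKNOWN fallback, and all function names are hypothetical, and comparing the two complete segmentations is only one possible reading of "choose the one with higher frequency product".

```python
from math import prod

# Hypothetical toy dictionary: word -> relative frequency (made-up values).
FREQ = {
    "研究": 0.005,
    "研究生": 0.002,
    "生命": 0.004,
    "命": 0.001,
    "的": 0.05,
    "起源": 0.001,
}
MAX_LEN = max(len(w) for w in FREQ)
UNKNOWN = 1e-6  # assumed frequency for single characters missing from FREQ


def forward(text):
    """Scan left to right, taking the longest dictionary word at each point."""
    words, i = [], 0
    while i < len(text):
        for n in range(min(MAX_LEN, len(text) - i), 0, -1):
            cand = text[i:i + n]
            if n == 1 or cand in FREQ:  # fall back to a single character
                words.append(cand)
                i += n
                break
    return words


def backward(text):
    """Scan right to left, taking the longest dictionary word at each point."""
    words, j = [], len(text)
    while j > 0:
        for n in range(min(MAX_LEN, j), 0, -1):
            cand = text[j - n:j]
            if n == 1 or cand in FREQ:
                words.append(cand)
                j -= n
                break
    return words[::-1]  # restore reading order


def score(words):
    """Product of the word frequencies of one segmentation."""
    return prod(FREQ.get(w, UNKNOWN) for w in words)


def segment(text):
    """Return whichever directional segmentation has the higher product."""
    f, b = forward(text), backward(text)
    return f if score(f) >= score(b) else b


print(segment("研究生命的起源"))
```

With these made-up frequencies, the left-to-right scan greedily takes 研究生 and leaves 命 stranded, while the right-to-left scan finds 研究 / 生命; the frequency product favours the latter.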

$ pkg install p5-Lingua-ZH-WordSegmenter
metacpan.org/release/Lingua-ZH-WordSegmenter
Origin: chinese/p5-Lingua-ZH-WordSegmenter
Size: 2.30MiB
License: ART10, GPLv1+
Maintainer: sunpoet@FreeBSD.org
Dependencies: 1 package
Required by: 0 packages

Dependencies (1)