Lingua::JA::NormalizeText

A text normalizer
Download

Lingua::JA::NormalizeText Ranking & Summary

Advertisement

  • Rating:
  • License:
  • Perl Artistic License
  • Price:
  • FREE
  • Publisher Name:
  • Kouhei Yoshioka
  • Publisher web site:
  • http://search.cpan.org/~pawapawa/

Lingua::JA::NormalizeText Tags


Lingua::JA::NormalizeText Description

Lingua::JA::NormalizeText is a Perl module that normalizes text.SYNOPSIS use Lingua::JA::NormalizeText; use utf8; my @options = ( qw/nfkc decode_entities/, \&dearinsu_to_desu ); my $normalizer = Lingua::JA::NormalizeText->new(@options); print $normalizer->normalize('鳥が㌧㌦でありんす♥'); # -> 鳥がトンドルです♥ sub dearinsu_to_desu { my $text = shift; $text =~ s/でありんす/です/g; return $text; }# or use Lingua::JA::NormalizeText qw/nfkc decode_entities/; use utf8; my $text = '鳥が㌧㌦でありんす♥'; print dearinsu_to_desu( decode_entities( nfkc($text) ) ); # -> 鳥がトンドルです♥ sub dearinsu_to_desu { my $text = shift; $text =~ s/でありんす/です/g; return $text; }Product's homepage


Lingua::JA::NormalizeText Related Software