Unicode::Regex::Set

Unicode::Regex::Set is a subtraction and intersection of Character Sets in Unicode Regular Expressions.
Download

Unicode::Regex::Set Ranking & Summary

Advertisement

  • Rating:
  • License:
  • Perl Artistic License
  • Price:
  • FREE
  • Publisher Name:
  • SADAHIRO Tomoyuki
  • Publisher web site:
  • http://search.cpan.org/~sadahiro/ShiftJIS-String-1.04/String.pod

Unicode::Regex::Set Tags


Unicode::Regex::Set Description

Unicode::Regex::Set is a subtraction and intersection of Character Sets in Unicode Regular Expressions. Unicode::Regex::Set is a subtraction and intersection of Character Sets in Unicode Regular Expressions.SYNOPSIS use Unicode::Regex::Set qw(parse); $regex = parse('');Perl 5.8.0 misses subtraction and intersection of characters, which is described in Unicode Regular Expressions (UTS #18). This module provides a mimic syntax of character classes including subtraction and intersection, taking advantage of look-ahead assertions.The syntax provided by this module is considerably incompatible with the standard Perl's regex syntax.Any whitespace character (that matches /s/) is allowed between any tokens. Square brackets ('') are used for grouping. A literal whitespace and square brackets must be backslashed (escaped with a backslash, ''). You cannot put literal ']' at the start of a group.A POSIX-style character class like is allowed since its ' is a list of 'A', '&', 'Z'. is a character range from 'A' to 'Z'. is a set by removal of from .Union operator '|' may be omitted. E.g. is equivalent to , and also to .Intersection operator '&' has high precedence, so is equivalent to | p{D}].Subtraction operator '-' has low precedence, so is equivalent to - ]. is a set by removal of p{B} and p{C} from p{A}. It is equivalent to ] and .Negation. when '^' just after a group-opening ' matches anything but neither nor . More clearly you can say this with grouping as ].If '^' that is not next to ' matches A-Z or a non-Latin character. But ] (or , for this is a simple case) is recommended for clarity.If you want to remove anything other than PERL from , use as well as ]. Similarly, if you want to intersect and a thing not JUNK, use as well as ]. Requirements: · Perl


Unicode::Regex::Set Related Software