[Top][All Lists]
[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]
Re: Nick Dokos: texi2dvi egrep regexp
From: |
Nick Dokos |
Subject: |
Re: Nick Dokos: texi2dvi egrep regexp |
Date: |
Fri, 08 Oct 2010 14:38:56 -0400 |
Nick Dokos <address@hidden> wrote:
>
> Finally, certain named classes of characters are predefined
> within bracket expressions, as follows. Their names are self
> explanatory, and they are [:alnum:], [:alpha:], [:cntrl:],
> [:digit:], [:graph:], [:lower:], [:print:], [:punct:], [:space:],
> [:upper:], and [:xdigit:]. For example, [[:alnum:]] means
> [0-9A-Za-z], except the latter form depends upon the C locale and
> the ASCII character encoding, whereas the former is independent
> of locale and character set. (Note that the brackets in these
> class names are part of the symbolic names, and must be included
> in addition to the brackets delimiting the bracket expression.)
> Most meta-characters lose their special meaning inside bracket
> expressions. To include a literal ] place it first in the list.
> Similarly, to include a literal ^ place it anywhere but first.
> Finally, to include a literal - place it last.
>
> Given that, would it make sense to replace the egrep invocation in
> texi2dvi with
>
> egrep '^(/|[:alpha:]:/)'
>
> which would be valid under any locale?
Ugh, wrong syntax:
egrep '^(/|[[:alpha:]]:/)'
Also Eric Fraga mentioned that this might include accented characters
which is ... undesirable.
Nick