[O] [RFC] Alternative to sub/superscript regexp

emacs-orgmode

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

[O] [RFC] Alternative to sub/superscript regexp

From:	Nicolas Goaziou
Subject:	[O] [RFC] Alternative to sub/superscript regexp
Date:	Mon, 25 Nov 2013 18:14:27 +0100

Hello,

For the record `org-match-substring-regexp' is a variation on:

"\\(\\S-\\)\\([_^]\\)\\(\\(?:\\*\\|address@hidden 
\t\r\n,:\"?<>~;./{}=()]+\\)\\)\\)"

I think it is a bit convoluted and therefore difficult to predict. For
example, as recent bug report showed, you may tend to interpret
a_b[fn:1] as

   a_{b}[fn:1]

but, in fact, it is equivalent to

   a_{b[fn}:1]

Of course, we can prevent this by forbidding "[" and "]" in the last
part of the regexp. But I wonder if there's something better to do.

The idea behind this regexp is that we should be able to write simple
sub/superscript, including numbers and entities, without requiring curly
braces (see `org-use-sub-superscripts' docstring for details). Maybe
something like the following could be an interesting alternative:

  
"\\(\\S-\\)\\([_^]\\)\\(\\*\\|[+-]?\\(?:\\w\\|[0-9.,\\]\\)*\\(\\w\\|[0-9]\\)\\)"

That is, without braces, either an asterisk or any combination of word,
number, dot, comma and backslash characters, which may start with either
a plus or a minus sign but cannot end with either a dot or a comma.

I find it arguably more predictable (no inverted class). Also, we "gain"
the following:

  a^3.14. <=> a^{3.14}.

At the moment, a^3.14. <=> a^{3}.14.

What do you think?


Regards,

-- 
Nicolas Goaziou

[Prev in Thread]

Current Thread

[Next in Thread]

[O] [RFC] Alternative to sub/superscript regexp, Nicolas Goaziou <=
- Re: [O] [RFC] Alternative to sub/superscript regexp, Nick Dokos, 2013/11/25
- Re: [O] [RFC] Alternative to sub/superscript regexp, Rasmus, 2013/11/25
- Re: [O] [RFC] Alternative to sub/superscript regexp, Carsten Dominik, 2013/11/26
  - Re: [O] [RFC] Alternative to sub/superscript regexp, Nicolas Goaziou, 2013/11/26

Prev by Date: Re: [O] commit 5ea0228 has problem opening big org-mode file
Next by Date: Re: [O] [RFC] Alternative to sub/superscript regexp
Previous by thread: Re: [O] Only display hours and minutes, not seconds
Next by thread: Re: [O] [RFC] Alternative to sub/superscript regexp
Index(es):
- Date
- Thread