Re: Buffer names with R2L characters


From: James Cloos
Subject: Re: Buffer names with R2L characters
Date: Mon, 20 Jun 2011 16:13:15 -0400
User-agent: Gnus/5.110018 (No Gnus v0.18) Emacs/24.0.50 (gnu/linux)

The bidi algorithm is just not designed for markup (and the <digits> tag
/is/ markup).  Ideally there would be a 0-width break before the <digits>
or a way to mark the <digits> blob as non-neutral.
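
As a rough illustration of why the tag gets pulled into the reordering,
here is a minimal Python sketch that asks the Unicode character database
for the bidi class of each character in a tag like <12>.  The helper name
bidi_classes is made up for the example; unicodedata.bidirectional is the
standard library call.  The last line looks at U+200E (LEFT-TO-RIGHT MARK),
one existing zero-width character with a strong direction; whether that is
the right kind of break for this case is only an assumption.

```python
import unicodedata


def bidi_classes(text):
    """Return the Unicode bidi class of each character in text."""
    return [unicodedata.bidirectional(ch) for ch in text]


# The angle brackets are "other neutral" (ON) and the digits are
# "European number" (EN); nothing in the tag has a strong direction.
print(bidi_classes("<12>"))                 # ['ON', 'EN', 'EN', 'ON']

# A Hebrew letter, by contrast, is strongly right-to-left.
print(unicodedata.bidirectional("\u05d0"))  # R

# U+200E LEFT-TO-RIGHT MARK is zero-width but strongly left-to-right.
print(unicodedata.bidirectional("\u200e"))  # L
```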

Whether the result should display as <12>FEDCBA, <21>FEDCBA or FEDCBA<12>,
though, I have no idea.  (That is, I don't know which of those users would
prefer.  I presume that an ordering break would result in the third.)

I doubt that it can be properly fixed, though, without also fixing Unicode's
algorithm to better handle markup interspersed in the main text.

In general, each blob of markup should be handled as its own document,
and the result of that should be treated as a single (perhaps neutral)
character from the point of view of the enclosing text.
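
A toy sketch of that idea in Python, under loud assumptions: uppercase
letters stand in for R2L characters (as in the FEDCBA examples above), the
whole line is treated as one right-to-left run, and markup is assumed to
look like <digits>.  This is not the Unicode algorithm or anything Emacs
does; the names MARKUP, split_units and visual_rtl are made up for the
example.  It only shows the difference between reversing characters and
reversing units where each markup blob counts as one unit.

```python
import re

# Assumption: markup is a run of digits in angle brackets, e.g. "<12>".
MARKUP = re.compile(r"<\d+>")


def split_units(logical):
    """Split text into units: each markup blob is one unit, and every
    other character is a unit of its own."""
    units = []
    pos = 0
    for match in MARKUP.finditer(logical):
        units.extend(logical[pos:match.start()])  # ordinary characters
        units.append(match.group())               # whole blob, kept intact
        pos = match.end()
    units.extend(logical[pos:])
    return units


def visual_rtl(logical):
    """Toy visual order for a pure right-to-left run: reverse the units,
    leaving the inside of each markup blob in its own order."""
    return "".join(reversed(split_units(logical)))


# Naive per-character reversal scrambles the tag (no mirroring applied):
print("ABCDEF<12>"[::-1])        # >21<FEDCBA

# Treating the blob as a single unit keeps it readable:
print(visual_rtl("ABCDEF<12>"))  # <12>FEDCBA
```

With bracket mirroring the naive reversal would read <21>FEDCBA, the second
candidate above; the unit-based version gives the first.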

That would fix things for this issue, SGML/XML, TeX, source code of
every type (some of the markup there is implicit, but still logically
extant), et cetera.

For that to work the engine obviously needs to know what markup looks like,
which requires additional meta information about each document.  The buffer
mode helps, but may not be sufficient?

Which is probably why Unicode, with its plain-text emphasis, ignored it.

-JimC
-- 
James Cloos <address@hidden>         OpenPGP: 1024D/ED7DAEA6