emacs-devel
[Top][All Lists]
Advanced

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: Reporting UTF-8 related problems?


From: Andreas Schwab
Subject: Re: Reporting UTF-8 related problems?
Date: Tue, 30 Jul 2002 09:57:09 +0200
User-agent: Gnus/5.090006 (Oort Gnus v0.06) Emacs/21.3.50 (ia64-suse-linux)

Kenichi Handa <address@hidden> writes:

|> In article <address@hidden>, Karl Eichwalder <address@hidden> writes:
|> > Kenichi Handa <address@hidden> writes:
|> >>>  Char: “ (0150310, 53448, 0xd0c8) point=309 of 321 (96%) column 12 
|> >> 
|> >>  This is because Emacs received this byte sequence:
|> >>   ESC $ ( B ! H
|> >>  "ESC $ ( B" is a designation sequence for jisx0208, 
|> >>  and the following two bytes "! H" specifies the above
|> >>  Japanese symbol.
|> 
|> > Originally, it was the "right double quote raising" and not meant to be
|> > a special Japanese symbol ;)
|> 
|> I checked the contents of the html file itself and found this:
|> 
|>      &#132;Die Familie Schroffenstein&#147
|> 
|> I thought that the notation &#NUMBER is for transmitting
|> Unicode character of code NUMBER.  But, 132 and 147 are
|> control codes in Unicode, not any kind of quotings.  Do you
|> know a proper web page describing the meaning of them?

The numbers are supposed to be ISO 8859-1 characters codes.  I'd guess the
page has been written with some broken (a.k.a. W*nd*ws) software (the use
of *.htm makes this apparent).  There is no hope for being compliant to
any standard.  I tried to validate it through the W3.org validator, but no
document type matches.

Andreas.

-- 
Andreas Schwab, SuSE Labs, address@hidden
SuSE Linux AG, Deutschherrnstr. 15-19, D-90429 Nürnberg
Key fingerprint = 58CA 54C7 6D53 942B 1756  01D3 44D5 214B 8276 4ED5
"And now for something completely different."



reply via email to

[Prev in Thread] Current Thread [Next in Thread]