bug-mit-scheme
[Top][All Lists]
Advanced

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

[bug #62618] utf8->string and string-foldcase do normalization by defaul


From: Taylor R. Campbell
Subject: [bug #62618] utf8->string and string-foldcase do normalization by default
Date: Sun, 12 Jun 2022 11:43:07 -0400 (EDT)

URL:
  <https://savannah.gnu.org/bugs/?62618>

                 Summary: utf8->string and string-foldcase do normalization by
default
                 Project: MIT/GNU Scheme
               Submitter: riastradh
               Submitted: Sun 12 Jun 2022 03:43:05 PM UTC
                Category: runtime
                Severity: 3 - Normal
                Priority: 5 - Normal
              Item Group: Incorrect behavior
                  Status: None
                 Privacy: Public
             Assigned to: None
         Originator Name: 
        Originator Email: 
             Open/Closed: Open
         Discussion Lock: Any
                Keywords: 


    _______________________________________________________

Follow-up Comments:


-------------------------------------------------------
Date: Sun 12 Jun 2022 03:43:05 PM UTC By: Taylor R. Campbell <riastradh>
The UTF-8 decoding algorithm and Unicode case-folding algorithm do not involve
normalization, but the utf8->string and string-foldcase procedures always
normalize into NFC.

This is because they use the default mode of string-builder, which is nfc.  If
they used (builder 'immutable) instead of (builder) this would fix the
problem, but I tend to think that the string builder should default to no
normalization unless you ask for it.

In any case there should also be automatic tests of utf8->string and
string-foldcase and others to catch this.







    _______________________________________________________

Reply to this item at:

  <https://savannah.gnu.org/bugs/?62618>

_______________________________________________
Message sent via Savannah
https://savannah.gnu.org/




reply via email to

[Prev in Thread] Current Thread [Next in Thread]