[Oberon] [EXT] Re: Portings Texts.Mod to Oberon-7, my Artemis project
Skulski, Wojciech
skulski at pas.rochester.edu
Sun Jun 20 03:46:55 CEST 2021
On Sat, Jun 19, 2021 at 4:40 AM Michael Schierl <schierlm at gmx.de<mailto:schierlm at gmx.de>> wrote:
>The characters in memory are still 8 bits wide, using UTF-8 encoding.
> The "magic" happens when the string gets converted to a Texts.Text,
> which is Unicode aware.
Michael:
According to Wikipedia, "UTF-8 is capable of encoding all 1,112,064 valid character code points in Unicode using one to four one-byte (8-bit) code units."
Could you explain, how your 8-bit UTF-8 encoding relates to the statement that UTF-8 may need up to four bytes?
Thanks,
Wojtek
More information about the Oberon
mailing list