[Oberon] [EXT] Re: Porting Texts.Mod to Oberon-7, my Artemis project

Skulski, Wojciech skulski at pas.rochester.edu
Sun Jun 20 03:46:55 CEST 2021


On Sat, Jun 19, 2021 at 4:40 AM Michael Schierl <schierlm at gmx.de> wrote:

> The characters in memory are still 8 bits wide, using UTF-8 encoding.
> The "magic" happens when the string gets converted to a Texts.Text,
> which is Unicode aware.

Michael:

According to Wikipedia, "UTF-8 is capable of encoding all 1,112,064 valid character code points in Unicode using one to four one-byte (8-bit) code units."
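
As a minimal sketch of what the quoted statement means, the Python snippet below shows that every UTF-8 code unit is one byte, while a single code point occupies between one and four of them. The helper name utf8_length and the four sample characters are assumptions chosen for illustration; they do not come from the Oberon sources under discussion.

```python
def utf8_length(ch: str) -> int:
    """Return how many 8-bit code units UTF-8 uses for one code point."""
    return len(ch.encode("utf-8"))


# Hypothetical samples: one code point from each UTF-8 length class.
samples = ["A", "\u00e9", "\u20ac", "\U0001D11E"]

for ch in samples:
    encoded = ch.encode("utf-8")
    # Each element of `encoded` is a single byte (0..255).
    print(f"U+{ord(ch):04X}: {len(encoded)} byte(s): {encoded.hex(' ')}")
```

Under this reading, a string held in memory as 8-bit units and a code point needing up to four bytes describe the same encoding: the unit is 8 bits wide, and a character is a sequence of such units.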

Could you explain how your 8-bit UTF-8 encoding relates to the statement that UTF-8 may need up to four bytes?

Thanks,
Wojtek

