LOCALE(1) - General Commands Manual
NAME # locale - character encoding and localization conventions
SYNOPSIS # locale [-a | -m | charmap]
DESCRIPTION # If the locale utility is invoked without any arguments, the current locale configuration is shown. Values for categories that are not set in the environment or that are overridden by LC_ALL are displayed between double quotes.
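The quoting rule described above can be illustrated with a small Python sketch. This is not the utility's implementation: the function name `show_locale`, the six-category list, the treatment of empty variables as unset, and the fallback value `"C"` are all assumptions made for illustration.

```python
# Hypothetical illustration of the quoting rule; not the locale(1) source.
# The six POSIX categories are assumed; the real utility may report others.
CATEGORIES = [
    "LC_COLLATE", "LC_CTYPE", "LC_MESSAGES",
    "LC_MONETARY", "LC_NUMERIC", "LC_TIME",
]


def show_locale(env, default="C"):
    """Return output lines for the given environment mapping.

    `default` is an assumed fallback used when no variable is set.
    Empty variables are treated as unset (also an assumption).
    """
    lang = env.get("LANG", "")
    lc_all = env.get("LC_ALL", "")
    lines = [f"LANG={lang}"]
    for name in CATEGORIES:
        own = env.get(name, "")
        if own and not lc_all:
            # Set in the environment and not overridden: shown bare.
            lines.append(f"{name}={own}")
        else:
            # Unset, or overridden by LC_ALL: effective value in double quotes.
            effective = lc_all or lang or default
            lines.append(f'{name}="{effective}"')
    lines.append(f"LC_ALL={lc_all}")
    return lines


if __name__ == "__main__":
    import os
    print("\n".join(show_locale(os.environ)))
```

Passing a plain dictionary instead of `os.environ` makes the rule easy to try out, for example `show_locale({"LANG": "en_US.UTF-8", "LC_TIME": "de_DE.UTF-8"})`.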
The options are as follows:
...
locale(3p) Perl Programmers Reference Guide locale(3p)
NAME # locale - Perl pragma to use or avoid POSIX locales for built-in operations
WARNING # DO NOT USE this pragma in scripts that have multiple threads active. The locale is not local to a single thread. Another thread may change the locale at any time, which could, at a minimum, leave a given thread operating in a locale it isn't expecting to be in.
...
UTF8(7) - Miscellaneous Information Manual
NAME # utf8 - UTF-8 text encoding
DESCRIPTION # UTF-8 is a multibyte character encoding for Unicode text. It is the preferred format for non-ASCII text.
Unicode codepoints are encoded as follows:
U+0000 – U+007F:
One byte: 0....... (compatible with ASCII)
U+0080 – U+07FF:
Two bytes: 110..... 10......
U+0800 – U+D7FF and U+E000 – U+FFFF:
Three bytes: 1110....
...
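The ranges listed above can be turned into a small encoder. The following Python sketch covers only code points up to U+FFFF, matching the ranges shown in this excerpt. The function name `utf8_encode` is hypothetical, and the continuation-byte layout for the three-byte form (two `10......` bytes) follows the standard UTF-8 definition rather than the truncated line above.

```python
def utf8_encode(cp: int) -> bytes:
    """Encode one code point in the range U+0000 to U+FFFF as UTF-8.

    Illustrative sketch only: surrogates (U+D800 to U+DFFF) and code
    points above U+FFFF are rejected rather than handled.
    """
    if 0xD800 <= cp <= 0xDFFF:
        raise ValueError("surrogate code points are not encoded in UTF-8")
    if cp < 0 or cp > 0xFFFF:
        raise ValueError("code point outside the ranges covered by this sketch")
    if cp <= 0x7F:
        # One byte: 0....... (identical to ASCII)
        return bytes([cp])
    if cp <= 0x7FF:
        # Two bytes: 110..... 10......
        return bytes([0xC0 | (cp >> 6), 0x80 | (cp & 0x3F)])
    # Three bytes: 1110.... 10...... 10......
    return bytes([
        0xE0 | (cp >> 12),
        0x80 | ((cp >> 6) & 0x3F),
        0x80 | (cp & 0x3F),
    ])


if __name__ == "__main__":
    for ch in "A\u00e9\u20ac":
        # Compare against Python's built-in encoder.
        print(f"U+{ord(ch):04X}", utf8_encode(ord(ch)).hex(),
              ch.encode("utf-8").hex())
```

Each line printed by the demonstration shows the sketch's output next to that of Python's built-in `str.encode`, so any disagreement is visible at a glance.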