A webpage can contain any number of different languages. The character encoding of a site specifies how those languages are stored on your Host. The choice of encoding is a trade-off between the support of unusual characters and the size of the HTML.
Sandvox offers four different choices of character encoding:
- Only handles basic letters from the Roman alphabet. For example, it cannot handle characters with accents. In order to handle any characters not supported by ASCII, Sandvox must convert them into a special numeric code that takes up a lot of space. Therefore, this choice is only really suitable for English sites.
- ISO-8859-1 / Latin 1
- Similar to ASCII but with better support for accents. Will handle most European languages efficiently.
- UTF-8 (Recommended)
- The recommended choice for most sites. It supports all characters without taking up much space.
- Requires more space than UTF-8. However, it is more efficient at handling languages without European characters (e.g. Japanese, Chinese, Korean, Russian, Hebrew, Arabic, Greek, and Hindi). So if your site is in such a language, UTF-16 is generally the best choice.
To set a site's Character Encoding:
- Open the Document Inspector.
- Go to the "Site" tab.
- Set the Character Encoding from the "Character encoding" popup.
While using the Host Setup Assistant to prepare a site for publishing, Sandvox checks the Host used to see if it always serves pages using a particular character encoding. If this is the case then the assistant will automatically set the site's encoding to match that of the Host.
Keywords: character, encoding, code, coding, characters, letter, encode, incode, incoding, japan, japanese, china, chinese, east, eastern, charset, charsets