# HG changeset patch
# User paulb
# Date 1094241827 0
# Node ID 88245d14e139c903d0f3e09a13887ae4e4d19471
# Parent 152c78dacc40b00f6aeeaec6529f29d40b9e5378
[project @ 2004-09-03 20:03:47 by paulb]
Added notes on character set usage and limitations.

diff -r 152c78dacc40 -r 88245d14e139 docs/CHARSET.txt
--- a/docs/CHARSET.txt Thu Sep 02 19:33:37 2004 +0000
+++ b/docs/CHARSET.txt Fri Sep 03 20:03:47 2004 +0000
@@ -40,6 +40,15 @@
 * Where the response content type does not specify a character set, a default
   character set is used to encode Unicode response data (eg. HTML pages).
 
+Restrictions in and Omissions from Standards
+--------------------------------------------
+
+The encoding of character sets such as UTF-16 in HTTP POST request body
+messages of content/media type application/x-www-form-urlencoded is not
+properly standardised. Therefore, it is highly recommended that UTF-8 be used
+as an encoding should the various single byte encodings (eg. ISO-8859-1) not
+cover the range of characters to be displayed and received.
+
 Framework Behaviour
 -------------------