Return Styles: Pseud0ch, Terminal, Valhalla, NES, Geocities, Blue Moon. Entire thread

HTTP and unicode

Name: Anonymous 2010-12-30 16:24

Where can i expect unicode strings when writing a webserver? The standard says HTTP headers are UTF-8, but now there are unicode domain names. Of course the Content-type can be anything other than utf8, but that isn't the main problem

Name: Anonymous 2010-12-30 16:48

HTTP headers are ISO-8859-1, fool.

Name: Anonymous 2010-12-30 17:32

>>2
My bad!
So if I get this correctly, my language C implements ascii. latin-1 is an extension to ascii, adding 128 new characters.

However, resource locators in http headers could still be in unicode!

Name: Anonymous 2010-12-30 18:22

>>3
Your language? You wrote a C compiler aged 12 as well? NO WAY

Name: Anonymous 2010-12-30 18:23

>>3
C does not implement any character encoding, it just sets some restrictions on the used character set and encoding (C99 §5.2.1).

URIs have a very restricted character set, though any octet can be encoded in them. You may be thinking of internationalized domain names and Punycode, but that's an orthogonal and largely UI-based issue that can be ignored.

Name: Anonymous 2011-02-03 0:20

Name: Anonymous 2011-02-03 4:46

Newer Posts
Don't change these.
Name: Email:
Entire Thread Thread List