3 May 22:35
Re: More on entities and Â
William McKee wrote: > On Tue, May 04, 2004 at 07:09:05AM +1200, Grant McLean wrote: > >>If your output encoding is UTF8 then every character beyond >>0x7F will be two or more bytes. The non-breaking space >>character should be A2 A0 (I think). So as long as you give >>the browser the correct charset setting in your headers, it >>should do exactly the right thing. > > > Hi Grant, > > Thanks for the quick response. How do I know what my output > encoding is? It will be UTF8 unless you do something to change it. For example (assuming Perl 5.8): my $html = $template->process (%args); open($fh,'>:encoding(iso-8859-1)', $path) or die "open($path): $!"; $fh->print($html); > I can set the encoding of the file and the meta tag. Yes, this tells the browser how it should interpret the document: <meta http-equiv="Content-type" content="text/html; charset=utf-8"> Obviously this needs to match the encoding used to create the file. > Should I be modifying the configuration of my Apache server? The 'meta http-equiv' tag above is equivalent to sending this header: Content-type: text/html; charset=utf-8 I've heard that not all browsers honour the charset suffix on the Content-type header so it might not be worth the effort. The meta tag has the advantage of staying with the document if the user does a 'Save-as', whereas the HTTP header would be lost. > The output I'm getting right now is C2A0. According to this table[1], > nbsp is 00A0 and A2A0 is not defined. Oops, my bad, I meant C2A0 but inexplicably typed A2A0. Cheers Grant
RSS Feed