To LUGNET HomepageTo LUGNET News HomepageTo LUGNET Guide Homepage
 Help on Searching
 
Post new message to lugnet.faqOpen lugnet.faq in your NNTP NewsreaderTo LUGNET News Traffic PageSign In (Members)
 FAQ / 93
92  |  94
Subject: 
Re: Raw FAQ data format (Was: Format of FAQ items)
Newsgroups: 
lugnet.faq
Date: 
Mon, 26 Apr 1999 16:58:19 GMT
Viewed: 
1730 times
  
Todd Lehman (lehman@javanet.com) wrote:
In lugnet.faq, jsproat@geocities.com (Sproaticus) writes:

Agreed -- only &xxx; entities ought to be allowed in the headers, IMO...
And if the content charset is Latin-1 instead of pure 7-bit ASCII, then this
can be further reduced to < > " &.

If we ban HTML _elements_ from the headers, then we don't
need to escape '<' and '>'. There has never been a need to
escape '"'.

If we want to allow numeric character references outside
Latin-1 (like '&#805;') we still have to escape ampersands.

[...]

and *perhaps* the double-quote character should be forced to be written as
an entity as well:

   "  =>  &quot;

Why?

But apart from those, wouldn't it simplify editing a ton (and make it much
much safer) if characters above 128 were just written directly in their
Latin-1 encoding, i.e.--?

Yes.

Play well,

Jacob

------------------------------------------------------------
--  E-mail:              sparre@cats.nbi.dk               --
--  Web...: <URL:http://hugin.risoe.dk/JJ_Memorial/FAQ/>  --
------------------------------------------------------------



Message has 2 Replies:
  Re: Raw FAQ data format (Was: Format of FAQ items)
 
(...) Although rare, double-quote characters (") which appear inside of tags (for example inside of URLs), have to be written as &quot; -- e.g. <IMG SRC="(URL) Double-quote characters (") appearing in normal text (outside of tags) don't have to be (...) (25 years ago, 26-Apr-99, to lugnet.faq)
  Re: Raw FAQ data format (Was: Format of FAQ items)
 
(...) How is the Japanese language represented in HTML? I seem to remember seeing a page a few weeks ago that seemed like it used 2-byte ShiftJIS... I'd be shocked if they used 8-byte HTML entities. Can we imagine any possible uses for characters (...) (25 years ago, 26-Apr-99, to lugnet.faq)

Message is in Reply To:
  Re: Raw FAQ data format (Was: Format of FAQ items)
 
(...) Oh man, I'm HOT on "lynx -dump -force_html"!! It doesn't do an absolutely perfect perfect job, but it comes *so* close, and I'll bet it can get even closer by specifying a custom config file on the command line. (...) Agreed -- only &xxx; (...) (25 years ago, 25-Apr-99, to lugnet.faq)

82 Messages in This Thread:
























Entire Thread on One Page:
Nested:  All | Brief | Compact | Dots
Linear:  All | Brief | Compact

This Message and its Replies on One Page:
Nested:  All | Brief | Compact | Dots
Linear:  All | Brief | Compact
    

Custom Search

©2005 LUGNET. All rights reserved. - hosted by steinbruch.info GbR