To LUGNET HomepageTo LUGNET News HomepageTo LUGNET Guide Homepage
 Help on Searching
 
Post new message to lugnet.faqOpen lugnet.faq in your NNTP NewsreaderTo LUGNET News Traffic PageSign In (Members)
 FAQ / 80
  Raw FAQ data format (Was: Format of FAQ items)
 
Todd Lehman (lehman@javanet.com) and Sproaticus (jsproat@geocities.com) writes lots of stuff: Guys! It sounds like you are mixing the raw data format and the presentation format. I'll try to stick to the raw data format here, and list the ideas I (...) (25 years ago, 24-Apr-99, to lugnet.faq)
 
  Re: Raw FAQ data format (Was: Format of FAQ items)
 
(...) Sounds mostly good. Catch my exceptions down below. (...) Or some other tool; but I agree, a well-defined subset of HTML can and should be used. (...) (Please keep in mind Jacob, that these are nits I'm picking. :-) "Newsgroups" would be more (...) (25 years ago, 24-Apr-99, to lugnet.faq)
 
  Re: Raw FAQ data format (Was: Format of FAQ items)
 
(...) I'd like to see Latin-1 as the raw data format, with conversions to the presentation formats of: (a) HTML for webpages and (b) Latin-1/ASCII for text documents (i.e. news postings). (...) Is this a choice between (a) 7-bit ASCII with HTML (...) (25 years ago, 25-Apr-99, to lugnet.faq)
 
  Re: Raw FAQ data format (Was: Format of FAQ items)
 
(...) Oh man, I'm HOT on "lynx -dump -force_html"!! It doesn't do an absolutely perfect perfect job, but it comes *so* close, and I'll bet it can get even closer by specifying a custom config file on the command line. (...) Agreed -- only &xxx; (...) (25 years ago, 25-Apr-99, to lugnet.faq)
 
  Re: Raw FAQ data format (Was: Format of FAQ items)
 
Sproaticus: [...] (...) Fine. Location: [comma-separated list of Lugnet relative URI's] (...) What about Translated-From: [ISO 639 language code] Translator: [translator, ISO date] so Revision: Todd Lehman, 1997-12-24 Revision: Minx Kelly, (...) (25 years ago, 26-Apr-99, to lugnet.faq)
 
  Re: Raw FAQ data format (Was: Format of FAQ items)
 
(...) Fine. (...) Yes (but the first four _are_ ASCII characters). (...) Me too. And yes. Play well, Jacob ---...--- -- E-mail: sparre@cats.nbi.dk -- -- Web...: <URL:(URL) -- ---...--- (25 years ago, 26-Apr-99, to lugnet.faq)
 
  Re: Raw FAQ data format (Was: Format of FAQ items)
 
(...) If we ban HTML _elements_ from the headers, then we don't need to escape '<' and '>'. There has never been a need to escape '"'. If we want to allow numeric character references outside Latin-1 (like '&#805;') we still have to escape (...) (25 years ago, 26-Apr-99, to lugnet.faq)
 
  Re: Raw FAQ data format (Was: Format of FAQ items)
 
(...) Looks good to me. (...) You mean like, Revision: Todd Lehman, 19971224, en Revision: Minx Kelly, 19980921 Revision: Jacob Sparre Andersen, 19990221, da or some such? (...) More easily read by humans, probably just as easy to parse. What would (...) (25 years ago, 26-Apr-99, to lugnet.faq)
 
  Re: Raw FAQ data format (Was: Format of FAQ items)
 
(...) Although rare, double-quote characters (") which appear inside of tags (for example inside of URLs), have to be written as &quot; -- e.g. <IMG SRC="(URL) Double-quote characters (") appearing in normal text (outside of tags) don't have to be (...) (25 years ago, 26-Apr-99, to lugnet.faq)
 
  Re: Raw FAQ data format (Was: Format of FAQ items)
 
(...) How is the Japanese language represented in HTML? I seem to remember seeing a page a few weeks ago that seemed like it used 2-byte ShiftJIS... I'd be shocked if they used 8-byte HTML entities. Can we imagine any possible uses for characters (...) (25 years ago, 26-Apr-99, to lugnet.faq)
 
  Re: Raw FAQ data format (Was: Format of FAQ items)
 
Todd: (...) It depends on the chosen encoding of the document. I would expect most people^H^H^H^H^Hrogrammers to use plain 16 bit Unicode BMP[1]. Since I am too lazy to get my computer to operate with another character encoding (than Latin-1), I (...) (25 years ago, 27-Apr-99, to lugnet.faq)
 
  Re: Raw FAQ data format (Was: Format of FAQ items)
 
Sproaticus: (...) No, like: Revision: Todd Lehman, 1997-12-24 Revision: Minx Kelly, 1998-09-21 Translated-From: en -> Revision: Jacob Sparre Andersen, 1999-02-21 <- Content-Language: da (...) Can't remember. I think you append "-" Hours (two digits) (...) (25 years ago, 27-Apr-99, to lugnet.faq)
 
  Re: Raw FAQ data format (Was: Format of FAQ items)
 
(...) it sounds like a sneeze to me... "Ecu!" "Bless you" "Thanks" Steve (25 years ago, 27-Apr-99, to lugnet.faq)
 
  Re: Raw FAQ data format (Was: Format of FAQ items)
 
(...) Since header order is important, this should work pretty well. (...) Not bad either. Wouldn't changing an included file imply that the including file has also changed? (...) ^^^...^^^ Huh? (...) But then we'd have to run the thing through a (...) (25 years ago, 27-Apr-99, to lugnet.faq)
 
  Re: Raw FAQ data format (Was: Format of FAQ items)
 
Sproaticus: (...) Yes, but it is not equally clear. Please forget that I mentioned it. (...) No (but it would mean that it should be reprocessed). Imagine this: "translating.da": Content-Language: da Revision: Jacob Sparre Andersen, 1999-04-28 (...) (25 years ago, 28-Apr-99, to lugnet.faq)
 
  Re: Raw FAQ data format (Was: Format of FAQ items)
 
(...) Oh waitaminute. I think I see where the confusion is. I'm not suggesting that the header text from the included file be inserted into the including file. What I mean to say is that during processing of the files, when the including file (...) (25 years ago, 28-Apr-99, to lugnet.faq)
 
  Re: Raw FAQ data format (Was: Format of FAQ items)
 
Jeremy: (...) I didn't think so. What I tried to explain was how the processing tools would see the file _after_ the "Include" header had been processed. (...) ~~~...~~~ This is the critical part. I hadn't noticed it earlier. (...) Yes. Play well, (...) (25 years ago, 29-Apr-99, to lugnet.faq)

©2005 LUGNET. All rights reserved. - hosted by steinbruch.info GbR