To LUGNET HomepageTo LUGNET News HomepageTo LUGNET Guide Homepage
 Help on Searching
 
Post new message to lugnet.faqOpen lugnet.faq in your NNTP NewsreaderTo LUGNET News Traffic PageSign In (Members)
 FAQ / 97 (-20)
  Re: Raw FAQ data format (Was: Format of FAQ items)
 
(...) How is the Japanese language represented in HTML? I seem to remember seeing a page a few weeks ago that seemed like it used 2-byte ShiftJIS... I'd be shocked if they used 8-byte HTML entities. Can we imagine any possible uses for characters (...) (25 years ago, 26-Apr-99, to lugnet.faq)
 
  Re: Raw FAQ data format (Was: Format of FAQ items)
 
(...) Although rare, double-quote characters (") which appear inside of tags (for example inside of URLs), have to be written as &quot; -- e.g. <IMG SRC="(URL) Double-quote characters (") appearing in normal text (outside of tags) don't have to be (...) (25 years ago, 26-Apr-99, to lugnet.faq)
 
  Re: Working sketch of FAQ item data format
 
Here's a working sketch of the FAQ data format; the example is wrapped in quasi-XML, to show the file's path. Subject: [the question] Content-Language: [ISO 639 language code] Topic-Level: [integer, 0 is beginner/easy/simple] Revision: [author, ISO (...) (25 years ago, 26-Apr-99, to lugnet.faq)
 
  Re: Raw FAQ data format (Was: Format of FAQ items)
 
(...) Looks good to me. (...) You mean like, Revision: Todd Lehman, 19971224, en Revision: Minx Kelly, 19980921 Revision: Jacob Sparre Andersen, 19990221, da or some such? (...) More easily read by humans, probably just as easy to parse. What would (...) (25 years ago, 26-Apr-99, to lugnet.faq)
 
  Re: Raw FAQ data format (Was: Format of FAQ items)
 
(...) If we ban HTML _elements_ from the headers, then we don't need to escape '<' and '>'. There has never been a need to escape '"'. If we want to allow numeric character references outside Latin-1 (like '&#805;') we still have to escape (...) (25 years ago, 26-Apr-99, to lugnet.faq)
 
  Re: Raw FAQ data format (Was: Format of FAQ items)
 
(...) Fine. (...) Yes (but the first four _are_ ASCII characters). (...) Me too. And yes. Play well, Jacob ---...--- -- E-mail: sparre@cats.nbi.dk -- -- Web...: <URL:(URL) -- ---...--- (25 years ago, 26-Apr-99, to lugnet.faq)
 
  Re: Raw FAQ data format (Was: Format of FAQ items)
 
Sproaticus: [...] (...) Fine. Location: [comma-separated list of Lugnet relative URI's] (...) What about Translated-From: [ISO 639 language code] Translator: [translator, ISO date] so Revision: Todd Lehman, 1997-12-24 Revision: Minx Kelly, (...) (25 years ago, 26-Apr-99, to lugnet.faq)
 
  Re: I'll volunteer for the LEGO FAQ
 
Todd: (...) You're right. I hadn't reached the "one entry, one file" idea when I wrote this. Play well, Jacob ---...--- -- E-mail: sparre@cats.nbi.dk -- -- Web...: <URL:(URL) -- ---...--- (25 years ago, 26-Apr-99, to lugnet.faq)
 
  Re: Format of FAQ items
 
(...) Sure -- suppose you have a directory full of FAQ items, some of which have pretty much the same headers. e.g. Locations: /x/y/,/x/z/,/x/,/ Content-Language: qw Translated-From: er Topic-Level: 1 Instead of maintaining each of these in every (...) (25 years ago, 26-Apr-99, to lugnet.faq)
 
  Re: Raw FAQ data format (Was: Format of FAQ items)
 
(...) Oh man, I'm HOT on "lynx -dump -force_html"!! It doesn't do an absolutely perfect perfect job, but it comes *so* close, and I'll bet it can get even closer by specifying a custom config file on the command line. (...) Agreed -- only &xxx; (...) (25 years ago, 25-Apr-99, to lugnet.faq)
 
  Re: Raw FAQ data format (Was: Format of FAQ items)
 
(...) I'd like to see Latin-1 as the raw data format, with conversions to the presentation formats of: (a) HTML for webpages and (b) Latin-1/ASCII for text documents (i.e. news postings). (...) Is this a choice between (a) 7-bit ASCII with HTML (...) (25 years ago, 25-Apr-99, to lugnet.faq)
 
  Re: Format of FAQ items
 
(...) Yes & no. 'Yes' in that a master TOC would/could appear. But 'no' in that the master TOC would just be a special case of a general-purpose sub- hierarchy TOC, which just happens to start at the top. I built one of these just for fun a couple (...) (25 years ago, 25-Apr-99, to lugnet.faq)
 
  Re: Working sketch of FAQ item data format
 
(...) OK, I grok that better, yes. But why was that file referring to (or including) itself, and why were the other files including files in parent directories rather than subdirectories? For definitions? (I guess you answered this below...) (...) (...) (25 years ago, 25-Apr-99, to lugnet.faq)
 
  Re: I'll volunteer for the LEGO FAQ
 
(...) Tags at the beginning and end of the file? That just makes processing more difficult, not easier. More syntax = more work. Less syntax = less work. I'm looking for a model with a simple but flexible design -- where individual collections Q&A (...) (25 years ago, 25-Apr-99, to lugnet.faq)
 
  Re: Raw FAQ data format (Was: Format of FAQ items)
 
(...) Sounds mostly good. Catch my exceptions down below. (...) Or some other tool; but I agree, a well-defined subset of HTML can and should be used. (...) (Please keep in mind Jacob, that these are nits I'm picking. :-) "Newsgroups" would be more (...) (25 years ago, 24-Apr-99, to lugnet.faq)
 
  Re: Format of FAQ items
 
(...) You mean like a master Table of Contents? Sure, okay. Question" How cautious do we want to get about creeping featuritis? :-, (...) Yep. (...) Oh, yeah. Integers are better for an enumerated value. (...) If you want to write the code for it, (...) (25 years ago, 24-Apr-99, to lugnet.faq)
 
  Re: Working sketch of FAQ item data format
 
(...) Would "Include" be better? (...) Sure, it looks like you're able to efficiently distinguish between a ng and a non-ng directory. (...) The top line looks good. I doubt we'd need the time-of-day stamp very often, if at all. Would it be possible (...) (25 years ago, 24-Apr-99, to lugnet.faq)
 
  Raw FAQ data format (Was: Format of FAQ items)
 
Todd Lehman (lehman@javanet.com) and Sproaticus (jsproat@geocities.com) writes lots of stuff: Guys! It sounds like you are mixing the raw data format and the presentation format. I'll try to stick to the raw data format here, and list the ideas I (...) (25 years ago, 24-Apr-99, to lugnet.faq)
 
  Re: I'll volunteer for the LEGO FAQ
 
(...) I don't think so. We should rather choose a separator for category levels: Category: [category] / [subcategory depth 1] / ... (...) Makes sense. (...) Lets use HTML. We can always use "lynx -dump" to generate plain text from the HTML edition. (...) (25 years ago, 24-Apr-99, to lugnet.faq)
 
  Re: Working sketch of FAQ item data format
 
(...) What does the Refer header mean? Say, due to what we figured out on the other thread, we can nix the Locations and Newsgroups headers, and just keep the Subject and Author headers. But doesn't the date belong on the same line is the author?... (...) (25 years ago, 24-Apr-99, to lugnet.faq)


Next Page:  5 more | 10 more | 20 more

Redisplay Messages:  All | Compact

©2005 LUGNET. All rights reserved. - hosted by steinbruch.info GbR