Subject: 
Re: Raw FAQ data format (Was: Format of FAQ items)
Newsgroups: 
lugnet.faq
Date: 
Mon, 26 Apr 1999 16:48:49 GMT
Sproaticus:

[...]

- These header entries have been suggested:
     Subject          [the question]
     Category         [category (and sub-category?) name]
     Content-Language [ISO 639 language code]
     Topic-Level      [integer, 0 is beginner/easy/simple]
     Version:         [author and ISO date]
     Newsgroups:      [comma-separated list of newsgroups
                       the question could appear in]
     Translation:     [translator, from language, latest
                       version string from the translated
                       entry]

(Please keep in mind Jacob, that these are nits I'm picking.  :-)

"Newsgroups" would be more appropriately named "Location",
indicating not just a ng but specific directories in the
LUGNET data hierarchy.

Fine.

   Location: [comma-separated list of LUGNET relative URIs]

Also, something like "Original-Language" makes more sense
than "Translation".

What about

   Translated-From: [ISO 639 language code]
   Translator:      [translator, ISO date]

so

   Revision: Todd Lehman, 1997-12-24
   Revision: Minx Kelly, 1998-09-21
   Translated-From: en
   Translator: Jacob Sparre Andersen, 1999-02-21
   Content-Language: da

is an article written by Todd, edited by Minx, and then
translated from English to Danish by me?
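
A rough sketch of how a tool might read such a header block, just to check that the reading above is unambiguous. The function names are made up for illustration, and it assumes one "Name: value" pair per line, with the first Revision line naming the original author and later ones naming editors:

```python
def parse_headers(text):
    """Parse 'Name: value' lines into a list of (name, value) pairs.

    A list is used rather than a dict because 'Revision' may repeat.
    """
    pairs = []
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue
        name, sep, value = line.partition(":")
        if not sep:
            raise ValueError(f"not a header line: {line!r}")
        pairs.append((name.strip(), value.strip()))
    return pairs


def describe(pairs):
    """Summarise who wrote, edited and translated an entry.

    Assumption: the first Revision is the author, the rest are editors.
    """
    revisions = [v for n, v in pairs if n == "Revision"]
    fields = {n: v for n, v in pairs if n != "Revision"}
    translator = fields.get("Translator", "").split(",")[0].strip()
    return {
        "author": revisions[0].split(",")[0].strip() if revisions else None,
        "editors": [r.split(",")[0].strip() for r in revisions[1:]],
        "translator": translator or None,
        "from": fields.get("Translated-From"),
        "to": fields.get("Content-Language"),
    }


EXAMPLE = """\
   Revision: Todd Lehman, 1997-12-24
   Revision: Minx Kelly, 1998-09-21
   Translated-From: en
   Translator: Jacob Sparre Andersen, 1999-02-21
   Content-Language: da
"""

summary = describe(parse_headers(EXAMPLE))
```
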

We could also reuse "Revision" for translations.

Also, I'm leaning more towards "Revision" rather than "Version".

BTW, what *is* the format of an ISO date?

  Year (all digits)
  "-"
  Month (two digits)
  "-"
  Day (two digits)
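
For what it is worth, a small sketch of checking that format in Python. It assumes "all digits" means a four-digit year, and parse_iso_date is just an illustrative name:

```python
import re
from datetime import date

# Four-digit year, two-digit month, two-digit day, separated by "-".
ISO_DATE = re.compile(r"(\d{4})-(\d{2})-(\d{2})")


def parse_iso_date(text):
    """Return a datetime.date, or raise ValueError for anything else."""
    match = ISO_DATE.fullmatch(text.strip())
    if match is None:
        raise ValueError(f"not in YYYY-MM-DD form: {text!r}")
    year, month, day = (int(part) for part in match.groups())
    # date() itself rejects impossible dates such as 1999-02-30.
    return date(year, month, day)
```

A handy side effect of this format is that the strings sort chronologically when sorted as plain text.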

Plus:
       Include:         [applies the headers of the included file]

Which of the header entries?

Subject, Category, Topic-Level, and Location I presume.
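
If Include works that way, applying it could look roughly like this. It is only a sketch: the rule that an entry's own headers win over the included ones is my assumption, and apply_include is a made-up name:

```python
# Headers presumed to be inherited from an included file.
INHERITED = ("Subject", "Category", "Topic-Level", "Location")


def apply_include(own, included):
    """Merge the inheritable headers of an included file into an entry.

    Assumption: values set in the entry itself override included ones.
    """
    merged = {name: included[name] for name in INHERITED if name in included}
    merged.update(own)
    return merged
```

Revision, Translator and the language headers are left out on purpose, since they describe one particular file rather than a group of entries.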

- ASCII + HTML entities are allowed in the headers.

At least the &reg;-style chars.  I don't see much need
for more HTML in the headers.

That is what you call HTML entities.

Todd seems to want Latin-1 + HTML entities. That's fine for me.

- Should we use ASCII or Latin-1 for the content character
  set?

My knee-jerk reaction to the ASCII question is to just use
the lower 128 (not counting the very lowest 32 of course :-),

You need 10 and 13 for newlines :-)

and use some form of encoding for any other characters -- at
least for the raw FAQ format.

Todd says he doesn't have problems with processing Latin-1.
Neither do I.

I think we should allow Latin-1, but let people restrict
_themselves_ to ASCII if they prefer that.

What's the word on numeric character references outside
Latin-1?
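
To show what I have in mind, here is a sketch using Python's standard codecs. It keeps whatever fits in the chosen character set and turns everything else into decimal numeric character references. The function names are only for illustration:

```python
import html


def to_latin1_with_refs(text):
    """Keep Latin-1 characters; replace the rest with &#NNN; references."""
    return text.encode("latin-1", "xmlcharrefreplace").decode("latin-1")


def to_ascii_with_refs(text):
    """Same idea for people who restrict themselves to ASCII."""
    return text.encode("ascii", "xmlcharrefreplace").decode("ascii")


def from_refs(text):
    """Turn named and numeric references back into characters."""
    return html.unescape(text)
```

With this, "\u00c6r\u00f8" stays as it is in the Latin-1 variant and becomes "&#198;r&#248;" in the ASCII variant, while a character outside Latin-1 such as "\u20ac" becomes "&#8364;" in both.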

Play well,

Jacob

------------------------------------------------------------
--  E-mail:              sparre@cats.nbi.dk               --
--  Web...: <URL:http://hugin.risoe.dk/JJ_Memorial/FAQ/>  --
------------------------------------------------------------
Here's the edited list of header entries:

   Subject          [the question]
   Category         [category (and sub-category?) name]
   Content-Language [ISO 639 language code]
   Topic-Level      [integer, 0 is beginner/easy/simple]
   Revision         [author, ISO date]
   Location         [comma-separated list of LUGNET relative
                     URIs]
   Translated-From  [ISO 639 language code]
   Translator       [translator, ISO date]
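
A checker for this list could be sketched as follows. The checks themselves (unknown names, Topic-Level being a non-negative integer) are my guess at what would be useful, and the function names are hypothetical:

```python
KNOWN_HEADERS = {
    "Subject", "Category", "Content-Language", "Topic-Level",
    "Revision", "Location", "Translated-From", "Translator",
}


def check_headers(pairs):
    """Return a list of problems found in (name, value) header pairs."""
    problems = []
    for name, value in pairs:
        if name not in KNOWN_HEADERS:
            problems.append(f"unknown header: {name}")
        elif name == "Topic-Level" and not (value.isascii() and value.isdigit()):
            problems.append(f"Topic-Level is not a non-negative integer: {value!r}")
    return problems


def split_location(value):
    """Split a comma-separated Location value into individual URIs."""
    return [part.strip() for part in value.split(",") if part.strip()]
```

For example, a leftover "Newsgroups" header would be reported as unknown, and split_location("a/b, c/d") gives ["a/b", "c/d"] (the paths here are placeholders, not real LUGNET locations).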



