Subject: 
Re: Possible crawler-bot alert?
Newsgroups: 
lugnet.admin.general
Date: 
Tue, 5 Feb 2002 14:59:37 GMT
Reply-To: 
"Andy Lynch" <andy@NOSPAMandyandjackie.com>
  
----- Original Message -----
From: "Dan Boger" <dan@peeron.com>
Are there some measures in place? Yes, some. Is it possible to block
out crawlers? Not really. As long as anyone can point their browser
(or newsreader) at LUGNET and connect without a password,
the data there can be harvested. We can (and have) put in all sorts
of blocks, trying to weed out bots, but there is just no way of
knowing whether a certain request is coming from a user browsing the site
or from a bot.

My suggestion to you - and this goes for anywhere you post your email
to - is to use spam filters and spamblocks. It's a pain, but I fear there's no
other option on today's Internet.

Dan

Dan,
Question regarding this.  I mostly post to LUGNET via e-mail.  I just
tried doing so using a spamblock in my reply-to header.  This works
fine, but my From: field still shows my normal address, and LUGNET is
polite enough to display that fact, as shown here:

http://news.lugnet.com/off-topic/test/?n=3522

And probably this post too, as I am using the same method to post it.
On the web page it is not a mailto: link, and that is nice, but my e-mail
address is still shown on the page, and a (simple) regular expression
search would reveal it.  Would it be possible to obfuscate the display a
bit, in an HTML-friendly way of course, to make people work
harder to harvest?  Maybe an empty SPAN tag pair between the @ and the
domain and another between the domain and the extension, or something
like that.  Not really a solution, but harvesters would at least have to
recognize that the tags were there in order to weed them out.
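
To make the SPAN idea concrete, here is a rough sketch in Python. It is only
an illustration, not how LUGNET actually renders addresses: the helper name
obfuscate_email, the sample address, and the "simple" harvester pattern are
all made up for the example.

```python
import html
import re

# Hypothetical marker: an empty SPAN pair that browsers render as nothing.
EMPTY_SPAN = "<span></span>"

# A deliberately simple harvester-style pattern (an assumption for this demo).
SIMPLE_EMAIL_RE = re.compile(r"[\w.+-]+@[\w-]+(?:\.[\w-]+)+")


def obfuscate_email(address: str) -> str:
    """Return HTML for `address` with an empty SPAN pair after the '@'
    and another between the domain name and its extension."""
    local, at, domain = address.rpartition("@")
    if not at or "." not in domain:
        # Not something we know how to split; just escape it.
        return html.escape(address)
    name, _, ext = domain.rpartition(".")
    return (
        html.escape(local) + "@" + EMPTY_SPAN
        + html.escape(name) + EMPTY_SPAN + "." + html.escape(ext)
    )


def strip_tags(markup: str) -> str:
    """What a slightly smarter harvester would do before searching."""
    return re.sub(r"<[^>]+>", "", markup)


if __name__ == "__main__":
    page = obfuscate_email("someone@example.com")
    print(page)
    print("naive match:", SIMPLE_EMAIL_RE.search(page))
    print("after stripping tags:", SIMPLE_EMAIL_RE.search(strip_tags(page)))
```

The naive pattern finds nothing in the obfuscated markup, but it matches again
as soon as the tags are stripped, which is why this only raises the bar
rather than solving the problem.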

Other ideas are welcome of course.
-Andy Lynch



Message has 1 Reply:
  Re: Possible crawler-bot alert?
 
(...) that is correct - since you didn't change your "from" field to use the spamblock. What you need to do is create a new "posting setup" here: (URL) put in your spamblock email address in the "From" area. Then, change your mail client to send (...) (23 years ago, 5-Feb-02, to lugnet.admin.general)
