Subject:
|
Re: Regex for mutations of "unsubscribe" (was: Re: unsusbscribe lego-robotics)
|
Newsgroups:
|
lugnet.off-topic.geek
|
Date:
|
Sat, 28 Aug 1999 00:44:00 GMT
|
Viewed:
|
88 times
|
| |
| |
In lugnet.off-topic.geek, thou, Todd Lehman (lehman@javanet.com), hast wrytted...
> In lugnet.robotics, Don <don@server.cscmicro.com> (Don) writes:
>
> > unsusbscribe lego-robotics
>
> At first glance, I thought there was a bug in the news<->mail gateway which
> allowed this article through. But the word "unsubscribe" is actually
> misspelled. =:*p
>
> Anyone know of a good regex to catch mutations/misspellings of this word?
> According to the text indexer for the ngs, it's been misspelled quite a few
> ways so far:
>
> unsubcribe
> unsubscibe
> unsubsribe
> unsusbscribe
> unsuscribe
> unsuscripe
Mmmmm gotta love trying to outsmart the idiots. Here's something that might
help (this is a Perl5 regex):
unsu(s|[pb]|[pb]s|s[pb])c?\w*?i\w*?[pb]e
It also catches "unsubscirbe", which is a personal favorite. You may want
to make it more rigid, although there's some guessing going on there on ways
people may try to misspell it in the future. The problem with the word
"unsubscribe" is that wily "bscr" in the middle. And it's a prime example
of a word where people are likely to drop consonants, so when "sounding it
out" might likely misspell it (if they aren't already mistyping it).
a nefaireous mispeller,
Micah
--
perl -e's;^;nsiC%by:n19%o>o2z4on19%`"t2z`fn3o.%.2o.z.2o.g4og.5o.g.4o.4%;;\
s;$;f%.2o.n`fP"Y2>bP"Y2>bz`fzd2>t%`"Y>%`Pz)2>bgfP"Y2>bn%p#f8%.oP"#n%p#p;;\
s;$;.o>%d>(z#n____`Y>bod>Pt%`Yf2">o%__n;;s;#;ppfz;g;s;p;fg;g;s;_;ofo%;g;;\
s;z;2%;g;s;g;3%;g;s;f;3>;g;s;(\d+)(.);$2x$1;eg;y;t>%nC;\x278 \ng;;print;'
|
|
Message is in Reply To:
2 Messages in This Thread:
- Entire Thread on One Page:
- Nested:
All | Brief | Compact | Dots
Linear:
All | Brief | Compact
|
|
|
|