egghelp.org community Forum Index
[ egghelp.org home | forum home ]
egghelp.org community
Discussion of eggdrop bots, shell accounts and tcl scripts.
 
 FAQFAQ   SearchSearch   MemberlistMemberlist   UsergroupsUsergroups   RegisterRegister 
 ProfileProfile   Log in to check your private messagesLog in to check your private messages   Log inLog in 

Soundex can be used as a drone scorer

 
Post new topic   Reply to topic    egghelp.org community Forum Index -> Scripting Help
View previous topic :: View next topic  
Author Message
awyeah
Revered One


Joined: 26 Apr 2004
Posts: 1580
Location: Switzerland

PostPosted: Mon Jul 09, 2007 10:46 pm    Post subject: Soundex can be used as a drone scorer Reply with quote

@Just to let people working on drones know:

I found this algorithm earlier from wikipedia. If this can be implmented into TCL, it can be used as a scorer for drone nicks.

Quote:

The Soundex code for a name consists of a letter followed by three numbers: the letter is the first letter of the name, and the numbers encode the remaining consonants.

Similar sounding consonants share the same number so, for example, the labial B, F, P and V are all encoded as 1. Vowels can affect the coding, but are never coded directly unless they appear at the start of the name.

The exact algorithm is as follows:

Retain the first letter of the string
Remove all occurrences of the following letters, unless it is the first letter: a, e, h, i, o, u, w, y
Assign numbers to the remaining letters (after the first) as follows:
b, f, p, v = 1
c, g, j, k, q, s, x, z = 2
d, t = 3
l = 4
m, n = 5
r = 6
If two or more letters with the same number were adjacent in the original name (before step 1), or adjacent except for any intervening h and w (American census only), then omit all but the first.

Return the first four characters, right-padding with zeroes if there are fewer than four.
Using this algorithm, both "Robert" and "Rupert" return the same string "R163" while "Rubin" yields "R150".


Source: http://en.wikipedia.org/wiki/Soundex
_________________
·­awyeah·

==================================
Facebook: jawad@idsia.ch (Jay Dee)
PS: Guys, I don't accept script helps or requests personally anymore.
==================================
Back to top
View user's profile Send private message Send e-mail Visit poster's website Yahoo Messenger MSN Messenger
Sir_Fz
Revered One


Joined: 27 Apr 2003
Posts: 3793
Location: Lebanon

PostPosted: Wed Jul 11, 2007 7:15 am    Post subject: Reply with quote

Soundex Tcl Package. But it won't help with detecting drones.
_________________
Follow me on GitHub

- Opposing

Public Tcl scripts
Back to top
View user's profile Send private message Visit poster's website
r0t3n
Owner


Joined: 31 May 2005
Posts: 507
Location: UK

PostPosted: Wed Jul 11, 2007 1:18 pm    Post subject: Reply with quote

Maybe if someone makes a system, it could be used to score against drones. Just need to find a way of matching the outcome to being a trojan/drone nickname...?
_________________
r0t3n @ #r0t3n @ Quakenet
Back to top
View user's profile Send private message MSN Messenger
awyeah
Revered One


Joined: 26 Apr 2004
Posts: 1580
Location: Switzerland

PostPosted: Wed Jul 11, 2007 8:34 pm    Post subject: Reply with quote

I searched the forum for soundex and found one match. A post by user in a drone matching topic regarding regexp, where he advised soundex to be used with a dictionary for matching drones. And we all know that user is never wrong. Smile

Quote:

Here's a great idea which i'm about to patent:
Look the nick up in a dictionary. If not found; kick the person telling them to correct the spelling error.

Seriously though...
Have you tried using soundex? I think it would work much better than just relying on the random string not containing certain chars. I suggest using a soundex "dictionary", a list or array of soundexes based on the words from a real dictionary + common names. This dictionary could then be searched to see if the soundex of a nick == some existing word's soundex, which should weed out most random strings.

Then you could have a mechanism for re-entering the room that you tell about in the kick reason, so those not meant to be kicked can rejoin. (a /msg command or what ever)

This is of course just theory and fairly cpu/memory intensive stuff, but IMO worth a try


Quote:

That's the beauty of my method (I think/hope), because even if they're not real words most nicks SOUND like real words. Except strange nicks like where a number becomes a couple of chars in a word etc...but these could be translated to chars with a 'string map' before the soundex is made (in most cases) and the rest would be allowed to re-enter through the /msg command


Taken from this thread:
http://forum.egghelp.org/viewtopic.php?t=4982
_________________
·­awyeah·

==================================
Facebook: jawad@idsia.ch (Jay Dee)
PS: Guys, I don't accept script helps or requests personally anymore.
==================================
Back to top
View user's profile Send private message Send e-mail Visit poster's website Yahoo Messenger MSN Messenger
Sir_Fz
Revered One


Joined: 27 Apr 2003
Posts: 3793
Location: Lebanon

PostPosted: Wed Jul 11, 2007 8:49 pm    Post subject: Reply with quote

Now that's a different story, it's not just checking soundex over nicknames - there's a dictionary involved and I agree that it would work but far too much effort would be done over a really not that big of a problem.
_________________
Follow me on GitHub

- Opposing

Public Tcl scripts
Back to top
View user's profile Send private message Visit poster's website
r0t3n
Owner


Joined: 31 May 2005
Posts: 507
Location: UK

PostPosted: Fri Jul 13, 2007 11:16 am    Post subject: Reply with quote

And where do we find a soundex dictionary, or how make one. It will to too time consuming to a human to read and enter each word from a dictionary into soundex and store it in a file/db.
_________________
r0t3n @ #r0t3n @ Quakenet
Back to top
View user's profile Send private message MSN Messenger
awyeah
Revered One


Joined: 26 Apr 2004
Posts: 1580
Location: Switzerland

PostPosted: Fri Jul 13, 2007 1:23 pm    Post subject: Reply with quote

Ask user, he's the one who suggested it, so he would know better. I don't think you need to make a dictionary, thats obviously ridiculous.. maybe use one from a tcl library or maybe from the web.
_________________
·­awyeah·

==================================
Facebook: jawad@idsia.ch (Jay Dee)
PS: Guys, I don't accept script helps or requests personally anymore.
==================================
Back to top
View user's profile Send private message Send e-mail Visit poster's website Yahoo Messenger MSN Messenger
Display posts from previous:   
Post new topic   Reply to topic    egghelp.org community Forum Index -> Scripting Help All times are GMT - 4 Hours
Page 1 of 1

 
Jump to:  
You cannot post new topics in this forum
You cannot reply to topics in this forum
You cannot edit your posts in this forum
You cannot delete your posts in this forum
You cannot vote in polls in this forum


Forum hosting provided by Reverse.net

Powered by phpBB © 2001, 2005 phpBB Group
subGreen style by ktauber