hashcat Forum

Getting "unruly": Finding base words
This is what I use to find base words in a list of plains. I am posting it both to share and to see if others have ideas for improving it.

Code:
tr A-Z a-z <plains | sed 's/^[^a-z]*//; s/[^a-z]*$//; y/1234567890@$!#/izeasbzbgoasih/; s/[^a-z]//g; /^$/d' >basewords

A few explanations:

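First, I use tr instead of sed to convert uppercase to lowercase, both because it's much faster and because it plays better with Unicode.

I then strip all non-alpha chars from the beginning and end of the line. This happens before the l33t step, so leading and trailing digits (like the 1979 in l33t1979) are dropped instead of being translated into letters.

Then I apply common l33t substitutions with sed's y command (this can probably be improved).

Then I strip out any remaining chars that are not lowercase letters, and delete any empty lines.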
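
For anyone who wants to tweak individual steps, here is a rough sketch of the same pipeline as a commented shell function. The name basewords_filter is made up for illustration. It maps each character once, assuming 1 means i and 7 means z; POSIX leaves y undefined when a source character repeats, so swap in l or v if those fit your plains better. It also leaves $, ! and # unescaped, since a backslash before them inside y is not portable.

```shell
# Sketch only: basewords_filter is a hypothetical name, not part of any tool.
# Reads plains on stdin (one per line) and writes base words to stdout.
basewords_filter() {
    # 1. Lowercase the ASCII letters.
    tr 'A-Z' 'a-z' |
    sed '
        # 2. Strip non-letters from the start and end of each line.
        s/^[^a-z]*//
        s/[^a-z]*$//
        # 3. Undo common l33t substitutions (assumed table: 1 to i, 7 to z).
        y/1234567890@$!#/izeasbzbgoasih/
        # 4. Drop anything that is still not a lowercase letter.
        s/[^a-z]//g
        # 5. Delete lines that ended up empty.
        /^$/d
    '
}
```

Usage would be something like basewords_filter <plains >basewords. Keeping one sed command per line makes it easier to try a different substitution table or to reorder steps 2 and 3.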

Example: take the following plains:

Code:
l33t1979
h4$hcaT2012
39bananas
69cockmaster69

Becomes:

Code:
leet
hashcat
bananas
cockmaster

All comments, thoughts, and flames welcome.
Nice work there, epixoip!

I am very interested to see if anyone here can help improve this as it is something I am hoping to be able to do.

Unfortunately, you are way ahead of me, so I don't think I can contribute much apart from occasionally bumping this thread! :)
Hi epixoip,

Just to let you know that your efforts on this were not in vain! :)

We have managed to inspire Blazer to add his own version of this to ULM.

He likes to do things his own way so it will be interesting to see the results.
Right on :)