How to set UTF-8 charset encoding in .hcchr file? - Printable Version +- hashcat Forum (https://hashcat.net/forum) +-- Forum: Deprecated; Previous versions (https://hashcat.net/forum/forum-29.html) +--- Forum: General Help (https://hashcat.net/forum/forum-8.html) +--- Thread: How to set UTF-8 charset encoding in .hcchr file? (/thread-3244.html) |
How to set UTF-8 charset encoding in .hcchr file? - Wierus - 03-21-2014 I need to have a russian alphabet in UTF-8 encoding. For test I created a file "ru_utf8.hcchr" with two letter "CYRILLIC CAPITAL LETTER A" and "CYRILLIC CAPITAL LETTER BE". Hex content of file: D0 90 D0 91. Size of file: 4 bytes. It means that file is in UTF-8 encoding. Then I calculated MD5 of this file and save it to "secret.md5". This MD5 is the same as MD5 of string in UTF-8 "ÐБ" (russian letters of cource). And then I run hashcat (version 0.47) with follow command line: Code: hashcat-cli32 -m 0 secret.md5 -a 3 -1 ru_utf8.hcchr ?1?1 Code: Input.Mode: Mask (?1) [1] Code: hashcat-cli32 -m 0 secret.md5 -a 3 -1 ru_utf8.hcchr ?1?1?1?1 Solution with the decomposition of characters into 2 bytes are not satisfied, because in this case, there are added a lot of non-existent characters and more extra bruteforce attempts. RE: How to set charset encoding in .hcchr file? - KT819GM - 03-21-2014 I think it's already done what you try to achieve: https://github.com/Rub3nCT/perl-hashcat-utils RE: How to set UTF-8 charset encoding in .hcchr file? - Wierus - 03-21-2014 Thank you for link, but it is doesn't solved my problem. But it helps me to deeply understand the algorithm of the hashcat. I downloaded file "Russian.charset" from link above and try to bruteforcing with it. Hashcat says to me: Code: Input.Mode: Mask (?1) [1] This is not 67 symbols. This is 67 different bytes in file. Such coincidence. This means that for each byte of the searched word, hashcat will bruteforce 67 bytes from custom charset. It is very wasteful when using UTF-8 encoding. How did you solve this problem? It would be perfect if the hashcat could take one symbol represented by multiple bytes, and inserts it also as multiple bytes in the searched word for computing the hash. Also I read the topic: https://hashcat.net/forum/thread-2613.html It's good solution, but it is very difficult to apply for Russian language. |