06-01-2016, 12:19 PM
Yes, I had already read it. It's not a good solution, and when I read it it doesn't seem the author knows the difference between UTF8 and UNICODE. With UTF8, a single character is encoded in one, two, three or four bytes. An "a" is encoded just like the ASCII "a", but as I demonstrated, a "€" is encoded with the three byte sequence "e2 82 ac".
If all UTF8 bytes were encoded with a "base code" and a "character code" as in the blog you mentioned, the solution of something like ?1?2?1?2?1?2 for a 3 character password could be possible, as suggested. But how are you going to do this if the alphabet contains characters that are one, two or three bytes long, and you want to crack a password up to 8 characters?
If all UTF8 bytes were encoded with a "base code" and a "character code" as in the blog you mentioned, the solution of something like ?1?2?1?2?1?2 for a 3 character password could be possible, as suggested. But how are you going to do this if the alphabet contains characters that are one, two or three bytes long, and you want to crack a password up to 8 characters?