Jump to content


MD5 Tutorial


1 reply to this topic

#1 php3ch0

    Advanced Member

  • Members
  • PipPipPip
  • 162 posts

Posted 06 March 2006 - 02:25 PM

What is MD5 or MD5sum?

Its a formula - a way to take a message of an arbitrary length, and create a 128-bit "fingerprint" or "message digest" of the message. MD5 is a way to verify data integrity. On these forums, it comes up fairly often in discussions about storing user passwords and other sensitive data.

Is MD5 encryption?

No. It is simply a one-way fingerprint of the message. It doesn't include the original message, and you can't (generally) use the fingerprint (the md5sum) to 'figure out' the original message.

Okay, so you take a message - like a password - and generate an MD5sum from it.. Can't you brute-force that?

Like any password system, you could attempt to brute force the answer. However, MD5sum's are in a 128-bit space, meaning that to brute force it would take 2^128 attempts - thats over 3 with 38 zeroes after it.

Neat! Thats a lot. Are there any flaws in the algorithm that could speed it up?

A birthday attack is based on the theory that there *might* be *one* md5sum that matches multiple inputs. In theory, it is possible that a "birthday" attack could be possible - two md5sum hashes could be the same. But even then, the total number of brute forces is at 2^64 attempts - still a heck of a lot.

Okay. But couldn't (insert super-sneaky government agency here) build an md5 dictionary, and know what the password was with the md5?

Yes. Its entirely possible. However - it would take some work to do so. For example, just for a dictionary consisting of Alphabet letters (upper and lower), and numbers, there would be 46,656,000,000 entries - all at 32 characters each. Thats over 1 terabyte of data to store and search! It could be done - absolutely. But is it likely?

So its hard to brute force, what about dictionary attacks?

Dictionary attacks are a way of attacking poor passwords - most people use words in their passwords. If you can guess the word - for example, "love", then you can cut down the number of tries it would take. Of course if you guess right, then your # of attacks = 1. However, in general, using common computers as of the writing of this (2003), you can generally get roughly 5 million attacks per second, or fast enough to guess all 8-character Alphanumericals within 497 days.

Thats pretty strong - but is there anything stronger?

A similar method is SHA1 - a more secure 160-bit hashing algorithm. That makes it *much* more secure against brute-force, birthday attacks, and other forms of assault. There are yet more hashing algorithms that are even stronger - but MD5 and SHA1 are both natively supported in the latest PHP, and should be sufficient for most projects.

Allright - I'm sold. Tell me how to use it to store passwords and check them

There are three things we are protecting against - the stored passwords, the transmission of the passwords, and the replay of the password. Each is very different. Lets start with the stored password. We need to take a password, and store it in a variable. Then we need to check that variable against what the user entered:


$secret_password = md5("LOVE"); 

if (md5($_POST['password']) == $secret_password) 
{ 
    echo "Correct password"; 

} else { 

    echo "Incorrect password"; 
}   


Simple enough. However, the password is being sent cleartext in $_POST['password']. Which brings us to another thing to protect against - the cleartext transmission. Thankfully, there is an opensource (GPL'd) javascript MD5 implementation available online. If you use that javascript library to md5 the password before sending it, the server code would look like this instead:


$secret_password = md5("LOVE"); 

if ($_POST['password'] == $secret_password) 
{ 
    echo "Correct password"; 

} else { 

    echo "Incorrect password"; 
}   


Again, fairly simple. However, I mentioned the other problem - replay attacks. If someone could manage to 'sniff' the connection, and capture the md5sum, they could simply use that to login!

The solution to that can be very complex and involved - the same site for the javascript md5 function goes into great detail discussing how to implement a truly secure solution. It's called a "CHAP" login system, and here is a link to his page on it - including complete working PHP and javascript code to implement it.

MD5 is a very useful means to protect user's passwords online - if used correctly. Its not encryption, but it does help prevent whole databases of passwords being compromised.

#2 Kuulest

    Advanced Member

  • Members
  • PipPipPip
  • 326 posts

Posted 06 March 2006 - 05:23 PM

Very informative indeed, I loved the section regarding cracking the md5...
I've slight (it's illegial!!) experience with cracking/hacking myself, so I can add the following:
If you use some common word in English (other languages do not have that hude dictionary databases yet) as your password, dictionary attack would put you down. Just consider the fact that password crackers are also humans, and they do run dictionary attacks using dictionaries containing words LIKELY used as password (that being all words used in Holy Bible, Quran, Greek Mythology, Astrology, Bestselling games/movies/cartoons/books etc). I once come accross the server containing approximatelly 5 K md5()'ed passwords. Do you know the hit rate? More then half of them were down after dictionary attack (I have quite huge dictionaries). I started the decipher operation in evening and by morning I had ~70% of passwords.

That being said, ALWAYS encrypt your passwords when stored in database...I know that academically speaking md5 is not an encryption algorithm, but rather a message digest, still it would work to hide your original password very well..you would still be able to athenticate the user (even though you do not know his actual password, but have its digest), and hackers would need an extra mile in order to get the passwords...your server could be broken, and database stolen/accessed (consider the fact that hacker got a read access to your database, and could view what is in user list tables) and if passwords are plain text, then all user accounts are endangered immediately. If some passwords from your db do not survive dictionary attack, then well it's not your fault..you did all reasonable steps to ensure that data is not accessible, if user of your system picks a poor password then (s)he is to blame;)

Thanks,
Torio





1 user(s) are reading this topic

0 members, 1 guests, 0 anonymous users