views:

126

answers:

4

Recently I have read about hash-tables in a very famous book "Introduction to Algorithms". I haven't used them in any real applications yet, but wanted to. But don't know how to start.
Can anyone give me some samples of using it, for example, how to realize dictionary application (like ABBYY Lingvo) using hash-tables?
And finally wanted to know what is the difference between hash-tables and associative arrays in PHP, I mean which technology should I use and in which situations?
If I am wrong (I beg perdon) please correct me, because actually I am starting with hash-tables and I know just basic (theoretical) knowledge about them.
Thanks a lot.

+9  A: 

php arrays ARE basically hash tables

kgb
Edit: Ah - beat me to it :)+1.
Cam
+7  A: 

In PHP, associative arrays are implemented as hashtables, with a bit of extra functionality.

However technically speaking, an associative array is not identical to a hashtable - it's simply implemented in part with a hashtable behind-the-scenes. Because most of its implementation is a hashtable, it can do everything a hashtable can - but it can do more, too.

For example, you can loop through an associative array using a for loop, which you can't do with a hashtable.

So while they're similar, an associative array can actually do a superset of what a hashtable can do - so they're not exactly the same thing. Think of it as hashtables plus extra functionality.

Code examples:

Using an associative array as a hashtable:

$favoriteColor = array();
$favoriteColor['bob']='blue';
$favoriteColor['Peter']='red';
$favoriteColor['Sally']='pink';
echo 'bob likes: '.$favoriteColor['bob']."\n";
echo 'Sally likes: '.$favoriteColor['Sally']."\n";
//output: bob likes blue
//        Sally likes pink

Looping through an associative array:

$idTable=array();
$idTable['Tyler']=1;
$idTable['Bill']=20;
$idTable['Marc']=4;
//up until here, we're using the array as a hashtable.

//now we loop through the array - you can't do this with a hashtable:
foreach($idTable as $person=>$id)
    echo 'id: '.$id.' | person: '.$person."\n";

//output: id: 1 | person: Tyler
//        id: 20 | person: Bill
//        id: 4 | person: Marc

Note especially how in the second example, the order of each element is maintained (Tyler, Bill Mark) based on the order in which they were entered into the array. This is a major difference between associative arrays and hashtables. A hashtable maintains no connection between the items it holds, whereas a PHP associative array does (you can even sort a PHP associative array).

Cam
Hmmm, such a short explanation. So they are **ABSOLUTELY** the same thing?
Bakhtiyor
If yes, we need to thank PHP for that.
Bakhtiyor
@Bak They aren't in general, but they are in PHP, which plays a bit fast and loose with data structures since there's less of a concern over performance
Michael Mrozek
I see, but in this case why are there so many algorithms for hash functions and stuff like that, if hash function=arrays?
Bakhtiyor
@Michael you make it sound like a disadvantage? It makes PHP different, but I think it's a good difference.
@Bakhityor: I have added to my answer in response to your comment.
Cam
@incrediman. Thanks for additional info. What I have get from your explanation is that I must forget about hash-tables and use and think associative arrays, right? Because associative arrays are hashtable+extra functionality and it makes it more powerful.
Bakhtiyor
@Bakhityor: Your last sentence is perfect. You don't need to 'forget' about hashtables though - in fact it's great you already understand hashtables, because now you can apply that knowledge to associative arrays. I'm adding some examples to my answer to further clarify thing for you.
Cam
@incrediman. Perfect. I am looking forward to see, analyze and remember your example in order to close this subject for myself. Thanks again, your are really **incredible man** :)
Bakhtiyor
+1  A: 

An associative array is an array where you don't access elements by an index, but by a key. How this works internally is implementation specific (there is no rule how it must work). An associative array could be implemented by a hash table (most implementations will do that), but it could also be implemented by some sort of tree structure or a skip list or the algorithm just iterates over all elements in the array and looks for a key that matches (this would be awfully slow, but it works).

A hash table is a way how to store data where values are associated to keys and where you intend to find values for keys within a (usually almost) constant time. This sounds exactly like what you expect of an associative array, that's why most of the time hash tables are used for implementing those arrays, but that is not mandatory.

Mecki
+2  A: 

The difference between an associative array and a hash table is that an associative array is a data type, while a hash table is a data implementation. Obviously the associative array type is very important in many current programming languages: Perl, Python, PHP, etc. A hash table is the main way to implement an associative array, but not quite the only way. And associative arrays are the main use of hash tables, but not quite the only use. So it's not that they are the same, but if you already have associative arrays, then you usually shouldn't worry about the difference.

For performance reasons, it can be important to know that your associative arrays in your favorite language are implemented as hashes. And it can be important to have some idea of the overhead cost of that implementation. Hash tables are slower and use more memory than linear arrays as you see them in C.

Perl lumps the two concepts together by calling associative arrays "hashes". Like a number of features of Perl, it isn't quite wrong, but it's sloppy.

Greg Kuperberg