ansaurus

Question

PHP: make a unique hash of an RSS description

Answer 1

+1 A:

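To avoid false duplicates, use a hash with a wide output, such as MD5 (128 bits) or SHA-1 (160 bits). Neither is still considered cryptographically secure against deliberately crafted collisions, but accidental collisions between feed items are extremely unlikely with either.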

Albin Sunnanbo 2010-08-15 19:47:22

Answer 2

+1 A:

MD5 is fast and produces a hash that is 32 hexadecimal characters long.

<?php
// Hash the item's description together with its title to get a 32-character hex ID.
$hash = md5($description . $title);
?>

I used it in my RSS parser for exactly the same purpose, and it works like a charm.

shamittomar 2010-08-15 19:49:57
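
A minimal sketch of the approach in this answer, written as a reusable function. The name feed_item_id, the trimming step, and the NUL separator are assumptions added for illustration and are not part of the original answer; the separator is there so that two different title/description pairs cannot concatenate to the same string.

```php
<?php
// Hypothetical helper (not from the thread): build a stable ID for a feed item.
function feed_item_id(string $title, string $description): string
{
    // Trim surrounding whitespace so trivial formatting changes do not alter the ID.
    $title = trim($title);
    $description = trim($description);

    // Join with a NUL byte so ("ab", "c") and ("a", "bc") hash differently.
    return md5($title . "\0" . $description);
}

$id = feed_item_id('Example title', 'Example description');
echo strlen($id), "\n"; // 32
```

The resulting string could be stored in a unique column and checked before inserting a new item, which is one way to skip items that were already imported.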

Thanks for all your answers, but I think I'll take shamittomar's answer, as it's 32 characters long, uses MD5, and he understood my question and has been through a similar problem.

Sir Lojik 2010-08-15 19:54:59

Answer 3

+2 A:

sprintf('%u', crc32()) produces 4,294,967,296 possible values, and it's shorter than MD5 or SHA-1: it's only 32 bits wide.

stillstanding 2010-08-15 19:50:48

You should pass the string as the argument of `crc32`, of course.

Daniel 2010-08-15 19:52:30

It's the OP's choice. He can use dechex(sprintf('%u', crc32())) if he wants a hex string, or a plain left-zero-padded value for pure decimal digits.

stillstanding 2010-08-15 19:55:02
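
The CRC-32 variants discussed above could be sketched as follows. The helper names crc32_decimal and crc32_hex are hypothetical; crc32() and sprintf() are standard PHP functions. Using the %08x format is an assumption that replaces the dechex() call from the comment, since it pads the result to a fixed width in one step.

```php
<?php
// Hypothetical helper names; crc32() and sprintf() are standard PHP functions.
function crc32_decimal(string $text): string
{
    // %u formats the checksum as an unsigned decimal, which matters on 32-bit
    // builds where crc32() can return a negative integer.
    return sprintf('%u', crc32($text));
}

function crc32_hex(string $text): string
{
    // %08x pads the hexadecimal form to a fixed width of 8 digits.
    return sprintf('%08x', crc32($text));
}

echo crc32_hex('123456789'), "\n"; // cbf43926 (the standard CRC-32 check value)
```

Both helpers return strings, so the value can be used directly as an identifier regardless of the platform's integer size.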

Hmm... 32 bits wide. Thanks for this solution.

Sir Lojik 2010-08-15 19:59:07

@Daniel: does that output an integer or a string?

Sir Lojik 2010-08-15 20:01:11

Remember, as @stillstanding wrote, "the less number of characters generated by the hash function, the more likely you'll have collisions in your identifiers. Be certain about that." MD5 is 128 bits wide, so it gives far more possible identifiers and a much lower chance of collisions.

shamittomar 2010-08-15 20:02:30
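
To put rough numbers on the collision argument above, here is a sketch using the birthday approximation. The function name collision_probability is hypothetical, and the estimate assumes hash values are spread uniformly, which is an idealization (CRC-32 in particular is a checksum, not a general-purpose hash).

```php
<?php
// Approximate chance that at least two of $items distinct inputs share a hash
// value of $bits bits, using the birthday approximation
// 1 - exp(-n(n-1) / 2^(bits+1)). Assumes uniformly distributed hash values.
function collision_probability(int $items, int $bits): float
{
    $space = pow(2.0, $bits);
    // -expm1(-x) equals 1 - exp(-x) but stays accurate when x is tiny.
    return -expm1(-($items * ($items - 1)) / (2.0 * $space));
}

printf("%.4f\n", collision_probability(100000, 32));  // 32-bit hash such as CRC-32
printf("%.3e\n", collision_probability(100000, 128)); // 128-bit hash such as MD5
```

Under this model, a 32-bit hash reaches about an even chance of at least one collision at roughly 77,000 items, while a 128-bit hash stays negligibly close to zero at that scale.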

How about similar_text()? Is that worth doing?

Sir Lojik 2010-08-15 20:21:48

@Sir Lojik: Returns an integer.

Daniel 2010-08-16 10:16:16
