Hi all, Basically, i want my site to aggregate a lot of rss feeds and store them in database during cron job. i use magpie to parse the rss into arrays...everything should seem straight forward although im worried about duplication issues when running the cron job.
what is the best solution to avoid duplicate entries.... here is my theory although i dnt think its efficient.
cron job theory
1) parse rss feed with magpie 2) create md5 hash of link 3) test for existance of md5 in database table... if not ... insert .. if exists ignore or update
lemme know if there is a more efficient way