views:

133

answers:

7

Hi guys,

We have this website, something like a CMS with some extra stuff, it isn't relevant.

So people can register and write some stuff on their profile which is automatically published on the site. About 3 days ago, we start getting fake users whose profiles was just links to marketing sites or sites trying to sell you something. They are no bots, that for sure, they are humans registering users. I think some sort of "evil SEO company" is behind this.

So the only solution we thought of is not to publish automatically their profile until the admin approves it. But doing this everytime one person changes their profile doesn't sound very nice for the real users. So we thought just doing it the first time a user creates a profile. But that would only make them to work harder by creating a profile which seems "spam clear", getting it approved and then edit it to add the spam links.

I don't know if there's a real solution to this, I'm just asking this question to see how you would lead with this kind of people.

Thanks to all! Cheers

EDIT We already have a captcha control and email validation thing. I think they are humans. If they are bots, well, good job to those people, they have made a great bot =S

A: 

Take a look at re-captcha. With this you won't even need to build your own system...just integrate with there system!

Andrew Siemer
We are using it. They are humans, that's for sure.
emzero
+1  A: 

Use this:

http://recaptcha.net/

I wouldn't be so sure they're not bots just yet. Bots have begun creating more human-like profiles by lifting content off of other profiles and appending in a few of their own phrases like "go to my site" which makes it really difficult to tell. I'm skeptical that they aren't bots and if you don't have a captcha in place, you should before assuming that isn't the issue.

EDIT

Alright since you're already using captcha I'd recommend a flagging system. Some computation based on factors like how many users view their profile, how many of those users flag it as spam and weighted by how long those users have been around.

Spencer Ruport
Alright who's downvoting us?
Spencer Ruport
@Spencer: Not me, but probably someone who read this in the question: "They are no bots, that for sure, they are humans registering users."
RichieHindle
@Richie, bots have begun creating more human-like profiles by lifting content off of other profiles and appending in a few of their own phrases like "go to my site" which makes it really difficult to tell. I'm skeptical that they aren't bots and if he doesn't have a captcha in place, he should before assuming that isn't the issue.
Spencer Ruport
@Richie - Done. :)
Spencer Ruport
Good point, it is fairly easy to mistake a smart bot for a not-so-smart human. It is likely his spam is caused by bots.
Brandon
We are using it. They are humans, that's for sure.
emzero
Alright. Updated my answer.
Spencer Ruport
Yes, I like the idea of adding (besides the admin moderation) some sort of community moderation to flag users. I'll discuss that idea with my coworkers and client, thanks. +1 for you
emzero
+3  A: 

At some level you will have to use admin-intervention. My preference would be to allow anyone to create a profile, then let the admin retroactively block the user. When that becomes too difficult (if your site becomes popular) then start leveraging community support. Let people report profiles.

David Berger
+1  A: 

If these are humans, it will be pretty hard to fight. Try to find a pattern in what these users are doing and then when this pattern appears, run the user through a moderation process.

Maybe the emails look the same? Maybe they link toward the same websites (same extension etc)? Maybe they come from the same country? Same IP address?

Also use tricks such as email validation and so on. It's not too annoying for regular users, but it can be for people creating a lot of accounts.

It is also possible that these are not humans. Try putting some simple semantic captcha and see if you get less spam.

marcgg
Oh, I forgot the email validation one. Forcing users to have a working, validated email address before allowing them to do any more than log in and change their password might help, as might limiting accounts to one per email. Then again, throwaway/spammable email accounts are so easy to create that, again, this is only going to be a speed bump.
Rob
We have already a captcha control, so they're not bots. We have already a email validation, but as Rob said, again, having lots of email accounts for this purpose is not hard. So it won't help.By the way, I couldn't find any pattern yet, they are mails from several domains, gmail, hotmail, aol and some custom domain names. We cant filter those of course. IP addresses are just random. I think it's hard to deal with these people...
emzero
+1  A: 

There are a few different options for combating link spam in user submissions, including:

  • Support administrator blocking of accounts and profiles
  • Restrict external link domains to those on a whitelist
    • Reject those on a blacklist
    • Some sort of manual approval for "unseen" domains
  • Throttle user creation/profile modification from the same IP to a sensible number per day
  • Implement a captcha

You'll likely want to adopt some combination of these. With regards to making decisions based on IP addresses, remember that a lot of users on various ISPs will be rotated amongst proxies, etc. between requests (AOL, for example), and be aware of the problems of blocking or throttling single IP addresses which could conceal a large number of legitimate users.

Rob
A: 

At least one Wiki system (MoinMoin) keeps an online list of known spam phrases: http://moinmo.in/BadContent

You could use that to determine whether to mark a new profile for admin approval, or to let it through automatically. It wouldn't catch everything, but it would catch some spammers while being unlikely to affect any legitimate users.

RichieHindle
I'll take a look about that, it might help a little bit as you said. Thank you
emzero
+1  A: 

You could add a community controlled content moderation system. Something similar to what is implemented on StackOverflow where the longer and more frequently a user participates the more power and trust they have. Then when a stupid user or a smart bot signs up on your site with inappropriate data they can get flagged or banned.

Andrew Siemer
Yeap, I like that idea =) +1
emzero