Simple Idea for Comment Spam
The reason weblogs get comment spam is because the comment has a URL attached to it, and that URL has an increased page rank as a result. The URL is very useful because it helps identify the commenter.
There have been two types of solution implemented to deal with comment spam; a moderation/registration system such as “Typekey”:http://www.typekey.com/, or using a redirect on the URL so that page rank isn’t affected.
Redirection, when “done incorrectly”:http://www.movabletype.org/news/2004_01.shtml#000882 takes away the usefulness of the URL, by disassociating the identify of the commenter. It also still allows a comment spam URL to appear and to be clicked on. Just because it goes through redirection, you still get to the spammer’s intended website. Redirection also breaks referral information (and stops the Tab Groups in Firefox working properly but that is a minor issue that only exists for me.)
Moderation/registration seems like overkill in this situation. The problem is the URL, not the comment. Why moderate the comment when all you have to do is moderate the URL?
The system goes like this: when a comment is posted, the text of the comment goes through and is posted to the website. The link is moderated. The moderator can allow the URL to appear at a later date. The system keeps a list of approved URLs and these appear on the weblog straight away. Blacklisted URLs are automatically deleted, and you can decide to delete the comments associated with blacklisted URLs.
The key point is that the comment itself isn’t held up whilst the weblog owner is moderating the comment, there is no registration to dissuade the idle commenter and no kittens are killed.
I couldn’t agree more. Comment spam is a problem Google should help solve. More: http://www.powazek.com/2003/11/000273.html
95% of the urls submitted to my site are ones that I want to let displayed. I would hate to have to approve all of them
How many unique URLS in that 95%? The way I see it is that you get sent an email (like MT-Blacklist) and click a link to approve that URL, but you approve the URL once only.
I imagine that most of the work would only be done at the very start. Plus, if you get a new URL in your comments, you go have a look, don’t you? I have a look, and sometimes its because I can’t tell whether the comment is spam or not. Visiting the URL can confirm this, you would have to do this sort of extra work anyway to keep your comments clean.
If you have a low comment spam ratio it may be a solution that doesn’t need applying. Same could then be said for a lot of the other solutions.
Derek, yes, Bruce Schneier recently said (http://www.itconversations.com/shows/detail119.html ) that spam won’t be solved by technological means, but by economical one. In the world of comment spam, the economy is Google.
That’s a solid way of doing it. Nice idea!
I had a few similar thoughts a wee while back:
http://www.1976design.com/blog/archive/2003/11/16/spam/
why you do not encode the url base64_encode and decode with javascript in href ?. URL Spam is driven is Google. Unfortunately from this point of view G has modified in a wrong way the shape of the web.
Ah, Marica, you don’t fool me. I don’t want to help you sell cameras thank you