How do you deal with preventing bots from spamming your forum?

Re: How you deal with preventing bots from spamming your for

I have a 99.9% bot-proof security measure that was custom coded by a friend of mine. Seriously, before that we had CAPTCHA and everything and still got a few spam bots per week; now we get about 2 a year, and those are humans. We will give it to you for $5.
 
Re: How you deal with preventing bots from spamming your for

iPhonefreak said:
I have a 99.9% bot-proof security measure that was custom coded by a friend of mine. Seriously, before that we had CAPTCHA and everything and still got a few spam bots per week; now we get about 2 a year, and those are humans. We will give it to you for $5.

We may have to talk. For the meantime, I've switched to Q&A in place of a captcha.
 
Re: How you deal with preventing bots from spamming your for

I just set up a good Q&A. Make sure that the question is not something that can easily be googled, or is common knowledge (such as "What colour is the sky?"). This has helped me a lot. Yes, we might get the odd one or two every now and again, but that's to be expected really.
 
Re: How you deal with preventing bots from spamming your for

InRomoWeTrust said:
Anyone have any tips? Dealing with some bots atm getting through registration.

Which software are you referring to?
 
Re: How you deal with preventing bots from spamming your for

On Admin Forums we don't allow the use of links until people reach 20 posts. That alone stops most of the spammers 🙂
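
A minimal sketch of the "no links until 20 posts" rule described above, assuming the forum software lets you hook into post submission. The names `MIN_POSTS_FOR_LINKS`, `contains_link` and `may_submit` are hypothetical and not taken from any particular forum package, and the link pattern is deliberately rough.

```python
import re

# Hypothetical threshold, taken from the "20 posts" rule mentioned above.
MIN_POSTS_FOR_LINKS = 20

# Rough pattern for text that looks like a link; an assumption, not exhaustive.
LINK_PATTERN = re.compile(r"(https?://|www\.)\S+", re.IGNORECASE)


def contains_link(body: str) -> bool:
    """Return True if the post body appears to contain a URL."""
    return LINK_PATTERN.search(body) is not None


def may_submit(post_count: int, body: str) -> bool:
    """Allow any post from established members; reject links from new ones."""
    if post_count >= MIN_POSTS_FOR_LINKS:
        return True
    return not contains_link(body)
```

For example, `may_submit(3, "cheap pills http://spam.example")` is rejected, while the same member can still post plain text.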
 
Re: How you deal with preventing bots from spamming your for

ShadyX said:
On Admin Forums we don't allow the use of links until people reach 20 posts. That alone stops most of the spammers 🙂

That's once they have registered. Would you mind sharing what you do to prevent them from registering?
 
Re: How you deal with preventing bots from spamming your for

One way of stopping them completely is to add their IPs to the IP Deny Manager in your cPanel.

Here is a list of the worst bots to add to your robots.txt. Beware that some bots ignore robots.txt, so this is not a complete solution. The IP Deny Manager is, but it is a lot more laborious.

Code:
User-agent: http://www.almaden.ibm.com/cs/crawler
Disallow: /
User-agent: NPBot
Disallow: /
User-agent: TurnitinBot
Disallow: /
User-agent: EmailCollector
Disallow: /
User-agent: EmailWolf
Disallow: /
User-agent: CopyRightCheck
Disallow: /
User-agent: Black Hole
Disallow: /
User-agent: Titan
Disallow: /
User-agent: NetMechanic
Disallow: /
User-agent: CherryPicker
Disallow: /
User-agent: EmailSiphon
Disallow: /
User-agent: WebBandit
Disallow: /
User-agent: Crescent
Disallow: /
User-agent: NICErsPRO
Disallow: /
User-agent: SiteSnagger
Disallow: /
User-agent: ProWebWalker
Disallow: /
User-agent: CheeseBot
Disallow: /
User-agent: ia_archiver
Disallow: /
User-agent: ia_archiver/1.6
Disallow: /
User-agent: Teleport
Disallow: /
User-agent: TeleportPro
Disallow: /
User-agent: Wget
Disallow: /
User-agent: MIIxpc
Disallow: /
User-agent: Telesoft
Disallow: /
User-agent: Website Quester
Disallow: /
User-agent: WebZip
Disallow: /
User-agent: moget/2.1
Disallow: /
User-agent: WebZip/4.0
Disallow: /
User-agent: Mister PiX
Disallow: /
User-agent: WebStripper
Disallow: /
User-agent: WebSauger
Disallow: /
User-agent: WebCopier
Disallow: /
User-agent: NetAnts
Disallow: /
User-agent: WebAuto
Disallow: /
User-agent: TheNomad
Disallow: /
User-agent: WWW-Collector-E
Disallow: /
User-agent: RMA
Disallow: /
User-agent: libWeb/clsHTTP
Disallow: /
User-agent: asterias
Disallow: /
User-agent: httplib
Disallow: /
User-agent: turingos
Disallow: /
User-agent: spanner
Disallow: /
User-agent: InfoNaviRobot
Disallow: /
User-agent: Harvest/1.5
Disallow: /
User-agent: Bullseye/1.0
Disallow: /
User-agent: Mozilla/4.0 (compatible; BullsEye; Windows 95)
Disallow: /
User-agent: Crescent Internet ToolPak HTTP OLE Control v.1.0
Disallow: /
User-agent: CherryPickerSE/1.0
Disallow: /
User-agent: CherryPickerElite/1.0
Disallow: /
User-agent: WebBandit/3.50
Disallow: /
User-agent: DittoSpyder
Disallow: /
User-agent: SpankBot
Disallow: /
User-agent: BotALot
Disallow: /
User-agent: lwp-trivial/1.34
Disallow: /
User-agent: lwp-trivial
Disallow: /
User-agent: Wget/1.6
Disallow: /
User-agent: BunnySlippers
Disallow: /
User-agent: URLy Warning
Disallow: /
User-agent: Wget/1.5.3
Disallow: /
User-agent: LinkWalker
Disallow: /
User-agent: cosmos
Disallow: /
User-agent: moget
Disallow: /
User-agent: hloader
Disallow: /
User-agent: humanlinks
Disallow: /
User-agent: LinkextractorPro
Disallow: /
User-agent: Offline Explorer
Disallow: /
User-agent: Mata Hari
Disallow: /
User-agent: LexiBot
Disallow: /
User-agent: Web Image Collector
Disallow: /
User-agent: The Intraformant
Disallow: /
User-agent: True_Robot/1.0
Disallow: /
User-agent: True_Robot
Disallow: /
User-agent: BlowFish/1.0
Disallow: /
User-agent: JennyBot
Disallow: /
User-agent: MIIxpc/4.2
Disallow: /
User-agent: BuiltBotTough
Disallow: /
User-agent: ProPowerBot/2.14
Disallow: /
User-agent: BackDoorBot/1.0
Disallow: /
User-agent: toCrawl/UrlDispatcher
Disallow: /
User-agent: WebEnhancer
Disallow: /
User-agent: TightTwatBot
Disallow: /
User-agent: suzuran
Disallow: /
User-agent: VCI WebViewer VCI WebViewer Win32
Disallow: /
User-agent: VCI
Disallow: /
User-agent: Szukacz/1.4
Disallow: /
User-agent: QueryN Metasearch
Disallow: /
User-agent: Openfind data gathere
Disallow: /
User-agent: Openfind
Disallow: /
User-agent: Xenu's Link Sleuth 1.1c
Disallow: /
User-agent: Xenu's
Disallow: /
User-agent: Zeus
Disallow: /
User-agent: RepoMonkey Bait & Tackle/v1.01
Disallow: /
User-agent: RepoMonkey
Disallow: /
User-agent: Zeus 32297 Webster Pro V2.9 Win32
Disallow: /
User-agent: Webster Pro
Disallow: /
User-agent: EroCrawler
Disallow: /
User-agent: LinkScan/8.1a Unix
Disallow: /
User-agent: Kenjin Spider
Disallow: /
User-agent: Keyword Density/0.9
Disallow: /
User-agent: Cegbfeieh
Disallow: /
User-agent: SurveyBot
Disallow: /
User-agent: duggmirror
Disallow: /
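
A list like the one above can be generated and sanity-checked with Python's standard `urllib.robotparser`. This is only a sketch: `BAD_BOTS` is a small sample of the names above, and `build_robots_txt` and `is_blocked` are hypothetical helper names. It shows what a well-behaved crawler would conclude from the file; it does nothing against bots that ignore robots.txt.

```python
from urllib.robotparser import RobotFileParser

# A small sample of names from the list above, not the full list.
BAD_BOTS = ["EmailCollector", "TurnitinBot", "Wget"]


def build_robots_txt(bot_names):
    """Build one 'User-agent / Disallow: /' record per bot name."""
    lines = []
    for name in bot_names:
        lines.append(f"User-agent: {name}")
        lines.append("Disallow: /")
        lines.append("")  # blank line between records
    return "\n".join(lines)


def is_blocked(robots_txt: str, user_agent: str, url: str = "/") -> bool:
    """Return True if a compliant crawler with this user agent must not fetch url."""
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return not parser.can_fetch(user_agent, url)


robots_txt = build_robots_txt(BAD_BOTS)
```

With this sample, `is_blocked(robots_txt, "Wget/1.6")` is true while an unlisted agent such as `"Googlebot"` is still allowed.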

You should also check the error log in your cPanel to see if any bots are trying to get to places they shouldn't. If any entries in the error log look like this:

[Tue Apr 05 20:24:48 2011] [error] [client 62.149.231.222] File does not exist: /home/[your account]/public_html/[domain_name]/websql
[Tue Apr 05 20:24:47 2011] [error] [client 62.149.231.222] File does not exist: /home/[your account]/public_html/[domain_name]/admin
[Tue Apr 05 20:24:46 2011] [error] [client 62.149.231.222] File does not exist: /home/[your account]/public_html/[domain_name]/dbadmin
[Tue Apr 05 20:24:44 2011] [error] [client 62.149.231.222] File does not exist: /home/[your account]/public_html/[domain_name]/lists


they should be blocked by IP.
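
Scanning the log by hand gets tedious, so here is a rough sketch of how the check could be automated. It assumes error lines in the same format as the excerpt above with IPv4 client addresses. The set `SUSPICIOUS_NAMES` just reuses the four paths from that excerpt, and the threshold of 3 is an arbitrary assumption, not a recommendation.

```python
import re
from collections import Counter

# Matches lines shaped like the excerpt above (IPv4 clients only; an assumption).
PROBE_PATTERN = re.compile(
    r"\[client (?P<ip>[0-9.]+)\] File does not exist: (?P<path>.+)"
)

# Path names taken from the log excerpt above; extend as needed.
SUSPICIOUS_NAMES = {"websql", "admin", "dbadmin", "lists"}


def count_probes(log_lines):
    """Count, per client IP, requests for missing paths with suspicious names."""
    hits = Counter()
    for line in log_lines:
        match = PROBE_PATTERN.search(line)
        if match is None:
            continue
        path = match.group("path").strip().rstrip("/")
        last_segment = path.rsplit("/", 1)[-1].lower()
        if last_segment in SUSPICIOUS_NAMES:
            hits[match.group("ip")] += 1
    return hits


def ips_to_review(log_lines, threshold=3):
    """Return IPs with at least `threshold` suspicious probes, sorted."""
    counts = count_probes(log_lines)
    return sorted(ip for ip, n in counts.items() if n >= threshold)
```

The result is a list of candidates to look at before adding them to the IP Deny Manager, not an automatic ban list.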
 
Re: How you deal with preventing bots from spamming your for

I definitely recommend using KeyCaptcha. 😎
I used to get bots even with Google Recaptcha.
But KeyCaptcha has stopped them registering.
For me anyway. You could give it a try. 😉
 
Re: How you deal with preventing bots from spamming your for

Q&A has always worked for me, not a single spam bot has gotten through. 🙂
 
Re: How you deal with preventing bots from spamming your for

Kiiu said:
Q&A has always worked for me, not a single spam bot has gotten through. 🙂

I've had loads with Q&A. More so than with Recaptcha.
But I've never had a single bot sign up with KeyCaptcha.
 
Re: How you deal with preventing bots from spamming your for

Quiver said:
Kiiu said:
Q&A has always worked for me, not a single spam bot has gotten through. 🙂
I've had loads with Q&A. More so than with Recaptcha.
But I've never had a single bot sign up with KeyCaptcha.

Q&A entirely depends on what type of question you have. Some can easily be bypassed, whilst others cannot.

KeyCaptcha may be good at stopping spammers, but it also annoys me. I usually don't go through the bother of completing the KeyCaptcha unless the site seems interesting.
 
Re: How you deal with preventing bots from spamming your for

Jack~Rouse said:
[robots.txt bot list snipped]

Thanks for this.
 
Re: How you deal with preventing bots from spamming your for

I have found that the Sortables CAPTCHA plugin works pretty well. I use a system created by the creator of prophpBB that has stopped 100% of the spam registrations on my phpBB3 host.
 
Re: How you deal with preventing bots from spamming your for

I found this on another site, which gives details of more bad bots. There are some surprises amongst them:
http://www.kloth.net/internet/badbots.php

Luckily they all have IPs, so listing them in the IP Deny Manager should be straightforward, if a tad laborious.

The one bot you should deny is Majestic-12. It purports to be a search bot, but is in fact a content-harvesting bot. This bot can seriously damage your PR.
It has too many IPs to list here, but there is a full list of the known IPs it uses here:
http://blog.bannasties.com/2012/07/ips- ... -mj12-bot/
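
If you keep your own deny list alongside the one in cPanel, checking an address against it can be sketched with Python's standard `ipaddress` module. The entries in `DENY_LIST` are placeholder documentation ranges, not real bot IPs, and `parse_deny_list` and `is_denied` are hypothetical helper names.

```python
import ipaddress

# Placeholder entries from reserved documentation ranges; NOT real bot IPs.
DENY_LIST = ["192.0.2.0/24", "203.0.113.7"]


def parse_deny_list(entries):
    """Turn single IPs and CIDR ranges into network objects."""
    return [ipaddress.ip_network(entry, strict=False) for entry in entries]


def is_denied(client_ip: str, networks) -> bool:
    """Return True if client_ip falls inside any denied network."""
    address = ipaddress.ip_address(client_ip)
    return any(address in network for network in networks)


networks = parse_deny_list(DENY_LIST)
```

Using ranges rather than single addresses keeps the list shorter when a bot operates from a whole block of IPs.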
 
Re: How you deal with preventing bots from spamming your for

Somehow they are getting around my Q&A. I can't really figure out how, given that the answer isn't Google-searchable.

I likewise don't know what made my website just 'click'. Maybe it's just because I've pushed up the rankings in search for a few keywords.
 
Re: How you deal with preventing bots from spamming your for

InRomoWeTrust said:
Somehow they are getting around my Q&A. I can't really figure out how given that it isn't google searchable.

I likewise don't know what made my website just 'click'. Maybe it's just because I've pushed up the rankings in search for a few keywords.

If your question is very simple, then bots can actually search Google for the answer. The best kind of question is one that requires the bot to count, so something like "What is the third letter in the word 'Welcome'?" is far better than "Who is President of the USA?"
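
A counting question like that can be generated per registration rather than hard-coded. The sketch below is only an illustration: `make_question` and `check_answer` are hypothetical names, and how the question is shown and the expected answer stored depends on your forum software.

```python
import random

ORDINALS = ["first", "second", "third", "fourth", "fifth", "sixth", "seventh"]


def make_question(word: str, position=None):
    """Return (question, expected_answer) asking for one letter of `word`.

    `position` is zero-based; if omitted, a random position is chosen.
    """
    if not word:
        raise ValueError("word must not be empty")
    limit = min(len(word), len(ORDINALS))
    if position is None:
        position = random.randrange(limit)
    if not 0 <= position < limit:
        raise ValueError("position out of range")
    question = f'What is the {ORDINALS[position]} letter in the word "{word}"?'
    return question, word[position]


def check_answer(given: str, expected: str) -> bool:
    """Compare answers ignoring case and surrounding whitespace."""
    return given.strip().lower() == expected.lower()
```

Because the position changes between registrations, a bot cannot reuse one stored answer, which is the point of making it count.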
 