5ubliminal@twitter

Footprints And Fingerprints | Wiping Them Out With Pineapple Juice : 5ubliminal's TellinYa

<a href="http://www.tellinya.com/art2/328/">Footprints And Fingerprints | Wiping Them Out With Pineapple Juice : 5ubliminal's TellinYa</a>
5ubliminal's YAMS
UPDATE: Take a few minutes and read this paper. It somehow reinforces what I said here about interlinking and such practices. It's a good read:)
Footprints and Fingerprints - What's those?

Footprints, SEO related, are repetitive sections of pages which lie in the visual side of things or in the code. For example: Powered By Platform vX.X is a footprint. These sections allow both search engines and competitors to locate your websites and cause you a world full of trouble.

So, everytime you keep a footprint on your site, all your sites that have it can be tracked from it by a skilled researcher. When we talk about source footprints, like affiliate IDs, they virtually can't be tracked by others but here the search engines are the trackers. They get access to your page's source code and can record all these breadcrumbs that could easily give you out.

Footprints are also used to track vulnerable versions of web applications. So getting rid of them quickly from CMS like WordPress, Droopal:) and so on could help you a lot and actually protect you.

The SEO F.E.A.R. - For All The Wrong Reasons

I have some SEO friends obssesed with footprints but for all the wrong reasons. They are actually trying to hide their ownership of their sites from search engines. One day I sat them down and I told them I'll help them strip away all their footprints. So we get on with the work …

The following pretty much regard the link farmers too so pay attention.
Common fingerprints!
  • Hosting: IP Classes, Name-Servers!
  • Domain: TLD, Whois Info!
  • On-Page: SiteWides, Design, Copyright / Version Notices, Comment Form Fields, Any Repetitive Text.
  • Source Code: Comments, CSS Styles.
  • Linking: Interlinking.
  • Linkbuilding: Spamming Angry People (Forum Mods).
  • Links: Affilite IDs.
  • … etc.
Getting rid of your fingerprints!

After analyzing their sites a bit I made up a list and we got on to work. Their main obsession was hiding from search engines so, with that in mind I started telling them what to do:

  • Make sure you Variate the Themes used on your sites. Using a linkdomain you don't want others to clearly see that all those pages linking are yours also. The visual of the page is first thing noticed. You can see when a guy uses the same WordPres theme over and over again in his linkfarm. If possible use your own publishing platforms. This will protect you even more. Everybody is looking for blogs nowadays.
  • Avoid Blog-Roll Links. What? Why? Lazy people use these a lot. These are sitewides. Sitewides are usually devalued and that's how it should be. Only few are counted especially as they have the same anchor text on each page. So, by avoiding blog-rolls, you get more link juice for your content embedded links.
  • Don't Interlink your Link Farm. This will draw a lot of attention especially from search engines. You linkfarm needs to be as widespread as possible and not related one to each other in anyway.
  • Don't Outlink Only to Your Sites. Each post needs at least a 3-1 ratio. 3 good outgoing links to 1 of your sites. Also CSS blend your outgoing links and make the valid ones as obvious (blue) as they get. This will fool many non-professionals.
  • Disable Pingback! Blog and ping might get you indexed but you wouldn't imagine how monitored those services are. A blog has a certain posting schedule. You can't get tens or even more posts per day. And that is easiest to track with the Pinging services where you can monitor posting frequency.
  • Stop Accepting Pingbacks! XMLRPC has to go from the HTTP headers. You have to do your best not to look like a blog even if you use a blog platform. Blogs are seen differently by search engines. Period! That's why they have a blog search and XMLRPC headers (X-Pingback) will get you listed there.
  • Remove any Copyright or Platform Version Signature! Keep it vague. Like © 2008. That's it. Anything more can and will be used against you.
  • Avoid .info as Much as Possible! This depends on budget but .info come cheap. Spammers use them a lot. Try to stay away from them. It's for the best. Go for .org which are a bit cheaper but better viewed.
  • Don't Spam Forums! Or other people inhabbited place. Your sites can be connected between them by both competition and forum mods with loads of free time and … reported.
  • Get other links. Don't rely only on your linkfarms. Get links from all over. Most of these will outrank your linkfarm and will appear, if not ahead, mixed with the linkfarm links. This will make life of those tracking your harder.
  • Don't Host All on Same C-Class. Use different hostings. If you use different IPs you might share the nameservers and those are also footprints.
  • Mask Your Affiliate Links! Block them in robots.txt. Even cloak them for search engines that don't follow robots.txt (MSNBOT).
  • … and these would be a starting point.
Now the last step in footprint free existence.

This is the final step to protect yourself against search engines that could track you using footprints. Everybody was quiet. I started laughing and said:

Please remove the Google AdSense Code, Yahoo! Ads Code, Microsoft Ads Code, Amazon Code from your site. Also ditch any other type of ads that places your ID out in the open.

And they all go: What? Exactly! If you are involved in this kind of advertising you can't hide from the search engines. They know everything about you. The whole obsession of footprint free internet existance is just valid if you think of idiot competitors chasing your sites and reporting them. But you can't hide from the search engines. You can only fly below the radar by not going too far!

I might have missed some footprints. Use the comment form and mention your own. We'll discuss them:)

13 Comments Posted By Readers :

Add your comment
#1 nogenius from United States
Posted on Thursday, 10 April, 2008
Good post - what about filenames of files (if you are using a package like YACG or RSSGM), distribution, order, and location of page elements/HTML tags used, scraped content footprints (resulting from a dirty scraping job)?
#2 5ubliminal web
Posted on Thursday, 10 April, 2008
I call this Keyword Poisoning. It's for a future post ;)
PS: Told'ya I missed some.
#3 nogenius from United States
Posted on Thursday, 10 April, 2008
Quick clarification - for the cloaking of affiliate links, do you mean adding a 301 redirect line to your .htaccess file and then adding that line to the robots.txt "Deny" section?
#4 5ubliminal web
Posted on Thursday, 10 April, 2008
Replace link like this:
afflink?affid=10
with:
/link/cryptic_name/ - which redirects to above link.

And block /link/ in robots.txt. By cloaking I mean if msnbot asks for robots.txt blocked material feed him a 404.
#5 SEO Hosting from United States web
Posted on Thursday, 10 April, 2008
What are your thoughts on WHOIS privacy? I've heard it's pointless as Google, an ICANN accredited registrar, can see past any privacy services for WHOIS information on a domain.
#6 5ubliminal web
Posted on Thursday, 10 April, 2008
I don't think (I like not to think) that Google has access to your WHOIS if it's protected (like GoDaddy's private service).
But, as blackhats don't get very fond of domain names, fake WHOIS info can also work, sometimes better then privacy.
But there are many other ways to match sites with owners. You can use nameservers, IP classes and affiliate IDs and accomplish the same without actually needing to know personal specifics. If you use AdSense they also know where you live:)

As I said. If Google's out to get you, there's little you can do. Play nice and fly low.
PS:Seo Hosting:) You could have signed with your name as you might have noticed I don't give anchor text for this very purpose.
PPS: Hosting companies also know more then they should;)
#7 Kyo from Ukraine
Posted on Friday, 11 April, 2008
Isn't the ratio 3 outgoing links to other pages and 1 link to your page too weak? You know, too many link value passed to unrelated sites?
And I think, that everybody is waiting for a promised post about link farms :).
#8 5ubliminal web
Posted on Friday, 11 April, 2008
It's weak but irrelevant anchor text won't give links to others too much value. The 'stronger' you get the more you'll look like a star.

PS: I know I have some posts pending and they are being written right now along with several thousands lines of C++ code for some tools I'm building at the moment … hence the delay. I'm also thinking and rethinking where to draw the line with the details:)
By the end of the weekend I should have the software finished and passed on to my team for exploitation and debugging;) and I'll have more time to write.
#9 Papa Rage from United States
Posted on Friday, 11 April, 2008
Good post.

I always try to avoid adding comments that might sound nit-picky, but I'm going to risk it with this one because it is a common mistake and I think people should understand this.

There's a big difference between footprint and fingerprint. This post is about fingerprints.
A footprint is something else entirely and NOT something that can be used to identify something. It's a measure a certain dimension of a thing's size. For example:
The memory footprint of an application will be important when you code for a small device.
Your carbon footprint is something you measure in metric tons.
The footprint of an oil drilling station could be 2000 acres.
The footprint of a sky scraper is a measure of land that is taken up by it.
#10 5ubliminal web
Posted on Friday, 11 April, 2008
Somehow true but as I said I'm foreigner :) and everyone calls footprints characteristics that can identify and object.
I rather see fingerprints as traces left behind while doing something. Like when you crack a blog:) you leave an IP in the log. That's the fingerprint.

Let's think like this:
In fresh cement you may see a footprint that could tell someone what shoes the one who left it was wearing.
That's a footprint and same analogy goes here. You leave a mark on your site that can be found and can help others find other sites wearing the same mark.

I don't know … I'll think about this an lookup some 'official' explanations/meanings for both:)
#11 DM from Great Britain
Posted on Sunday, 20 April, 2008
Good addition with that article Subliminal. Illustrates a long held view of mine that those who say academia has nothing to contribute to business are fools.
#12 5ubliminal web
Posted on Sunday, 20 April, 2008
Of course they do:) They have free time and use it for research so they must get some results.
I wish I had their free time…

PS: I wouldn't really say they contribute. It's more like sabotage from the BH POV:)
#13 Health Crazy from United States web
Posted on Wednesday, 14 May, 2008
Thanks for the many tipa on what not to do so your website can do a lot better in search engine results.
5ubliminal's TellinYa.com SEM & SEO Blog © 2007 - All rights reserved unless mentioned otherwise .
Rendered On : [Friday, 12 March, 2010 - 06:06:03 GMT]   No Ajax / Flash Used Here
" Footprints And Fingerprints | Wiping Them Out With Pineapple Juice : 5ubliminal's TellinYa "
Close
Tellinya.com is relocating to blog.5ubliminal.com. This blog is no longer maintained and comments are no longer accepted / answered.