Quick Link Prospecting with Scraper Extension
1 year ago
Using XPath to select content in an XML document to scrape for SEO is nothing new, but SEOs have traditionally done it within Google Docs for simplicity and ease of use. That approach has some limitations, though, including a limit of 50 ImportXML calls per spreadsheet and the fact that it’s not done in-line while browsing. I’ve been playing around with a Google Chrome extension called Scraper, which allows you to scrape content in-line while browsing.
Let’s walk through some examples of how this is awesome.
Prospecting Guest Posts
Let’s say we’re quickly trying to find guest post opportunities for a food-related site.
To do this, I search for food inurl:”write for us” and show 100 results per page (you have to turn off Google Instant for this).
Step 1 – Advanced Search Query
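If you run a lot of these searches, it can help to build the query strings in code. Here is a minimal sketch in Python; the helper names `build_query` and `search_url` are my own, and the URL only sets the standard `q` parameter (results per page is left to your Google settings).

```python
from urllib.parse import urlencode


def build_query(keywords, **operators):
    """Combine plain keywords with search operators such as inurl: or intitle:."""
    parts = [keywords]
    for name, value in operators.items():
        # Quote multi-word values so the operator applies to the whole phrase.
        if " " in value:
            value = f'"{value}"'
        parts.append(f"{name}:{value}")
    return " ".join(parts)


def search_url(query):
    """Return a Google search URL for the query (hypothetical helper, q only)."""
    return "https://www.google.com/search?" + urlencode({"q": query})


query = build_query("food", inurl="write for us")
print(query)              # food inurl:"write for us"
print(search_url(query))
```

Swap in a different keyword or operator to reuse it for the other searches in this post.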

Step 2 – Select and Right-Click a Listing
Select “Scrape Similar” in the menu, and the extension will find the XPath to the selected content, then extract it along with the repeating elements similar to it.

Step 3 – View Output
At this stage, you can edit the XPath and remove any extracted fields you don’t need. You can also define presets for frequently used XPath expressions. The extension does a fair job of selecting the content correctly, but depending on the markup of the page, you may need to edit the XPath to select the right text. In this example, it worked perfectly without correction.
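I don’t know how the extension derives its XPath internally, but most manual edits come down to one idea: drop the position index from the repeating step so the path matches every sibling instead of one. A rough sketch of that idea, with a hypothetical helper name and a made-up path:

```python
import re


def generalize_xpath(xpath, step_name):
    """Drop the positional index from every `step_name[n]` step.

    //ol/li[4]/h3/a with step_name="li" becomes //ol/li/h3/a, which matches
    the same element inside every list item instead of only the fourth.
    """
    pattern = rf"(?<=/){re.escape(step_name)}\[\d+\]"
    return re.sub(pattern, step_name, xpath)


selected = "//div[2]/ol/li[4]/h3/a"  # hypothetical path to one clicked result
print(generalize_xpath(selected, "li"))  # //div[2]/ol/li/h3/a
```

Indexes on other steps, such as `div[2]`, are left alone because they usually pin down the right container.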

Step 4 – Export to Google Docs, FTW
From here, the extension sends the data directly into Google Docs, where you can mash it up with the SEOmoz API or other data.
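If you would rather keep a local copy, the same rows can be written out as CSV and imported into any spreadsheet. This is not what the extension does; it is just a small sketch using Python’s standard `csv` module, with made-up rows and a hypothetical `rows_to_csv` helper.

```python
import csv
import io


def rows_to_csv(rows, header):
    """Serialize scraped rows as CSV text that a spreadsheet can import."""
    buffer = io.StringIO()
    writer = csv.writer(buffer, lineterminator="\n")
    writer.writerow(header)
    writer.writerows(rows)
    return buffer.getvalue()


# Hypothetical scraped output: (title, URL) pairs.
rows = [
    ("Write for Us", "http://example.com/write-for-us"),
    ("Guest Posts, Tips", "http://example.org/guest"),
]
print(rows_to_csv(rows, ["Title", "URL"]))
```

The `csv` module quotes fields that contain commas, so titles like the second one survive the import intact.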

Example Output:
7 More Examples
#1 An Alltop Scraper
You can scrape the curated blog lists at Alltop, such as this huge list of marketing blogs.
It looks like this, in a matter of seconds.
#2 Scrape WordPress Blog Post Comments
Let’s say I wanted to quickly contact everyone who left a comment on a post on Outspoken Media’s blog, such as my link building personas post.
I had to make a quick edit to the XPath so it didn’t select the comment anchor URL.
//div[2]/dl/dt/span/a[@class='url']
Run this on your guest posts or the comments being left on your competitor’s site.
(You’ll likely have to customize it per blog if the extension doesn’t get it automatically.)
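To see why the `[@class='url']` filter matters, here is a small sketch that runs the same path against simplified comment markup. The markup is made up for illustration (real themes will differ), and I’m using Python’s standard `xml.etree.ElementTree`, which supports only a subset of XPath and needs the leading `//` written as `.//`.

```python
import xml.etree.ElementTree as ET

# Hypothetical, simplified comment markup; real blog themes will differ.
SAMPLE = """
<html><body>
  <div id="header"/>
  <div id="comments">
    <dl>
      <dt><span><a class="url" href="http://alice.example">Alice</a></span>
          <a class="permalink" href="#comment-1">#</a></dt>
      <dd>Great post.</dd>
      <dt><span><a class="url" href="http://bob.example">Bob</a></span>
          <a class="permalink" href="#comment-2">#</a></dt>
      <dd>Thanks for sharing.</dd>
    </dl>
  </div>
</body></html>
"""


def commenter_links(markup):
    """Return (name, site URL) for each commenter, skipping permalink anchors."""
    root = ET.fromstring(markup.strip())
    links = root.findall(".//div[2]/dl/dt/span/a[@class='url']")
    return [(a.text, a.get("href")) for a in links]


print(commenter_links(SAMPLE))
```

Only the commenter links come back; the `#comment-…` permalink anchors are left out because they sit outside the `span` and carry a different class.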
#3 Blog Directory Scraper
Need a quick list of 102 gaming blogs? Just head over to the BOTW Blog Directory.
A little edit to the XPath: //div[2]/ul/li/a[1]/@href
And in a few seconds, you have a spreadsheet list of URLs for 102 gaming blogs.
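The two pieces doing the work in that path are `a[1]`, which keeps only the first link in each list item, and `/@href`, which returns the URL instead of the link text. A quick sketch against made-up directory markup; Python’s `ElementTree` can’t select attributes inside the path, so the `href` is read from each matched element instead.

```python
import xml.etree.ElementTree as ET

# Hypothetical directory markup: each listing has a blog link and a review link.
SAMPLE = """<html><body>
  <div id="nav"/>
  <div id="listings">
    <ul>
      <li><a href="http://blog-one.example/">Blog One</a> <a href="/review/1">Review</a></li>
      <li><a href="http://blog-two.example/">Blog Two</a> <a href="/review/2">Review</a></li>
    </ul>
  </div>
</body></html>"""


def first_link_hrefs(markup):
    """Rough equivalent of //div[2]/ul/li/a[1]/@href using ElementTree."""
    root = ET.fromstring(markup)
    return [a.get("href") for a in root.findall(".//div[2]/ul/li/a[1]")]


print(first_link_hrefs(SAMPLE))
```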
#4 Link Placement and Buys
Similar to the guest post search, but try these in Google and scrape.
inurl:edu alumni discount code
inurl:sponsor intitle:sponsors seattle

#5 Followerwonk Scraper
A quick search for zombie on Followerwonk.
A little XPath: //div[3]/table/tbody/tr/td/a
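One thing to watch if you reuse a path like this outside the browser: browsers add a `tbody` element to tables in the DOM even when the page source doesn’t have one, so the same path can come back empty against the raw HTML. A small sketch with made-up markup and a hypothetical `handles` helper:

```python
import xml.etree.ElementTree as ET

# Hypothetical raw markup as served, with no <tbody> in the source.
RAW = """<html><body><div/><div/><div>
  <table>
    <tr><td><a href="/user/one">@one</a></td></tr>
    <tr><td><a href="/user/two">@two</a></td></tr>
  </table>
</div></body></html>"""


def handles(markup, path):
    """Return the link text of every element matched by the path."""
    root = ET.fromstring(markup)
    return [a.text for a in root.findall(path)]


strict = handles(RAW, ".//div[3]/table/tbody/tr/td/a")  # path as copied from the browser
loose = handles(RAW, ".//div[3]/table//tr/td/a")        # tolerates a missing tbody
print(strict, loose)  # [] ['@one', '@two']
```

Inside the extension this isn’t a problem, since it works on the page as the browser has already parsed it.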
And…
#6 Tumblr Submit Scraper
Looking to launch content on Tumblr?
site:tumblr.com inurl:submit zombie OR zombies
#7 A Google Plus Profile Scraper
Looking for food bloggers on Google Plus?
Right Tool, Right Job
This doesn’t replace the benefit of doing some XPath within Google Docs, since you can do scripting and iterate on imports. However, I really like this tool so far. It can do a lot very easily and very quickly.
It does have some bugs, and I’ve had to restart my browser a few times when it stopped working.
If you have any other ideas on how it could be used, be sure to drop a comment below.