How to Find High Authority Expired Domains Using Scrapebox · How to Find High Authority Expired...

22
How to Find High Authority Expired Domains Using Scrapebox High authority expired domains are still a powerful tool to have in your SEO arsenal and you‟re about to learn one way to find them using Scrapebox. Scrapebox has been around for 6 years now and is the swiss army knife of SEO‟s worldwide. Since its inception, it‟s constantly been updated and can be used for a huge array of online marketing related tasks. I paid $47 for it back in 2010 and I can honestly say I‟ve never got as much use and value from a piece of software throughout my 10 years marketing online. In this step-by-step guide I‟ll take you through one of the ways to find high authority expired domains using Scrapebox. Ok, let‟s get started…. 1. Enter primary seed keyword into Google First we need to create a seed list for Scrapebox.

Transcript of How to Find High Authority Expired Domains Using Scrapebox · How to Find High Authority Expired...

Page 1: How to Find High Authority Expired Domains Using Scrapebox · How to Find High Authority Expired Domains Using Scrapebox High authority expired domains are still a powerful tool to

How to Find High Authority Expired Domains

Using Scrapebox

High authority expired domains are still a powerful tool

to have in your SEO arsenal and you‟re about to learn one

way to find them using Scrapebox.

Scrapebox has been around for 6 years now and is the swiss army knife of SEO‟s worldwide. Since its

inception, it‟s constantly been updated and can be used for a huge array of online marketing related

tasks. I paid $47 for it back in 2010 and I can honestly say I‟ve never got as much use and value from a

piece of software throughout my 10 years marketing online.

In this step-by-step guide I‟ll take you through one of the ways to find high authority expired domains

using Scrapebox.

Ok, let‟s get started….

1. Enter primary seed keyword into Google

First we need to create a seed list for Scrapebox.

Page 2: How to Find High Authority Expired Domains Using Scrapebox · How to Find High Authority Expired Domains Using Scrapebox High authority expired domains are still a powerful tool to

Enter an industry related seed keyword into your country‟s Google.

2. Pick a website to pull keywords from

Pick a URL from the organic listings

Note: best to pick an organic listing and not an Adwords advertiser

For this example, we‟re picking creativespark.co.uk

3. Open Adwords Keyword Planner

Page 3: How to Find High Authority Expired Domains Using Scrapebox · How to Find High Authority Expired Domains Using Scrapebox High authority expired domains are still a powerful tool to

NOTE:- if you‟ve not got a Google Adwords account, you can use any keyword tool for this step. A good

free one to try for ideas is ubersuggest. If you decide to use another keyword tool then jump down to step

7 and continue.

Login to Adwords at http://adwords.google.com and open up the Keyword Planner

4. Search for new keyword and group ideas

Click on „search for new keyword and ad group ideas‟

Page 4: How to Find High Authority Expired Domains Using Scrapebox · How to Find High Authority Expired Domains Using Scrapebox High authority expired domains are still a powerful tool to

4b. Search for new keyword and ad group ideas

Paste the domain name (found in step 2) into the Your landing page text box

Note: If you searched google.co.uk in step 1, make sure the targeting is set to United Kingdom. If you

searched google.com then set the targeting to United States etc etc

Click Get ideas

Page 5: How to Find High Authority Expired Domains Using Scrapebox · How to Find High Authority Expired Domains Using Scrapebox High authority expired domains are still a powerful tool to

5. Download the list

Click the Download button and down the csv file to your local drive.

6. Open CSV and copy the keywords

Open the csv file and select all the keywords from the Keyword column

Copy the keywords from the keyword column.

TIP: hold down CTRL & SHIFT and press the down arrow key to select all the keywords in the column

Page 6: How to Find High Authority Expired Domains Using Scrapebox · How to Find High Authority Expired Domains Using Scrapebox High authority expired domains are still a powerful tool to

7. Prepare the Master Seed list for Scrapebox

Now we‟re going to add some extra data in each line in your keyword list. We‟ll be adding quotes, the

term links and also a Julian date (I‟ll explain this later).

First we are going to add quotes to the beginning of the line.

To do this first open Notepad ++ and paste the keywords into a new window

Place your cursor in the very top left hand side before the first character and open the find function

using CTRL F

Click on the Replace tab

Enter ^ into the Find what box

Enter ” into the Replace with

Make sure Regular expression is selected under search mode

Click Replace All

Page 7: How to Find High Authority Expired Domains Using Scrapebox · How to Find High Authority Expired Domains Using Scrapebox High authority expired domains are still a powerful tool to

7a. Prepare the Master Seed list for Scrapebox

Now we‟re going to add quotes to the end of the link.

Replace the ^ in the „Find what‟ with $

Click Replace all

Save the file as Seed List <domainname>. So in this example the file name would be: „Seed List creative

spark.txt‟

7b. Set the Custom Footprint

NOTE:- This step is only necessary if you want to scrape for older domains. If you decide to leave this

datarange step out you can still find good powerful expired domains, they just light not be as old.

TIP: you can use inurl:links here instead of just “links” which will save you work in Step 14. However,

you will need to reduce the amount of threads in the Harvester to around 1 to 3 else you will get a lot of

errors.

Ok, so first open up Scrapebox.

Page 8: How to Find High Authority Expired Domains Using Scrapebox · How to Find High Authority Expired Domains Using Scrapebox High authority expired domains are still a powerful tool to

Now in this particular strategy, we‟re going to look for old websites that have links or resource pages. To

do this first we‟re going to scrape URLs have the word “links” in them. A lot of sites (especially older

ones) used to have these types of pages and they‟re very useful for finding aged expired domains.

So to find these aged domains we‟re going to add a date range to our search query.

In this example, we are going to use 2000 to 2010.

For the Google search engine, the date range has to be added using Julian Date Format. To help you work

out Julian date for your required range, use this Julian date conversion tool.

Once you have your date range sorted, enter the “links” daterange into the Custom footprint box and

make sure „Custom Footprint‟ option is selected. So in this example it would be:

“links” daterange:2451552-2455198

8. Paste keywords into Scrapebox

Copy the first 100 lines from the Seed List text file and paste them into the Keywords window.

NOTE:- you don‟t have to do 100 lines, you could add as many as you like, I just find the scraping process

starts to slow down a lot when you get past around 50 due to the custom footprint and only using 20

proxies.

Make sure Use Proxies is selected.

Private Proxies are highly recommended to get the best results with less hassle.

Page 9: How to Find High Authority Expired Domains Using Scrapebox · How to Find High Authority Expired Domains Using Scrapebox High authority expired domains are still a powerful tool to

TIP: test using IP authentication with your private proxies. I‟ve found out some of them don‟t work as

well if user/password authentication is used.

Click Start Harvesting

9. Start Harvesting

Make sure just Google is selected in the available search engines

Click Start

Now sit back and let Scrapebox do its stuff.

10. Harvest Complete

Once the harvest is complete, you will see a window like the one above.

Page 10: How to Find High Authority Expired Domains Using Scrapebox · How to Find High Authority Expired Domains Using Scrapebox High authority expired domains are still a powerful tool to

Click Exit to Main

11. Remove duplicates

Click on Remove/Filter and select Remove Duplicate URLs

12. Copy URLs to the clipboard

Right click inside the URL window and Select Copy All URLs to Clipboard

Page 11: How to Find High Authority Expired Domains Using Scrapebox · How to Find High Authority Expired Domains Using Scrapebox High authority expired domains are still a powerful tool to

13. Paste them into the spreadsheet

From this section on I use a spreadsheet to organise and sort through all the URLs which you can

download below…

Open up the spreadsheet

Make sure the MASTER TAB is selected and paste the URLs into the URL column

14. Filter URLs that contain the word links

Page 12: How to Find High Authority Expired Domains Using Scrapebox · How to Find High Authority Expired Domains Using Scrapebox High authority expired domains are still a powerful tool to

In the next few stages we are going to tidy up this list before we scrape the „link/resource‟ pages for

outbound links.

First we need to create and apply a filter where the URL contains the word – links

15. Paste results into „contains links‟ tab

Next, paste the URLs into the contains links TAB URL column

16. Remove „tag‟ URLs

Page 13: How to Find High Authority Expired Domains Using Scrapebox · How to Find High Authority Expired Domains Using Scrapebox High authority expired domains are still a powerful tool to

Remove any URLs that have the word tag by clicking on the filter button and create a filter where the

URL contains the word – tag.

Important: Check the results to make sure they are /tag/ URLs and it doesn‟t just have the word tag in the

domain. Remove these rows by selecting them and hitting the Delete key

Clear the filter and then sort the Column A-Z using the filter button again. This will make the next step

quicker.

17. Remove any URLs that are not link pages

Remove any URLs that are not link or resource pages.

i.e. link pages will look like /links/ OR /useful-links/ OR /resource-links etc etc

In the example above you can see this page is about adding links to a sidebar and not a link or resources

page, so this would be removed.

Delete all these kinds of non „link‟ pages by right clicking on the row number and selecting delete

Note:- This is an optional step, I like to do it so that I just get the link/resource pages

If you‟re unsure, click the URL and check out the page in a browser to see if its a „links‟ page or not.

Also remove URLs for pdf‟s, facebook.com, linkedin.com, econsultancy.com which can be done easily

using a filter.

Page 14: How to Find High Authority Expired Domains Using Scrapebox · How to Find High Authority Expired Domains Using Scrapebox High authority expired domains are still a powerful tool to

18. Paste into Scrapebox

Once you have cleaned the URL list copy the whole URL column by holding down SHIFT & CTRL and

press the down arrow key. This will select all rows in that column with data in

Paste these into a text file and then copy and paste them into Scrapebox. I find doing this this extra step

into a text file saves issues with the next step

Remove duplicate URLs

19. Link Extractor

Open the Link Extractor by going to Addons and selecting Scrapebox Link Extractor 64bit

Page 15: How to Find High Authority Expired Domains Using Scrapebox · How to Find High Authority Expired Domains Using Scrapebox High authority expired domains are still a powerful tool to

Click the Load button and select Load URL list from Scrapebox Harvester

Make sure the External option is selected

Click Start

Once its finished click Show save folder

Open up the output text file in a text editor

Copy all the URLs from the link extract file and “Paste & replace” the them into SB harvester

20. Trim URLs

Click Trim and then Trim to Root

Also under the Trim menu click Remove Subdomain from URLs

Remove Duplicate domains

Page 16: How to Find High Authority Expired Domains Using Scrapebox · How to Find High Authority Expired Domains Using Scrapebox High authority expired domains are still a powerful tool to

21. Remove all non relevant domains

This section will depend on which type of domains you are looking for. So for example, if you want web

2.0s then you would leave these in the list. If you want .in domain then you would leave these in the

list…and so on and so on.

I had a piece of software coded that removes all non-relevant domains in a flash. It can be done manually

but it just takes a lot more time. If you want a copy of the software hit me up.

Ok, now Right click on the URL Harvester window and Copy All URLs to Clipboard

Open up the spreadsheet and select the „Cleaned‟ TAB

Paste the URLs into the URL column

It‟s important to note, when removing multiple lines within a filter select them and use the delete key DO

NOT remove via the method used above in step #17

TIP: after each removal below, clear filter from URL and sort the column A-Z again

Use a filter to remove any domains that contain: javascript, any pharma type keywords, gambling,

blogspot, http://blog. directory.

Here‟s examples of some of the other domains I remove from the list, domains that end in; .weebly.com

.wordpress.com hop.clickbank.net .tumblr.com .webgarden.com .livejournal.com .webs.com .edu

Page 17: How to Find High Authority Expired Domains Using Scrapebox · How to Find High Authority Expired Domains Using Scrapebox High authority expired domains are still a powerful tool to

.yolasite.com .moonfruit.com .bravesites.com .webnode.com .web-gratis.net .tripod.com typepad.com

blogs.com rinoweb.com jigsy.com google.com squarespace.com hubspot.com .forrester.com

NOTE: the sub domains above can be stored and checked separately to create web 2.0 lists if you like.

Also check through the list and remove any that are:

– the wrong syntax.

– common domains that you know are not going to be free ie. facebook.com, linkedin.com

searchengineland.com etc etc

– just IP addresses

22a. Check domain availability

Scrapebox has its own domain availability checker, which has come on massively since I first published

this post last year. I‟ve been able to check tens of thousands of domains with it in one batch so this is all I

use now. .

(If you don‟t trust Scrapebox checker then you can use other bulk checkers like Dynadot‟s which allows

you to check 1000 domains at a time. It will only do about 5 batches though before it hits you with an over

usage message and makes you wait about 30 minutes.)

Copy all the URLs from the „Cleaned‟ TAB into a text file and then copy/paste into Scrapebox

Click Grab / Check and select Check Unregistered Domains

Page 18: How to Find High Authority Expired Domains Using Scrapebox · How to Find High Authority Expired Domains Using Scrapebox High authority expired domains are still a powerful tool to

22b. Check domain availability

Once you‟re happy, click Start

22c. Check domain availability

Page 19: How to Find High Authority Expired Domains Using Scrapebox · How to Find High Authority Expired Domains Using Scrapebox High authority expired domains are still a powerful tool to

TIP: Sometimes SB will give result in 0/0 for Pass 2 (WHOIS) If you‟ve checked a lot and this is the case,

close the availability checker, re-open and and try it again.

When its finished, click Export and then select Export Available Domains. I also save the unavailable

domains to so these can be double checked.

Open up the excel Worksheet Template and select the Availability Check TAB

Right click in the top left hand cell and select Paste Special. Then choose the paste as text option.

Tidy the first row up by using a simple cut/paste into the next cell along. Delete the first column

23a. Check Metrics in Majestic

For this section you will need a paid version of Majestic.

If you don‟t have this you can try the free version of SEO Profiler which will at least allow you to check

out the quality of the backlinks.

Page 20: How to Find High Authority Expired Domains Using Scrapebox · How to Find High Authority Expired Domains Using Scrapebox High authority expired domains are still a powerful tool to

Note: you can use the copy and paste or just create a file with all the Available domains in and upload. For

this example, we will be using the copy/paste 150 method.

Login to Majestic and go to Tools | Link Map Tools | Bulk Backlinks

Paste the first 150 rows into the window

Sort results by: referring domains ( you don‟t have to do this as the data can be sorted on the next window)

23b. Check Metrics in Majestic

Now everyone has their own thoughts on how to check out the strength of a domain. This could be a whole

blog post on its own, so for the time being I‟ll just cover the main points.

So the boundaries you set for lowest Trust Flow, Citation Flow etc is a personal preference I think. If

you‟re after a guideline then I normally look for domains that have >10 referring domains, TF 15+ and

where TF/CF is >.75

So if you look at the image above you‟ll see there are a few that would warrant further investigation. I

haven‟t registered these so check them out and if they‟re good and you‟re quick you could pick them

up

Make sure you check the TF for the root domain sub domain and full path on each domain that looks good.

You can do this quickly by hovering your mouse over the little cog icon, Right Click on Site Explorer and

open it up in a new browser tab

Page 21: How to Find High Authority Expired Domains Using Scrapebox · How to Find High Authority Expired Domains Using Scrapebox High authority expired domains are still a powerful tool to

Make sure you check out the backlinks. Do they still exist? Are they spammy? Are

they contextual?

Quite often low TF domains can still have really good contextual backlinks so Trust Flow should NOT be

your ultimate guide. Make sure you ALWAYS check out the quality of the backlinks.

A lot of people won‟t do this because of the time it takes, don‟t be one of them else you can end up with

bad domains.

23c. Check Metrics in Majestic

Dig down a little deeper into the backlink profile for each domain. Scrolling down on the Site Explorer

will show you a quick overview of the Anchor text so you can make sure its not over optimised.

Also click the Backlinks tab and physically look at some of their backlinks to make sure they‟re not pure

spam.

Just to be double sure, backlinks can also be checked in ahrefs.

Page 22: How to Find High Authority Expired Domains Using Scrapebox · How to Find High Authority Expired Domains Using Scrapebox High authority expired domains are still a powerful tool to

25. Check Web Archive

Once you have found a domain that has good metrics and authority head on over

tohttp://web.archive.org/ and check what the site looked like in the past.

You‟re looking for a site that was a legitimate business. Personally, I stay away from anything that has

been chinese or has sold dodgy fake gear.

Once you‟ve found a domain with solid metrics head on over to your favourite registrar and get it

registered.

That‟s it for this first method which is just one way that I use Scrapebox to find high authority, expired

domains.