Combatting Crawl Bloat & Pruning Your Content Effectively

Charlie Whitworth, SEO Director, Banc, @WhitworthSEO, http://www.slideshare.net/whitworthseo

Transcript of Combatting Crawl Bloat & Pruning Your Content Effectively

Page 1: Combatting Crawl Bloat & Pruning Your Content Effectively

Charlie Whitworth, SEO Director

@WhitworthSEO http://www.slideshare.net/whitworthseo

Banc

Page 2: Combatting Crawl Bloat & Pruning Your Content Effectively

what will we cover?

bancmedia.com

Crawl Bloat & Pruning

Page 3: Combatting Crawl Bloat & Pruning Your Content Effectively

beat the bloat

Crawl Bloat

Making search engines work too hard and not taking full advantage of your crawl budget

Page 4: Combatting Crawl Bloat & Pruning Your Content Effectively

crawl definitions

Crawl Rate & Crawl Demand = Crawl Budget

Page 5: Combatting Crawl Bloat & Pruning Your Content Effectively

crawl rate (limit): maximum fetching rate

• Search Console Limits
• Website/Server Health

Page 6: Combatting Crawl Bloat & Pruning Your Content Effectively

crawl demand

How much does Google want to see your URLs?

• Popularity (Traffic)
• Staleness

Page 7: Combatting Crawl Bloat & Pruning Your Content Effectively

Crawl budget – determined by rate & demand

What Google can & wants to crawl

Low-value URLs that waste it:

• Empty pages
• No value added
• Duplicate content
• Soft 404
• Spam
• Hacked content

Page 8: Combatting Crawl Bloat & Pruning Your Content Effectively

crawl budget

“Prioritizing what to crawl, when, and how much resource the server hosting the site can allocate to crawling is more important for bigger sites, or those that auto-generate pages based on URL parameters”

Gary Illyes

Page 9: Combatting Crawl Bloat & Pruning Your Content Effectively

how to identify crawl bloat

Page 10: Combatting Crawl Bloat & Pruning Your Content Effectively

how to identify crawl bloat
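
One way to spot crawl bloat is to look at what Googlebot actually requests in your server logs. The sketch below is only an illustration: it assumes access logs in the common "combined" format, treats any user agent containing "Googlebot" as Googlebot (a real audit would verify the bot, for example by reverse DNS), and uses made-up log lines and a hypothetical function name.

```python
import re
from collections import Counter
from urllib.parse import urlsplit

# Pulls the request target and user agent out of a combined-format log line.
LOG_LINE = re.compile(
    r'"(?:GET|HEAD) (?P<target>\S+) [^"]*" \d{3} \S+ "[^"]*" "(?P<agent>[^"]*)"'
)

def googlebot_crawl_summary(lines):
    """Count Googlebot requests to clean vs. parameterised URLs."""
    counts = Counter()
    for line in lines:
        match = LOG_LINE.search(line)
        if not match or "Googlebot" not in match.group("agent"):
            continue  # skip unparseable lines and non-Googlebot traffic
        has_query = bool(urlsplit(match.group("target")).query)
        counts["parameterised" if has_query else "clean"] += 1
    return counts

GOOGLEBOT = "Mozilla/5.0 (compatible; Googlebot/2.1; +http://www.google.com/bot.html)"
BROWSER = "Mozilla/5.0 (Windows NT 10.0; Win64; x64)"

# Invented sample lines, for illustration only.
sample_log = [
    f'66.249.66.1 - - [01/Jan/2017:10:00:00 +0000] "GET /products/shoes HTTP/1.1" 200 512 "-" "{GOOGLEBOT}"',
    f'66.249.66.1 - - [01/Jan/2017:10:00:01 +0000] "GET /search-results?q=shoes HTTP/1.1" 200 512 "-" "{GOOGLEBOT}"',
    f'66.249.66.1 - - [01/Jan/2017:10:00:02 +0000] "GET /products?sessionkey=abc HTTP/1.1" 200 512 "-" "{GOOGLEBOT}"',
    f'203.0.113.9 - - [01/Jan/2017:10:00:03 +0000] "GET /products?page=2 HTTP/1.1" 200 512 "-" "{BROWSER}"',
]

summary = googlebot_crawl_summary(sample_log)
print(dict(summary))
```

A high share of parameterised hits is a hint, not proof, that crawl budget is going to URLs you would rather Google ignored.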

Page 11: Combatting Crawl Bloat & Pruning Your Content Effectively

why is this a problem?

Don’t work them too hard

Search engine bots hate crawling rubbish

Page 12: Combatting Crawl Bloat & Pruning Your Content Effectively

why is this a problem?

Engagement metrics are key, especially now we’re “mobile first”

Page 13: Combatting Crawl Bloat & Pruning Your Content Effectively

examples of rubbish: search result URLs

/search/

/sessionkey=

/search-results?q=
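
To size the problem, URLs from a crawl or log export can be flagged against the patterns on this slide. This is a rough sketch: the function name is hypothetical, the example URLs are invented, and the matching rules are an assumption about how the three patterns appear on a real site.

```python
from urllib.parse import urlsplit

def is_low_value_url(url):
    """Return True for internal-search or session-key URLs (patterns from the slide)."""
    path = urlsplit(url).path.lower()
    if path.startswith("/search/") or path.startswith("/search-results"):
        return True
    # Assumption: session keys may sit in either the path or the query string.
    return "sessionkey=" in url.lower()

urls = [
    "https://example.com/search/red-shoes",
    "https://example.com/search-results?q=shoes",
    "https://example.com/products?sessionkey=abc123",
    "https://example.com/products/red-shoes",
]
flagged = [url for url in urls if is_low_value_url(url)]
print(flagged)
```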

Page 14: Combatting Crawl Bloat & Pruning Your Content Effectively

examples of rubbish: paginated URLs

rel=next/prev

noindex

Page 15: Combatting Crawl Bloat & Pruning Your Content Effectively

examples of rubbish: faceted navigations

json-ld

disallow parameters

noindex

Page 16: Combatting Crawl Bloat & Pruning Your Content Effectively

who’s seen this?

Page 17: Combatting Crawl Bloat & Pruning Your Content Effectively

cut the crap

Get Rid Of It

Page 18: Combatting Crawl Bloat & Pruning Your Content Effectively

plan

Page 19: Combatting Crawl Bloat & Pruning Your Content Effectively

noindex

<meta name="robots" content="noindex">

Won’t deal with crawl bloat in the short term

Still an effective tactic
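
When auditing, it helps to confirm which pages actually carry the tag. The sketch below uses only Python's standard library and checks the robots meta tag alone; it does not look at the X-Robots-Tag HTTP header, and the class and function names are hypothetical.

```python
from html.parser import HTMLParser

class RobotsMetaParser(HTMLParser):
    """Collects the directives found in <meta name="robots"> tags."""

    def __init__(self):
        super().__init__()
        self.directives = set()

    def handle_starttag(self, tag, attrs):
        if tag != "meta":
            return
        attributes = {name: (value or "") for name, value in attrs}
        if attributes.get("name", "").lower() == "robots":
            for directive in attributes.get("content", "").split(","):
                self.directives.add(directive.strip().lower())

def has_noindex(html):
    """Return True if the HTML contains a robots meta tag with noindex."""
    parser = RobotsMetaParser()
    parser.feed(html)
    return "noindex" in parser.directives

page = '<html><head><meta name="robots" content="noindex, follow"></head></html>'
print(has_noindex(page))
```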

Page 20: Combatting Crawl Bloat & Pruning Your Content Effectively

url parameters

Powerful tool when used properly

No need for developer support

Page 21: Combatting Crawl Bloat & Pruning Your Content Effectively

xml sitemaps

• Redirected URLs
• Noindexed URLs
• Poor quality pages
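
Reading the three items above as URL types to keep out of your XML sitemaps (an assumption about the slide's intent), a sitemap build step could filter them like this. The page records, the `low_quality` flag and the function name are all hypothetical; in practice the flags would come from your own crawl and content audit.

```python
import xml.etree.ElementTree as ET

SITEMAP_NS = "http://www.sitemaps.org/schemas/sitemap/0.9"

def build_sitemap(pages):
    """Build sitemap XML containing only indexable, worthwhile 200 pages."""
    urlset = ET.Element("urlset", xmlns=SITEMAP_NS)
    for page in pages:
        if page["status"] != 200 or page["noindex"] or page["low_quality"]:
            continue  # redirected, noindexed or poor-quality: leave it out
        url = ET.SubElement(urlset, "url")
        ET.SubElement(url, "loc").text = page["url"]
    return ET.tostring(urlset, encoding="unicode")

# Invented audit output, for illustration only.
pages = [
    {"url": "https://example.com/products/red-shoes", "status": 200, "noindex": False, "low_quality": False},
    {"url": "https://example.com/old-page", "status": 301, "noindex": False, "low_quality": False},
    {"url": "https://example.com/search-results?q=shoes", "status": 200, "noindex": True, "low_quality": False},
    {"url": "https://example.com/blog/thin-post", "status": 200, "noindex": False, "low_quality": True},
]

sitemap_xml = build_sitemap(pages)
print(sitemap_xml)
```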

Page 22: Combatting Crawl Bloat & Pruning Your Content Effectively

robots.txt

Exercise Caution!

Powerful crawl bloat tool when used effectively

Page 23: Combatting Crawl Bloat & Pruning Your Content Effectively

robots.txt

Blocking Crawl Paths – Stems Authority Flow

Google needs to see tags to honour them!
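
The point about tags can be checked before a robots.txt change goes live: any noindexed URL that the rules block will never have its noindex seen. A minimal sketch using Python's standard `urllib.robotparser` follows; the rules, URLs and function name are invented for illustration, and this parser does simple prefix matching, so it may not interpret wildcard rules the way Googlebot does.

```python
from urllib.robotparser import RobotFileParser

def unseen_noindex_urls(robots_txt, noindexed_urls, user_agent="Googlebot"):
    """Return noindexed URLs that robots.txt stops the crawler from fetching."""
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return [url for url in noindexed_urls if not parser.can_fetch(user_agent, url)]

# Hypothetical draft rules and a hypothetical list of pages carrying noindex.
ROBOTS_TXT = """\
User-agent: *
Disallow: /search/
Disallow: /search-results
"""

noindexed = [
    "https://example.com/search/red-shoes",
    "https://example.com/tag/shoes",
]

hidden = unseen_noindex_urls(ROBOTS_TXT, noindexed)
print(hidden)
```

Anything returned here needs a decision: either unblock it so the noindex can be read, or accept the block and stop relying on the tag.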

Page 24: Combatting Crawl Bloat & Pruning Your Content Effectively

google’s hint

Google gives you all you need to monitor your results

All these features are your friends

Page 25: Combatting Crawl Bloat & Pruning Your Content Effectively

crawl stats

Great way to gauge your efforts

Page 26: Combatting Crawl Bloat & Pruning Your Content Effectively

the prune

Pruning

Page 27: Combatting Crawl Bloat & Pruning Your Content Effectively

pruning – the fun bit

SEO Housekeeping

Now you have your crawl under control, show off your best bits

Page 28: Combatting Crawl Bloat & Pruning Your Content Effectively

pruning – why?

• Spinning
• Excessive blogging
• Sparse articles
• Over-optimisation
• Saturation

Page 29: Combatting Crawl Bloat & Pruning Your Content Effectively

stage one – content audit/content is king

SEO’s most annoying cliché

Content auditing still essential nonetheless

Page 30: Combatting Crawl Bloat & Pruning Your Content Effectively

stage two: the prune

410 – Gone

For pages you want gone forever
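
How a 410 is served depends on your platform, but the idea is simply a list of retired paths that answer "Gone" instead of "OK". The sketch below uses Python's built-in `http.server` purely as an illustration; the pruned paths, handler name and port are hypothetical, and a production site would normally configure this in its web server or CMS instead.

```python
from http.server import BaseHTTPRequestHandler, HTTPServer

# Hypothetical paths identified for permanent removal during the audit.
PRUNED_PATHS = {"/blog/old-thin-post", "/blog/spun-article"}

def status_for(path):
    """Return 410 for pruned paths and 200 for everything else."""
    return 410 if path in PRUNED_PATHS else 200

class PruneHandler(BaseHTTPRequestHandler):
    def do_GET(self):
        # Ignore any query string when looking up the path.
        status = status_for(self.path.split("?", 1)[0])
        body = b"Gone" if status == 410 else b"OK"
        self.send_response(status)
        self.send_header("Content-Type", "text/plain")
        self.send_header("Content-Length", str(len(body)))
        self.end_headers()
        self.wfile.write(body)

def run(port=8000):
    """Start a local demo server; call run() manually to try it out."""
    HTTPServer(("127.0.0.1", port), PruneHandler).serve_forever()
```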

Page 31: Combatting Crawl Bloat & Pruning Your Content Effectively

stage two: the prune

Won’t help crawl bloat short term

Will disappear eventually*

Page 32: Combatting Crawl Bloat & Pruning Your Content Effectively

stage two: the prune

Crawl Bloat

Authority

Indexation

Page 33: Combatting Crawl Bloat & Pruning Your Content Effectively

stage two: the prune

NoIndex

Will be crawled less over time

Ensures a high quality SERP

For duplicate content, cannibalisation and API style pages

Page 34: Combatting Crawl Bloat & Pruning Your Content Effectively

stage two: canonical

canonical

For duplicate content, cannibalisation and API style pages
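
For parameter-driven duplicates, the canonical target is usually the same URL with the noise parameters removed. Here is a small sketch under that assumption; the list of ignored parameters is hypothetical and would need to reflect the parameters that do not change content on your own site.

```python
from html import escape
from urllib.parse import parse_qsl, urlencode, urlsplit, urlunsplit

# Hypothetical parameters that create duplicates rather than new content.
IGNORED_PARAMS = {"sessionkey", "sort", "colour"}

def canonical_url(url):
    """Strip ignored query parameters and any fragment from a URL."""
    parts = urlsplit(url)
    kept = [
        (key, value)
        for key, value in parse_qsl(parts.query, keep_blank_values=True)
        if key.lower() not in IGNORED_PARAMS
    ]
    return urlunsplit((parts.scheme, parts.netloc, parts.path, urlencode(kept), ""))

def canonical_tag(url):
    """Render the <link rel="canonical"> tag for a page URL."""
    return f'<link rel="canonical" href="{escape(canonical_url(url), quote=True)}">'

print(canonical_tag("https://example.com/shoes?colour=red&sort=price&page=2"))
```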

Page 35: Combatting Crawl Bloat & Pruning Your Content Effectively

stage three: content repurposing/enhancement

• Evergreen
• Seasonal
• Long form
• Research

Page 36: Combatting Crawl Bloat & Pruning Your Content Effectively

search quality guidelines

Search Quality Evaluator Guidelines

Page 37: Combatting Crawl Bloat & Pruning Your Content Effectively

stage four: the finishing touch

Link Profile Analysis

• Links pointing at excluded pages
• Usual disavow request

Don’t ruin all your hard work with a crap link profile
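
If the link analysis ends in a disavow request, the file itself is plain text: one URL or `domain:` entry per line, with `#` for comments. The sketch below assembles one from an audit list; the function name and the example domains are hypothetical, and deciding what belongs in the file remains a manual judgement.

```python
from urllib.parse import urlsplit

def build_disavow_file(bad_links, whole_domains=()):
    """Build disavow file text from individual URLs and whole domains."""
    domains = set(whole_domains)
    lines = ["# Disavow list generated after link profile analysis"]
    for domain in sorted(domains):
        lines.append(f"domain:{domain}")
    for link in sorted(set(bad_links)):
        # Skip single URLs already covered by a domain-level entry.
        if urlsplit(link).hostname not in domains:
            lines.append(link)
    return "\n".join(lines) + "\n"

disavow_text = build_disavow_file(
    bad_links=["http://spam.example/page1", "http://links.example/dir/a"],
    whole_domains=["spam.example"],
)
print(disavow_text)
```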

Page 38: Combatting Crawl Bloat & Pruning Your Content Effectively

crawl bloat and pruning checklist

• Does your site have crawl bloat?
• Choose the appropriate fixes for your site
• Prune your remaining content
• Continually improve and enhance

Page 39: Combatting Crawl Bloat & Pruning Your Content Effectively

final tips: thanks for coming

Thanks for Listening!

2 Tips