Webmaster...

176
Webmaster Conference Product Summit November 4, 2019 | Mountain View, CA

Transcript of Webmaster...

Page 1: Webmaster Conferenceservices.google.com/fh/files/events/wmconf_product_summit_slides_publish.pdfGooglebot & Web Hosting Claiming Your Knowledge Panel. Session Improving Search Over

Webmaster ConferenceProduct SummitNovember 4, 2019 | Mountain View, CA

Page 2: Webmaster Conferenceservices.google.com/fh/files/events/wmconf_product_summit_slides_publish.pdfGooglebot & Web Hosting Claiming Your Knowledge Panel. Session Improving Search Over

Welcome

Page 3: Webmaster Conferenceservices.google.com/fh/files/events/wmconf_product_summit_slides_publish.pdfGooglebot & Web Hosting Claiming Your Knowledge Panel. Session Improving Search Over

Lightning talks

Structured Data

Search Console

Web Deduplication

Google Images

Rendering

Titles & Snippets/Result Previews

Googlebot & Web Hosting

Claiming Your Knowledge Panel

Page 4: Webmaster Conferenceservices.google.com/fh/files/events/wmconf_product_summit_slides_publish.pdfGooglebot & Web Hosting Claiming Your Knowledge Panel. Session Improving Search Over

Session

Improving Search Over the Years

Paul HaahrDistinguished Engineer, Search

Page 5: Webmaster Conferenceservices.google.com/fh/files/events/wmconf_product_summit_slides_publish.pdfGooglebot & Web Hosting Claiming Your Knowledge Panel. Session Improving Search Over

Why this event?

Page 6: Webmaster Conferenceservices.google.com/fh/files/events/wmconf_product_summit_slides_publish.pdfGooglebot & Web Hosting Claiming Your Knowledge Panel. Session Improving Search Over

We do get lots of feedback...

Page 7: Webmaster Conferenceservices.google.com/fh/files/events/wmconf_product_summit_slides_publish.pdfGooglebot & Web Hosting Claiming Your Knowledge Panel. Session Improving Search Over

We do get lots of feedback...

Webmaster Office Hours

Page 8: Webmaster Conferenceservices.google.com/fh/files/events/wmconf_product_summit_slides_publish.pdfGooglebot & Web Hosting Claiming Your Knowledge Panel. Session Improving Search Over

And we really do think about webmasters & content owners constantly...

Page 9: Webmaster Conferenceservices.google.com/fh/files/events/wmconf_product_summit_slides_publish.pdfGooglebot & Web Hosting Claiming Your Knowledge Panel. Session Improving Search Over

We're interested in organizing a ‘meet the ecosystem’ initiative for the core search PM team, and Emily thought you all would be good thought partners to kick this off.

“”

An example of that, the origin of this event:

Page 10: Webmaster Conferenceservices.google.com/fh/files/events/wmconf_product_summit_slides_publish.pdfGooglebot & Web Hosting Claiming Your Knowledge Panel. Session Improving Search Over

We know there can be concerns, but our future is with you.

Page 11: Webmaster Conferenceservices.google.com/fh/files/events/wmconf_product_summit_slides_publish.pdfGooglebot & Web Hosting Claiming Your Knowledge Panel. Session Improving Search Over

But both our futures depend on search continuing to evolve…

Page 12: Webmaster Conferenceservices.google.com/fh/files/events/wmconf_product_summit_slides_publish.pdfGooglebot & Web Hosting Claiming Your Knowledge Panel. Session Improving Search Over

The world of search has evolved from this…

Page 13: Webmaster Conferenceservices.google.com/fh/files/events/wmconf_product_summit_slides_publish.pdfGooglebot & Web Hosting Claiming Your Knowledge Panel. Session Improving Search Over

To this.

Page 14: Webmaster Conferenceservices.google.com/fh/files/events/wmconf_product_summit_slides_publish.pdfGooglebot & Web Hosting Claiming Your Knowledge Panel. Session Improving Search Over

Traffic could be one measure of ecosystem health as search evolves.

Google sends over 24 billion visits per month to news publisher websites.

24billion

visits/month

Page 15: Webmaster Conferenceservices.google.com/fh/files/events/wmconf_product_summit_slides_publish.pdfGooglebot & Web Hosting Claiming Your Knowledge Panel. Session Improving Search Over

But measuring in pure traffic can be imperfect.

Page 16: Webmaster Conferenceservices.google.com/fh/files/events/wmconf_product_summit_slides_publish.pdfGooglebot & Web Hosting Claiming Your Knowledge Panel. Session Improving Search Over

Can you get medicine for someone pharmacy

Before Bert After Bert, better page listed

Page 17: Webmaster Conferenceservices.google.com/fh/files/events/wmconf_product_summit_slides_publish.pdfGooglebot & Web Hosting Claiming Your Knowledge Panel. Session Improving Search Over

Traffic lost — but was it useful traffic

in the first place?

Page 18: Webmaster Conferenceservices.google.com/fh/files/events/wmconf_product_summit_slides_publish.pdfGooglebot & Web Hosting Claiming Your Knowledge Panel. Session Improving Search Over

What are ways Google should consider assessing

how it supports the open web & ecosystem?

Feedback on…

Page 19: Webmaster Conferenceservices.google.com/fh/files/events/wmconf_product_summit_slides_publish.pdfGooglebot & Web Hosting Claiming Your Knowledge Panel. Session Improving Search Over

What are ways Google can improve support overall?

Feedback on…

Page 20: Webmaster Conferenceservices.google.com/fh/files/events/wmconf_product_summit_slides_publish.pdfGooglebot & Web Hosting Claiming Your Knowledge Panel. Session Improving Search Over

Anything that’s on your minds!

Feedback on…

Page 21: Webmaster Conferenceservices.google.com/fh/files/events/wmconf_product_summit_slides_publish.pdfGooglebot & Web Hosting Claiming Your Knowledge Panel. Session Improving Search Over

Lightning talks

Page 22: Webmaster Conferenceservices.google.com/fh/files/events/wmconf_product_summit_slides_publish.pdfGooglebot & Web Hosting Claiming Your Knowledge Panel. Session Improving Search Over

Allen HarveyProduct Manager, Search

Structured Data

Page 23: Webmaster Conferenceservices.google.com/fh/files/events/wmconf_product_summit_slides_publish.pdfGooglebot & Web Hosting Claiming Your Knowledge Panel. Session Improving Search Over

Evolution of Search

Page 24: Webmaster Conferenceservices.google.com/fh/files/events/wmconf_product_summit_slides_publish.pdfGooglebot & Web Hosting Claiming Your Knowledge Panel. Session Improving Search Over

For example, recipe queries

Powered by structured data!

Page 25: Webmaster Conferenceservices.google.com/fh/files/events/wmconf_product_summit_slides_publish.pdfGooglebot & Web Hosting Claiming Your Knowledge Panel. Session Improving Search Over

Example HowTo structured data

Page 26: Webmaster Conferenceservices.google.com/fh/files/events/wmconf_product_summit_slides_publish.pdfGooglebot & Web Hosting Claiming Your Knowledge Panel. Session Improving Search Over

Structured data helps Google understand your page and enable special features.

Page 27: Webmaster Conferenceservices.google.com/fh/files/events/wmconf_product_summit_slides_publish.pdfGooglebot & Web Hosting Claiming Your Knowledge Panel. Session Improving Search Over

Makes your pages eligible for Rich Results

Q&A FAQ

Page 28: Webmaster Conferenceservices.google.com/fh/files/events/wmconf_product_summit_slides_publish.pdfGooglebot & Web Hosting Claiming Your Knowledge Panel. Session Improving Search Over

Eligible to appear in carousels on pages

Recipes

Page 29: Webmaster Conferenceservices.google.com/fh/files/events/wmconf_product_summit_slides_publish.pdfGooglebot & Web Hosting Claiming Your Knowledge Panel. Session Improving Search Over

Eligible to appear in search experiences

Events

Page 30: Webmaster Conferenceservices.google.com/fh/files/events/wmconf_product_summit_slides_publish.pdfGooglebot & Web Hosting Claiming Your Knowledge Panel. Session Improving Search Over

In summary, makes your pages eligible to...

Appear as Rich Results

Show in page-level features

Be visible in Search experiences

Work on the Assistant

1 2 3 4

Page 31: Webmaster Conferenceservices.google.com/fh/files/events/wmconf_product_summit_slides_publish.pdfGooglebot & Web Hosting Claiming Your Knowledge Panel. Session Improving Search Over

How do I get started?

Page 32: Webmaster Conferenceservices.google.com/fh/files/events/wmconf_product_summit_slides_publish.pdfGooglebot & Web Hosting Claiming Your Knowledge Panel. Session Improving Search Over

Find the structured data feature most suitable for your site in our Search gallery

goo.gle/searchgallery

Page 33: Webmaster Conferenceservices.google.com/fh/files/events/wmconf_product_summit_slides_publish.pdfGooglebot & Web Hosting Claiming Your Knowledge Panel. Session Improving Search Over

View the example markup and add your own content

Page 34: Webmaster Conferenceservices.google.com/fh/files/events/wmconf_product_summit_slides_publish.pdfGooglebot & Web Hosting Claiming Your Knowledge Panel. Session Improving Search Over

Test and Preview your Structured Data in Rich Results Test

goo.gle/richresults

Page 35: Webmaster Conferenceservices.google.com/fh/files/events/wmconf_product_summit_slides_publish.pdfGooglebot & Web Hosting Claiming Your Knowledge Panel. Session Improving Search Over

Measure performance in Search Console

g.co/searchconsole

Page 36: Webmaster Conferenceservices.google.com/fh/files/events/wmconf_product_summit_slides_publish.pdfGooglebot & Web Hosting Claiming Your Knowledge Panel. Session Improving Search Over

Recap

Structured data helps Google understand your page and makes it eligible to appear in special features.

All structured data features are available at goo.gle/searchgallery, please take a look and find the one right for your site.

Rich Results Test and Search Console can help you get your markup right and monitor performance for your pages.

Page 37: Webmaster Conferenceservices.google.com/fh/files/events/wmconf_product_summit_slides_publish.pdfGooglebot & Web Hosting Claiming Your Knowledge Panel. Session Improving Search Over

Thank you

Page 38: Webmaster Conferenceservices.google.com/fh/files/events/wmconf_product_summit_slides_publish.pdfGooglebot & Web Hosting Claiming Your Knowledge Panel. Session Improving Search Over

Search Console

Hillel MaozSoftware Engineer

Ofir RovalProduct Manager

Page 39: Webmaster Conferenceservices.google.com/fh/files/events/wmconf_product_summit_slides_publish.pdfGooglebot & Web Hosting Claiming Your Knowledge Panel. Session Improving Search Over

Search Console’s mission:

Provide data & tools to help site owners improve their websites and optimize their appearance on Google

Page 40: Webmaster Conferenceservices.google.com/fh/files/events/wmconf_product_summit_slides_publish.pdfGooglebot & Web Hosting Claiming Your Knowledge Panel. Session Improving Search Over

How do we ‘improve’ websites

Step 1: Define what can help a page succeed on Google

HTML

Page 41: Webmaster Conferenceservices.google.com/fh/files/events/wmconf_product_summit_slides_publish.pdfGooglebot & Web Hosting Claiming Your Knowledge Panel. Session Improving Search Over

How do we ‘improve’ websites

Step 1: Define what can help a page succeed on Google

Step 2: Classify all the pages in the Google index annotate any issues

HTML

Page 42: Webmaster Conferenceservices.google.com/fh/files/events/wmconf_product_summit_slides_publish.pdfGooglebot & Web Hosting Claiming Your Knowledge Panel. Session Improving Search Over

How do we ‘improve’ websites

Step 1: Define what can help a page succeed on Google

Step 2: Classify all the pages in the Google index annotate any issues

Step 3: Help site owners fix their site

HTML

Page 43: Webmaster Conferenceservices.google.com/fh/files/events/wmconf_product_summit_slides_publish.pdfGooglebot & Web Hosting Claiming Your Knowledge Panel. Session Improving Search Over

How Search Console Works?

Google Index

Website owners

WWW

Page 44: Webmaster Conferenceservices.google.com/fh/files/events/wmconf_product_summit_slides_publish.pdfGooglebot & Web Hosting Claiming Your Knowledge Panel. Session Improving Search Over

How Search Console Works?

Extract all relevant information from Google Search, report it to webmasters and help them use it to improve their site.

Page 45: Webmaster Conferenceservices.google.com/fh/files/events/wmconf_product_summit_slides_publish.pdfGooglebot & Web Hosting Claiming Your Knowledge Panel. Session Improving Search Over

Search Console Components

Search Analytics

WWW

Accounts

Alerts

Reporting

Testing Tools

Page 46: Webmaster Conferenceservices.google.com/fh/files/events/wmconf_product_summit_slides_publish.pdfGooglebot & Web Hosting Claiming Your Knowledge Panel. Session Improving Search Over

How does it work?

● Track all impressions on Search

● Many billions of Search results daily

● 16 months of data

● Support slice and dice to help you find insights and trends

Search Analytics

Page 47: Webmaster Conferenceservices.google.com/fh/files/events/wmconf_product_summit_slides_publish.pdfGooglebot & Web Hosting Claiming Your Knowledge Panel. Session Improving Search Over

How does it work?

● Track all pages crawled by Google

● Many millions of domains

● 100s of signals supporting dozens of Search features

● Actionable reports to help you fix any issues

Reporting

Page 48: Webmaster Conferenceservices.google.com/fh/files/events/wmconf_product_summit_slides_publish.pdfGooglebot & Web Hosting Claiming Your Knowledge Panel. Session Improving Search Over

How does it work?

● Trigger the entire Google index stack for a given URL

● On demand for any URL

● High fidelity to Search

● Support full debugging flow

Testing Tools

Page 49: Webmaster Conferenceservices.google.com/fh/files/events/wmconf_product_summit_slides_publish.pdfGooglebot & Web Hosting Claiming Your Knowledge Panel. Session Improving Search Over

How does it work?

● Manage all Search Console sites and owners

● Millions of sites / domains

● Protect website data via verificationAccounts

WWW

Page 50: Webmaster Conferenceservices.google.com/fh/files/events/wmconf_product_summit_slides_publish.pdfGooglebot & Web Hosting Claiming Your Knowledge Panel. Session Improving Search Over

How does it work?

● Understand over time site changes

● Send emails to all Search Console websites

● 10s of alert types

● Make sure you never miss any important change with your site

Alerts

WWW

Page 51: Webmaster Conferenceservices.google.com/fh/files/events/wmconf_product_summit_slides_publish.pdfGooglebot & Web Hosting Claiming Your Knowledge Panel. Session Improving Search Over

Launching today:

Speed Report on Search Console

Page 52: Webmaster Conferenceservices.google.com/fh/files/events/wmconf_product_summit_slides_publish.pdfGooglebot & Web Hosting Claiming Your Knowledge Panel. Session Improving Search Over

How we created the Speed report

Step 1: Identified the metrics that characterize fast pages

Step 2: Used real-world data and apply it over the Google index

Step 3: Created a speed report to help focus on issues

HTML

Page 53: Webmaster Conferenceservices.google.com/fh/files/events/wmconf_product_summit_slides_publish.pdfGooglebot & Web Hosting Claiming Your Knowledge Panel. Session Improving Search Over

Search Console Speed report

Get an overview on your site performance: slow, moderate, fast

Data is based on real user experience as represented by Chrome User Experience Report (CrUX)

Page 54: Webmaster Conferenceservices.google.com/fh/files/events/wmconf_product_summit_slides_publish.pdfGooglebot & Web Hosting Claiming Your Knowledge Panel. Session Improving Search Over

Identify Speed Issues

See examples of pages with speed related issues

Use Page Speed Insights to get optimization suggestions

Validate speed fixes resulted in improved user experience

Page 55: Webmaster Conferenceservices.google.com/fh/files/events/wmconf_product_summit_slides_publish.pdfGooglebot & Web Hosting Claiming Your Knowledge Panel. Session Improving Search Over

Get positive examples

Identify what type of pages are performing well

Apply similar implementation to other sections of your site

Page 56: Webmaster Conferenceservices.google.com/fh/files/events/wmconf_product_summit_slides_publish.pdfGooglebot & Web Hosting Claiming Your Knowledge Panel. Session Improving Search Over

Thank you

Page 57: Webmaster Conferenceservices.google.com/fh/files/events/wmconf_product_summit_slides_publish.pdfGooglebot & Web Hosting Claiming Your Knowledge Panel. Session Improving Search Over

Allan ScottSoftware Engineer

Web Deduplication

Page 58: Webmaster Conferenceservices.google.com/fh/files/events/wmconf_product_summit_slides_publish.pdfGooglebot & Web Hosting Claiming Your Knowledge Panel. Session Improving Search Over

What is it?

Deduplication

Raw

Deduplicated

1. Identify and cluster duplicate web pages.

2. Pick representative URLs, index unique pages.

3. Forward signals to representative URLs.

Page 59: Webmaster Conferenceservices.google.com/fh/files/events/wmconf_product_summit_slides_publish.pdfGooglebot & Web Hosting Claiming Your Knowledge Panel. Session Improving Search Over

Why deduplicate?

Deduplication

Users don’t want the “same” page repeated.

More room for distinct content in the index.

You retain signals when redesigning your site.

We can also find alternate names.

1 2 3 4

Page 60: Webmaster Conferenceservices.google.com/fh/files/events/wmconf_product_summit_slides_publish.pdfGooglebot & Web Hosting Claiming Your Knowledge Panel. Session Improving Search Over

Deduplication

Overview

What we use to cluster Localization

Selecting representative

URLs

Page 61: Webmaster Conferenceservices.google.com/fh/files/events/wmconf_product_summit_slides_publish.pdfGooglebot & Web Hosting Claiming Your Knowledge Panel. Session Improving Search Over

Signals we use

Clustering

1. Redirects.

2. Content.

3. Rel=Canonical.

4. Etc...

Page 62: Webmaster Conferenceservices.google.com/fh/files/events/wmconf_product_summit_slides_publish.pdfGooglebot & Web Hosting Claiming Your Knowledge Panel. Session Improving Search Over

Clustering

Redirects

We largely trust redirects, as they are almost perfectly predictive of duplicate content.

This is part of why we recommend using them when you move/redesign your sites.

Page 63: Webmaster Conferenceservices.google.com/fh/files/events/wmconf_product_summit_slides_publish.pdfGooglebot & Web Hosting Claiming Your Knowledge Panel. Session Improving Search Over

Clustering

Content

We also use content checksums.We make efforts to ignore boilerplate.Catches many soft error pages.This is why we prefer getting an HTTP error.

Page 64: Webmaster Conferenceservices.google.com/fh/files/events/wmconf_product_summit_slides_publish.pdfGooglebot & Web Hosting Claiming Your Knowledge Panel. Session Improving Search Over

Clustering

Rel=Canonical

We use your annotations to cluster.These tend to get more verification.Thresholds are still intentionally loose.Had more than a few “broken script” clusters.

Page 65: Webmaster Conferenceservices.google.com/fh/files/events/wmconf_product_summit_slides_publish.pdfGooglebot & Web Hosting Claiming Your Knowledge Panel. Session Improving Search Over

tl;dr version

Localization

1. When main content is the same, pages cluster.

2. This can include boilerplate-only localization.

3. Clever geo-redirecting often also clusters.

4. We use hreflang alternates to bridge this gap.

Page 66: Webmaster Conferenceservices.google.com/fh/files/events/wmconf_product_summit_slides_publish.pdfGooglebot & Web Hosting Claiming Your Knowledge Panel. Session Improving Search Over

Representative URLs

Canonicalization

1. To deduplicate we must pick a representative.

2. Candidates compete on weighted criterion.

3. Weights are trained by ML.

Page 67: Webmaster Conferenceservices.google.com/fh/files/events/wmconf_product_summit_slides_publish.pdfGooglebot & Web Hosting Claiming Your Knowledge Panel. Session Improving Search Over

Signals

Canonicalization

1. Hijacking is our overriding concern here.• Escalations via WTA forums are valuable.

2. Second concern is “user experience”: Security, slow meta refresh, etc.

3. Third, webmaster signals:Redirects, rel=canonical, sitemaps.

Page 68: Webmaster Conferenceservices.google.com/fh/files/events/wmconf_product_summit_slides_publish.pdfGooglebot & Web Hosting Claiming Your Knowledge Panel. Session Improving Search Over

Some Suggestions

1. Use redirects to clue us into your site redesign.2. Send us meaningful HTTP result codes.3. Check your rel=canonical links.4. Use hreflang links to help us localize.5. Keep reporting hijacking cases to the forums.6. Secure dependencies for secure pages.7. Keep canonical signals unambiguous.

Page 69: Webmaster Conferenceservices.google.com/fh/files/events/wmconf_product_summit_slides_publish.pdfGooglebot & Web Hosting Claiming Your Knowledge Panel. Session Improving Search Over

Thank you

Page 70: Webmaster Conferenceservices.google.com/fh/files/events/wmconf_product_summit_slides_publish.pdfGooglebot & Web Hosting Claiming Your Knowledge Panel. Session Improving Search Over

Francois SpiesProduct Manager: Google Images

Google Images

Page 71: Webmaster Conferenceservices.google.com/fh/files/events/wmconf_product_summit_slides_publish.pdfGooglebot & Web Hosting Claiming Your Knowledge Panel. Session Improving Search Over

Product overview

Product changes

SEO best practices

01

02

03

Page 72: Webmaster Conferenceservices.google.com/fh/files/events/wmconf_product_summit_slides_publish.pdfGooglebot & Web Hosting Claiming Your Knowledge Panel. Session Improving Search Over
Page 73: Webmaster Conferenceservices.google.com/fh/files/events/wmconf_product_summit_slides_publish.pdfGooglebot & Web Hosting Claiming Your Knowledge Panel. Session Improving Search Over
Page 74: Webmaster Conferenceservices.google.com/fh/files/events/wmconf_product_summit_slides_publish.pdfGooglebot & Web Hosting Claiming Your Knowledge Panel. Session Improving Search Over
Page 75: Webmaster Conferenceservices.google.com/fh/files/events/wmconf_product_summit_slides_publish.pdfGooglebot & Web Hosting Claiming Your Knowledge Panel. Session Improving Search Over

Recent Product Changes

Page 76: Webmaster Conferenceservices.google.com/fh/files/events/wmconf_product_summit_slides_publish.pdfGooglebot & Web Hosting Claiming Your Knowledge Panel. Session Improving Search Over

Product Updates

Images on Desktop

Viewer in side panel

More prominent publisher context alongside image

Align viewer format across surfaces

Page 77: Webmaster Conferenceservices.google.com/fh/files/events/wmconf_product_summit_slides_publish.pdfGooglebot & Web Hosting Claiming Your Knowledge Panel. Session Improving Search Over

Aesthetic changes to improve legibility

More prominent button to visit page

Product Updates

Images viewer

Page 78: Webmaster Conferenceservices.google.com/fh/files/events/wmconf_product_summit_slides_publish.pdfGooglebot & Web Hosting Claiming Your Knowledge Panel. Session Improving Search Over

Reduce friction for users to visit AMP web pages

Product Updates

Swipe to Visit

Page 79: Webmaster Conferenceservices.google.com/fh/files/events/wmconf_product_summit_slides_publish.pdfGooglebot & Web Hosting Claiming Your Knowledge Panel. Session Improving Search Over

Optimization Best Practices

Page 80: Webmaster Conferenceservices.google.com/fh/files/events/wmconf_product_summit_slides_publish.pdfGooglebot & Web Hosting Claiming Your Knowledge Panel. Session Improving Search Over

Use structured data for rich results (especially for products, videos, recipes)

Page 81: Webmaster Conferenceservices.google.com/fh/files/events/wmconf_product_summit_slides_publish.pdfGooglebot & Web Hosting Claiming Your Knowledge Panel. Session Improving Search Over

Use descriptive titles, captions, and filenames

Page 82: Webmaster Conferenceservices.google.com/fh/files/events/wmconf_product_summit_slides_publish.pdfGooglebot & Web Hosting Claiming Your Knowledge Panel. Session Improving Search Over

Use high-quality & optimized images, placed next to text, on mobile-friendly pages

Page 83: Webmaster Conferenceservices.google.com/fh/files/events/wmconf_product_summit_slides_publish.pdfGooglebot & Web Hosting Claiming Your Knowledge Panel. Session Improving Search Over

In closing, a reminder that you are able to check traffic from Google Images within Google Search Console

Page 84: Webmaster Conferenceservices.google.com/fh/files/events/wmconf_product_summit_slides_publish.pdfGooglebot & Web Hosting Claiming Your Knowledge Panel. Session Improving Search Over

Thank you

Page 85: Webmaster Conferenceservices.google.com/fh/files/events/wmconf_product_summit_slides_publish.pdfGooglebot & Web Hosting Claiming Your Knowledge Panel. Session Improving Search Over

Rendering

Erik HendriksSoftware Engineer

Page 86: Webmaster Conferenceservices.google.com/fh/files/events/wmconf_product_summit_slides_publish.pdfGooglebot & Web Hosting Claiming Your Knowledge Panel. Session Improving Search Over

Rendering allows Googlebot to see the same web as users do.

Page 87: Webmaster Conferenceservices.google.com/fh/files/events/wmconf_product_summit_slides_publish.pdfGooglebot & Web Hosting Claiming Your Knowledge Panel. Session Improving Search Over

Rendering

Before After

Page 88: Webmaster Conferenceservices.google.com/fh/files/events/wmconf_product_summit_slides_publish.pdfGooglebot & Web Hosting Claiming Your Knowledge Panel. Session Improving Search Over

Googlebot renders with Chrome

Page 89: Webmaster Conferenceservices.google.com/fh/files/events/wmconf_product_summit_slides_publish.pdfGooglebot & Web Hosting Claiming Your Knowledge Panel. Session Improving Search Over

Challenges

Page 90: Webmaster Conferenceservices.google.com/fh/files/events/wmconf_product_summit_slides_publish.pdfGooglebot & Web Hosting Claiming Your Knowledge Panel. Session Improving Search Over

Challenges

Fetching

JavaScript

At web scale

Page 91: Webmaster Conferenceservices.google.com/fh/files/events/wmconf_product_summit_slides_publish.pdfGooglebot & Web Hosting Claiming Your Knowledge Panel. Session Improving Search Over

Fetching

Page 92: Webmaster Conferenceservices.google.com/fh/files/events/wmconf_product_summit_slides_publish.pdfGooglebot & Web Hosting Claiming Your Knowledge Panel. Session Improving Search Over

Fetching

Limited access (robots.txt)

Limited crawl volume1 2Differences

Page 93: Webmaster Conferenceservices.google.com/fh/files/events/wmconf_product_summit_slides_publish.pdfGooglebot & Web Hosting Claiming Your Knowledge Panel. Session Improving Search Over

Fetching

Resource fetches per page(obeying robots.txt)

50-60 60-70%Cache Hit Rate

Page 94: Webmaster Conferenceservices.google.com/fh/files/events/wmconf_product_summit_slides_publish.pdfGooglebot & Web Hosting Claiming Your Knowledge Panel. Session Improving Search Over

Fetching

20x

Page 95: Webmaster Conferenceservices.google.com/fh/files/events/wmconf_product_summit_slides_publish.pdfGooglebot & Web Hosting Claiming Your Knowledge Panel. Session Improving Search Over

Fetching

We do not obey HTTP caching rules

● Don't rely on clever caching

Corners will be cut

We might not fetch everything

● Minimize fetches● Be resilient

1 2

Page 96: Webmaster Conferenceservices.google.com/fh/files/events/wmconf_product_summit_slides_publish.pdfGooglebot & Web Hosting Claiming Your Knowledge Panel. Session Improving Search Over

JavaScript

Page 97: Webmaster Conferenceservices.google.com/fh/files/events/wmconf_product_summit_slides_publish.pdfGooglebot & Web Hosting Claiming Your Knowledge Panel. Session Improving Search Over

JavaScript

Performance matters!

We limit CPU consumption

We will interrupt script

Excessive CPU consumption negatively impacts indexing

Page 98: Webmaster Conferenceservices.google.com/fh/files/events/wmconf_product_summit_slides_publish.pdfGooglebot & Web Hosting Claiming Your Knowledge Panel. Session Improving Search Over

Popular ways to fail

Error loops (robots.txt, missing features)

Cloaking

Cryptocurrency miners

JavaScript

Page 99: Webmaster Conferenceservices.google.com/fh/files/events/wmconf_product_summit_slides_publish.pdfGooglebot & Web Hosting Claiming Your Knowledge Panel. Session Improving Search Over

Thank you

Page 100: Webmaster Conferenceservices.google.com/fh/files/events/wmconf_product_summit_slides_publish.pdfGooglebot & Web Hosting Claiming Your Knowledge Panel. Session Improving Search Over

Titles & Snippets/ Result Previews

Phiroze ParakhSoftware Engineer

Page 101: Webmaster Conferenceservices.google.com/fh/files/events/wmconf_product_summit_slides_publish.pdfGooglebot & Web Hosting Claiming Your Knowledge Panel. Session Improving Search Over

Result Preview Pillars

Relevance to user need

Help users choose between results

Illuminate depth of content within results

Express diversity of content in the ecosystem

Page 102: Webmaster Conferenceservices.google.com/fh/files/events/wmconf_product_summit_slides_publish.pdfGooglebot & Web Hosting Claiming Your Knowledge Panel. Session Improving Search Over

Web Result Architecture

AttributionTitle

Content Preview

Page 103: Webmaster Conferenceservices.google.com/fh/files/events/wmconf_product_summit_slides_publish.pdfGooglebot & Web Hosting Claiming Your Knowledge Panel. Session Improving Search Over

Relevance in Snippets - an Example

[queries in dbms]

[database queries]

Page 104: Webmaster Conferenceservices.google.com/fh/files/events/wmconf_product_summit_slides_publish.pdfGooglebot & Web Hosting Claiming Your Knowledge Panel. Session Improving Search Over

Images in Previews

Images are relevant to the query

Placement on right● Images are secondary to title/snippet

Galleries support contentful pages

Users visit a greater diversity of sites

Page 105: Webmaster Conferenceservices.google.com/fh/files/events/wmconf_product_summit_slides_publish.pdfGooglebot & Web Hosting Claiming Your Knowledge Panel. Session Improving Search Over

Video in Previews

Video is relevant to the query

Preview indicates if video is dominant or supportive

Video metadata informs user experience

Page 106: Webmaster Conferenceservices.google.com/fh/files/events/wmconf_product_summit_slides_publish.pdfGooglebot & Web Hosting Claiming Your Knowledge Panel. Session Improving Search Over

Sitelinks in Previews

Links are relevant to query

Links extracted algorithmically● Menus, Site-structure ● Drives traffic into a diverse set of sites

Sitelink-images help users● Pithy links are better understood

Page 107: Webmaster Conferenceservices.google.com/fh/files/events/wmconf_product_summit_slides_publish.pdfGooglebot & Web Hosting Claiming Your Knowledge Panel. Session Improving Search Over

Entity Facts in Previews

Relevance to needs around the entity

Facts extracted algorithmically● Tables, Lists

Page 108: Webmaster Conferenceservices.google.com/fh/files/events/wmconf_product_summit_slides_publish.pdfGooglebot & Web Hosting Claiming Your Knowledge Panel. Session Improving Search Over

Forums as Previews

Posts extracted algorithmically

Forum cluster is relevant to query

Users find value in forum sites

Markup (QnA) takes priority

Page 109: Webmaster Conferenceservices.google.com/fh/files/events/wmconf_product_summit_slides_publish.pdfGooglebot & Web Hosting Claiming Your Knowledge Panel. Session Improving Search Over

Tables & Lists as Previews

Pages with dominant Tables/Lists

Helps users contrast content

Structure and position on the page guides the preview

Page 110: Webmaster Conferenceservices.google.com/fh/files/events/wmconf_product_summit_slides_publish.pdfGooglebot & Web Hosting Claiming Your Knowledge Panel. Session Improving Search Over

Summary

Relevance infuses everything

Attribution helps user choose the source

Express the depth of content● Sitelinks drive traffic into the site● Forum previews show related content

Preview diversity supports the ecosystem● Imagery, Video, Tables, Lists, Facts

Page 111: Webmaster Conferenceservices.google.com/fh/files/events/wmconf_product_summit_slides_publish.pdfGooglebot & Web Hosting Claiming Your Knowledge Panel. Session Improving Search Over

Thank you

Page 112: Webmaster Conferenceservices.google.com/fh/files/events/wmconf_product_summit_slides_publish.pdfGooglebot & Web Hosting Claiming Your Knowledge Panel. Session Improving Search Over

Googlebot & Web Hosting

Jin LiangSoftware Engineer

Page 113: Webmaster Conferenceservices.google.com/fh/files/events/wmconf_product_summit_slides_publish.pdfGooglebot & Web Hosting Claiming Your Knowledge Panel. Session Improving Search Over

Googlebot and Web Hosting

HTTP Request

HTTP Response

How has HTTP serving evolved

Robots.txt explained

Google crawl rate

01

02

03

Page 114: Webmaster Conferenceservices.google.com/fh/files/events/wmconf_product_summit_slides_publish.pdfGooglebot & Web Hosting Claiming Your Knowledge Panel. Session Improving Search Over

HTTP Server Popularity Changes

HTTP Server Popularity Seen By Googlebot

Page 115: Webmaster Conferenceservices.google.com/fh/files/events/wmconf_product_summit_slides_publish.pdfGooglebot & Web Hosting Claiming Your Knowledge Panel. Session Improving Search Over

Traffic ShiftingTo HTTPS

HTTP vs HTTPS Traffic

Page 116: Webmaster Conferenceservices.google.com/fh/files/events/wmconf_product_summit_slides_publish.pdfGooglebot & Web Hosting Claiming Your Knowledge Panel. Session Improving Search Over

HTTP Fetching Getting Faster

Average Download Time (ms)

Page 117: Webmaster Conferenceservices.google.com/fh/files/events/wmconf_product_summit_slides_publish.pdfGooglebot & Web Hosting Claiming Your Knowledge Panel. Session Improving Search Over

Robots.txt Explained

User-Agent: GooglebotAllow: /useful-contentDisallow: /do-not-crawl...

Common way for webmasters to specify access to their websitesNot a standard, and no effort to make it so...… until we proposed to IETF in July!

Page 118: Webmaster Conferenceservices.google.com/fh/files/events/wmconf_product_summit_slides_publish.pdfGooglebot & Web Hosting Claiming Your Knowledge Panel. Session Improving Search Over

Robots.txt fetching issues

For every URL we crawl, we must first check the robots.txt.

Robots.txt fetch can fail.

If we get 200, 404 → Good

Unreachable → Bad

If we get 5xx → OK if it’s transient

Robots.txt itself needs to be fetched.

Page 119: Webmaster Conferenceservices.google.com/fh/files/events/wmconf_product_summit_slides_publish.pdfGooglebot & Web Hosting Claiming Your Knowledge Panel. Session Improving Search Over

Robots.txt fetch status

200404 5xx Unreachable

Page 120: Webmaster Conferenceservices.google.com/fh/files/events/wmconf_product_summit_slides_publish.pdfGooglebot & Web Hosting Claiming Your Knowledge Panel. Session Improving Search Over

200404 5xx Unreachable

Robots.txt usage recommendation

Use robots to control access

Return 404 for open access

Create a robots.txt, or respond with 404

Do not always return 5xx

Page 121: Webmaster Conferenceservices.google.com/fh/files/events/wmconf_product_summit_slides_publish.pdfGooglebot & Web Hosting Claiming Your Knowledge Panel. Session Improving Search Over

Crawl Rate Limiting

Googlebot has sophisticated way to determine how fast to crawl a site

Webmaster can set a custom crawl rate in search console if needed

Page 122: Webmaster Conferenceservices.google.com/fh/files/events/wmconf_product_summit_slides_publish.pdfGooglebot & Web Hosting Claiming Your Knowledge Panel. Session Improving Search Over

Custom Crawl Rate Recommendations

Custom crawl rate does not trigger more crawl.

It may accidentally reduce our crawl rate.

Our recommendation: “Leave it to Google” unless your site is being overloaded

Page 123: Webmaster Conferenceservices.google.com/fh/files/events/wmconf_product_summit_slides_publish.pdfGooglebot & Web Hosting Claiming Your Knowledge Panel. Session Improving Search Over

Thank you

Page 124: Webmaster Conferenceservices.google.com/fh/files/events/wmconf_product_summit_slides_publish.pdfGooglebot & Web Hosting Claiming Your Knowledge Panel. Session Improving Search Over

Claiming your knowledge panel

Lauren ClarkProduct Manager

Alex HsuProduct Manager

Page 125: Webmaster Conferenceservices.google.com/fh/files/events/wmconf_product_summit_slides_publish.pdfGooglebot & Web Hosting Claiming Your Knowledge Panel. Session Improving Search Over

Billions of searches take place on Google every year, and many of those are about people and organizations.

When they find you, how do you want to introduce yourself?

Page 126: Webmaster Conferenceservices.google.com/fh/files/events/wmconf_product_summit_slides_publish.pdfGooglebot & Web Hosting Claiming Your Knowledge Panel. Session Improving Search Over
Page 127: Webmaster Conferenceservices.google.com/fh/files/events/wmconf_product_summit_slides_publish.pdfGooglebot & Web Hosting Claiming Your Knowledge Panel. Session Improving Search Over

Claim your knowledge panel

Page 128: Webmaster Conferenceservices.google.com/fh/files/events/wmconf_product_summit_slides_publish.pdfGooglebot & Web Hosting Claiming Your Knowledge Panel. Session Improving Search Over

Find your person or business and look for a prompt to claim

Page 129: Webmaster Conferenceservices.google.com/fh/files/events/wmconf_product_summit_slides_publish.pdfGooglebot & Web Hosting Claiming Your Knowledge Panel. Session Improving Search Over

Update your info

Page 130: Webmaster Conferenceservices.google.com/fh/files/events/wmconf_product_summit_slides_publish.pdfGooglebot & Web Hosting Claiming Your Knowledge Panel. Session Improving Search Over

Businesses that complete their informationare twice as likely to be considered reputable by consumers.

Page 131: Webmaster Conferenceservices.google.com/fh/files/events/wmconf_product_summit_slides_publish.pdfGooglebot & Web Hosting Claiming Your Knowledge Panel. Session Improving Search Over

Confidential + Proprietary

Suggest updates to inaccurate or outdated information in your knowledge panel

Page 132: Webmaster Conferenceservices.google.com/fh/files/events/wmconf_product_summit_slides_publish.pdfGooglebot & Web Hosting Claiming Your Knowledge Panel. Session Improving Search Over

Confidential + Proprietary

Add any missing information - like your hours, customer service number, and street address.

Page 133: Webmaster Conferenceservices.google.com/fh/files/events/wmconf_product_summit_slides_publish.pdfGooglebot & Web Hosting Claiming Your Knowledge Panel. Session Improving Search Over

Confidential + Proprietary

Make sure your website and other web profiles are up to date

Page 134: Webmaster Conferenceservices.google.com/fh/files/events/wmconf_product_summit_slides_publish.pdfGooglebot & Web Hosting Claiming Your Knowledge Panel. Session Improving Search Over

Engage searchers

Page 135: Webmaster Conferenceservices.google.com/fh/files/events/wmconf_product_summit_slides_publish.pdfGooglebot & Web Hosting Claiming Your Knowledge Panel. Session Improving Search Over

Confidential + Proprietary

Create content to engage and inform searchers on Google

Page 136: Webmaster Conferenceservices.google.com/fh/files/events/wmconf_product_summit_slides_publish.pdfGooglebot & Web Hosting Claiming Your Knowledge Panel. Session Improving Search Over

Get Started

Page 137: Webmaster Conferenceservices.google.com/fh/files/events/wmconf_product_summit_slides_publish.pdfGooglebot & Web Hosting Claiming Your Knowledge Panel. Session Improving Search Over

People and Brands Businesses

Page 138: Webmaster Conferenceservices.google.com/fh/files/events/wmconf_product_summit_slides_publish.pdfGooglebot & Web Hosting Claiming Your Knowledge Panel. Session Improving Search Over

Thank you

Page 139: Webmaster Conferenceservices.google.com/fh/files/events/wmconf_product_summit_slides_publish.pdfGooglebot & Web Hosting Claiming Your Knowledge Panel. Session Improving Search Over

Improving Search Over the Years

Paul HaahrDistinguished Engineer, Search

Page 140: Webmaster Conferenceservices.google.com/fh/files/events/wmconf_product_summit_slides_publish.pdfGooglebot & Web Hosting Claiming Your Knowledge Panel. Session Improving Search Over

Ranking Case Studies

Page 141: Webmaster Conferenceservices.google.com/fh/files/events/wmconf_product_summit_slides_publish.pdfGooglebot & Web Hosting Claiming Your Knowledge Panel. Session Improving Search Over

How do Google Search engineers think about ranking problems?

Page 142: Webmaster Conferenceservices.google.com/fh/files/events/wmconf_product_summit_slides_publish.pdfGooglebot & Web Hosting Claiming Your Knowledge Panel. Session Improving Search Over

I could try to explain a step-by-step methodology...

Page 143: Webmaster Conferenceservices.google.com/fh/files/events/wmconf_product_summit_slides_publish.pdfGooglebot & Web Hosting Claiming Your Knowledge Panel. Session Improving Search Over

But there isn’t one.

Page 144: Webmaster Conferenceservices.google.com/fh/files/events/wmconf_product_summit_slides_publish.pdfGooglebot & Web Hosting Claiming Your Knowledge Panel. Session Improving Search Over

There are many.

Page 145: Webmaster Conferenceservices.google.com/fh/files/events/wmconf_product_summit_slides_publish.pdfGooglebot & Web Hosting Claiming Your Knowledge Panel. Session Improving Search Over

And they involve a lot of debugging, experimentation,

evaluation, guesswork, research, and (often) luck.

Page 146: Webmaster Conferenceservices.google.com/fh/files/events/wmconf_product_summit_slides_publish.pdfGooglebot & Web Hosting Claiming Your Knowledge Panel. Session Improving Search Over

Instead, here are some examples...

Page 147: Webmaster Conferenceservices.google.com/fh/files/events/wmconf_product_summit_slides_publish.pdfGooglebot & Web Hosting Claiming Your Knowledge Panel. Session Improving Search Over

Synonyms and Siblings

Page 148: Webmaster Conferenceservices.google.com/fh/files/events/wmconf_product_summit_slides_publish.pdfGooglebot & Web Hosting Claiming Your Knowledge Panel. Session Improving Search Over

Google’s Synonyms System

• User vocabulary ≠ Document vocabulary

• System tries to bridge the gap by automatically adding alternative words

• Similar to using OR, but usually less important than original terms

• One of Google Search’s most important ranking components

Page 149: Webmaster Conferenceservices.google.com/fh/files/events/wmconf_product_summit_slides_publish.pdfGooglebot & Web Hosting Claiming Your Knowledge Panel. Session Improving Search Over

An example...

[cycling tours in italy]

[cycling OR cycle OR bicycle OR bike OR biking

tours OR tour OR holidays OR vacation

in

italy OR italian]

Page 150: Webmaster Conferenceservices.google.com/fh/files/events/wmconf_product_summit_slides_publish.pdfGooglebot & Web Hosting Claiming Your Knowledge Panel. Session Improving Search Over

Contextual: Synonyms depend on other query words

[gm truck] ⇒ “general motors”

[gm barley] ⇒ “genetically modified”

[baseball gm salary] ⇒ “general manager”

Page 151: Webmaster Conferenceservices.google.com/fh/files/events/wmconf_product_summit_slides_publish.pdfGooglebot & Web Hosting Claiming Your Knowledge Panel. Session Improving Search Over

Not the same as English Synonyms

• Designed to find good search results

• Hidden behind the scenes (mostly)

• Unimportant whether they’re actually synonyms to a human reader

Page 152: Webmaster Conferenceservices.google.com/fh/files/events/wmconf_product_summit_slides_publish.pdfGooglebot & Web Hosting Claiming Your Knowledge Panel. Session Improving Search Over

But...For a short time in 2005, Google’s top result for [united airlines] was continental.com

(The two companies did merge in 2010, but it wasn’t our fault.)

(We hope.)

Page 153: Webmaster Conferenceservices.google.com/fh/files/events/wmconf_product_summit_slides_publish.pdfGooglebot & Web Hosting Claiming Your Knowledge Panel. Session Improving Search Over

Why? Synonyms

(And a couple of unrelated bugs that I’m not going to talk about.)

[united airlines]

[united OR continental

airlines OR air OR airline]

Page 154: Webmaster Conferenceservices.google.com/fh/files/events/wmconf_product_summit_slides_publish.pdfGooglebot & Web Hosting Claiming Your Knowledge Panel. Session Improving Search Over

How do we fix things?

• We want algorithmic solutions

• Don’t just manually block the problems

• Look for patterns of failures

Page 155: Webmaster Conferenceservices.google.com/fh/files/events/wmconf_product_summit_slides_publish.pdfGooglebot & Web Hosting Claiming Your Knowledge Panel. Session Improving Search Over

Synonyms sometimes finds siblings

• We can learn pairs of words that serve similar roles but aren’t interchangeable

• Consider pairs of searches:[united reservations][continental reservations]

[united newark airport][continental newark airport]...

• “Siblings” (often rival siblings!)

Page 156: Webmaster Conferenceservices.google.com/fh/files/events/wmconf_product_summit_slides_publish.pdfGooglebot & Web Hosting Claiming Your Knowledge Panel. Session Improving Search Over

Can we distinguish siblings from useful synonyms?

• Again, look to searches people do

• People compare siblings to each other:[united vs continental][canon vs nikon][beatles vs stones][godzilla vs king kong]...

• Look for [X vs Y] queries from logs, use as a negative signal for X⇒Y and Y⇒X synonyms

Page 157: Webmaster Conferenceservices.google.com/fh/files/events/wmconf_product_summit_slides_publish.pdfGooglebot & Web Hosting Claiming Your Knowledge Panel. Session Improving Search Over

Then comes the hard part

• Process logs, build data, run experiments, evaluate the results, tune, repeat...

• Eventually, find many other synonym failures:cat⇒dogpart time⇒full time

• But we also lost some good synonyms:sign in⇒sign onaddress⇒contact

Page 158: Webmaster Conferenceservices.google.com/fh/files/events/wmconf_product_summit_slides_publish.pdfGooglebot & Web Hosting Claiming Your Knowledge Panel. Session Improving Search Over

Lessons

Understanding patterns of failures can reveal solutions

By not patching over algorithmic problems manually, we get more general solutions

Every change has wins and losses

Page 159: Webmaster Conferenceservices.google.com/fh/files/events/wmconf_product_summit_slides_publish.pdfGooglebot & Web Hosting Claiming Your Knowledge Panel. Session Improving Search Over

Non-Compositional Compounds

Page 160: Webmaster Conferenceservices.google.com/fh/files/events/wmconf_product_summit_slides_publish.pdfGooglebot & Web Hosting Claiming Your Knowledge Panel. Session Improving Search Over

Information Retrieval

Information Retrieval is mostly about matching and counting words

• Including title vs body, links, frequency, etc

This is the basic underpinning of Search.

Page 161: Webmaster Conferenceservices.google.com/fh/files/events/wmconf_product_summit_slides_publish.pdfGooglebot & Web Hosting Claiming Your Knowledge Panel. Session Improving Search Over

Relevance comes from matching words

Consider these pages, which are good matches for [new york hotels]

• Title, body, links, etc

Page 162: Webmaster Conferenceservices.google.com/fh/files/events/wmconf_product_summit_slides_publish.pdfGooglebot & Web Hosting Claiming Your Knowledge Panel. Session Improving Search Over

But sometimes it’s a bit too simplistic

Are they good matches for [york hotels]?

Page 163: Webmaster Conferenceservices.google.com/fh/files/events/wmconf_product_summit_slides_publish.pdfGooglebot & Web Hosting Claiming Your Knowledge Panel. Session Improving Search Over

Compounds

A compositional compound is “a phrase of two or more words where the words composing the phrase have the same meanings in the compound as their conventional meanings”.

A non-compositional compound is one where the meanings differ.

Page 164: Webmaster Conferenceservices.google.com/fh/files/events/wmconf_product_summit_slides_publish.pdfGooglebot & Web Hosting Claiming Your Knowledge Panel. Session Improving Search Over

New York

“New York” is non-compositional. Even though it is formed by compounding “New” and “York,” there’s nothing York-related now.

Not all place names follow the same rule

• “York” is not “New York”• “Vegas” is “Las Vegas”

Page 165: Webmaster Conferenceservices.google.com/fh/files/events/wmconf_product_summit_slides_publish.pdfGooglebot & Web Hosting Claiming Your Knowledge Panel. Session Improving Search Over

Can we identify non-compositional compounds?

Algorithm:

• Start with a set of “X Y” phrases• Look at pages where “X Y” occurs• If “X” or “Y” only appears in “X Y” on

most of those pages, guess that it’s non-compositional

For “new york”:

• “new” appears alone on many• “york” appears alone on very few

Page 166: Webmaster Conferenceservices.google.com/fh/files/events/wmconf_product_summit_slides_publish.pdfGooglebot & Web Hosting Claiming Your Knowledge Panel. Session Improving Search Over

Matching NCCs

Now that we have non-compositional compounds, what do we do?

Specialized matching code:

[york hotels]⇒

[(ignore_left:new york) hotels]

• Meaning: “Don’t match ‘york’ if the word to the left is ‘new’.”

Page 167: Webmaster Conferenceservices.google.com/fh/files/events/wmconf_product_summit_slides_publish.pdfGooglebot & Web Hosting Claiming Your Knowledge Panel. Session Improving Search Over

Lessons

Edge case: would be very hard to predict in advance, but obvious to the first person who tries this query

Once seen, it’s obvious there is a general pattern here

[fantasy game] is not “final fantasy”

[view office] is not “mountain view”

...

Hard work is done offline, ahead of time

Small change in matching code

Page 168: Webmaster Conferenceservices.google.com/fh/files/events/wmconf_product_summit_slides_publish.pdfGooglebot & Web Hosting Claiming Your Knowledge Panel. Session Improving Search Over

😀🐳😱🙏🏻

Page 169: Webmaster Conferenceservices.google.com/fh/files/events/wmconf_product_summit_slides_publish.pdfGooglebot & Web Hosting Claiming Your Knowledge Panel. Session Improving Search Over

Language Evolves Over Time

If you received “🤣” in a text in 1996, when Google launched at Stanford, would you have known it meant “rolling on the floor, laughing”?

Probably not. The first emoji appeared in 1997.

Page 170: Webmaster Conferenceservices.google.com/fh/files/events/wmconf_product_summit_slides_publish.pdfGooglebot & Web Hosting Claiming Your Knowledge Panel. Session Improving Search Over

Why Search for Emoji?

• People use emoji all the time

• But, often, they’re not sure what they mean exactly

• So, they search for them

Page 171: Webmaster Conferenceservices.google.com/fh/files/events/wmconf_product_summit_slides_publish.pdfGooglebot & Web Hosting Claiming Your Knowledge Panel. Session Improving Search Over

Emoji in Search

Unfortunately, for a long time, Search ignored emoji and other “special characters”

• “Nobody searches for them”• Expensive to index if they’re not used

What happened?

[😀] didn’t find anything[smiley face 😀] sort-of worked[😀 meaning] found dictionaries

Page 172: Webmaster Conferenceservices.google.com/fh/files/events/wmconf_product_summit_slides_publish.pdfGooglebot & Web Hosting Claiming Your Knowledge Panel. Session Improving Search Over

Index/Query Alignment

Changes to what Search indexes are complicated, because they need to go in the right order:

• First, update indexing to allow emoji• Wait for documents to be reindexed• Then, change query parsing

But first:

• Prove the cost is worth it!• Even though they didn’t work, people were

using emoji in >1 million searches per day

Page 173: Webmaster Conferenceservices.google.com/fh/files/events/wmconf_product_summit_slides_publish.pdfGooglebot & Web Hosting Claiming Your Knowledge Panel. Session Improving Search Over

But...

After data was in the index and query parsing was fixed, we ran evaluations, with very negative results.

Lots of other systems and models that needed to be updated before launch:

• Link processing• Spelling• Autocomplete• ...

Page 174: Webmaster Conferenceservices.google.com/fh/files/events/wmconf_product_summit_slides_publish.pdfGooglebot & Web Hosting Claiming Your Knowledge Panel. Session Improving Search Over

While we were at it...

Also added math and other symbols

• [∞]• [∑x]• [P≠NP]

And someone else did (some) punctuation

• [+=]• [== vs ===]• [P!= NP]

Page 175: Webmaster Conferenceservices.google.com/fh/files/events/wmconf_product_summit_slides_publish.pdfGooglebot & Web Hosting Claiming Your Knowledge Panel. Session Improving Search Over

Lessons

Things which look easy from the outside can be a lot of work to implement

All the assumptions you bake into your code can change over 20 years

Page 176: Webmaster Conferenceservices.google.com/fh/files/events/wmconf_product_summit_slides_publish.pdfGooglebot & Web Hosting Claiming Your Knowledge Panel. Session Improving Search Over

The Search Emoji Team