Search Ads Evaluation General Guidelines

Search Ads Evaluation: General Guidelines
Version: 20150820
READ INSTRUCTIONS CAREFULLY!
These ratings use newly updated ads rating guidelines that are different from earlier versions of the
guidelines. If you have not read the guidelines since the version date listed above, you must carefully read or
review all instructions before rating.

user query
Search ad rating involves interpreting a . A user query is the set of keywords that a
user enters into the Google search engine. When rating a search ad, perform the following
steps:
1. Review the Google search results page, try to understand the user query, and form an
opinion about what the user hopes to accomplish by using a search engine.
2. Use the evaluation criteria found in the following instructions to analyze an
advertisement and the advertising experience the user will have if he or she clicks on the
ad.

User Intent
user intent
An understanding of the is necessary to accurately rate a search ad. The user intent
is what the user hopes to accomplish by using the Google search engine. Note that users use
the search engine to look for a variety things, and there are many user intents.

Some queries are very easy to understand, others are more difficult, and some may seem
impossible to understand. Regardless of its meaning, you must research the query and form an
opinion about the user intent. We strongly advise you to review the Google search results page
to determine user intent. In order to objectively determine how promising or unpromising an
advertiser offering is for a particular user query, it is important to form an opinion about the user
intent before beginning an analysis of the advertisement.

Queries with Multiple Meanings

If a query has multiple meanings, please consider that all meanings can be placed on a
spectrum between plausible meanings and highly implausible meanings. When analyzing an
advertiser offering, consider what meaning the advertiser uses and where it falls along this
spectrum. This will help you determine the appropriate search ad rating.
Plausible Meanings
If a query has several plausible meanings, it is important to consider them all. If an advertiser
assumes a particular meaning in an ad or on a landing page, and it is a reasonable meaning,
assume that is the meaning that the user intended.

Refer to the following example to better understand plausible meanings:

User query: [ java ]

This query could refer to an island, coffee, or a computer language. With no
additional information available, it is impossible to say which meaning the user
intended. Ads that respond to any of these meanings are acceptable since all
three meanings are reasonably plausible.

Possible but Unlikely Meanings

If an ad or landing page assumes a meaning that is possible but not very likely, this a
secondary interpretation
. An ad or landing page that addresses only a secondary
interpretation of the query is given a lower rating than an ad that addresses a plausible
Secondary Interpretation of Query
meaning. Use the flag in this case. Generally, rate an ad
or landing page that responds to a secondary interpretation negatively.

Refer to the following example to better understand possible but unlikely meanings:

User query: [ paris ]

While there are a number of cities called Paris, unless there is some reason to
think that one of the smaller cities is meant, a query mentioning Paris is probably
referring to Paris, France. So, an ad for hotels in Paris, Texas instead of Paris,
France is probably incorrectly comprehending the user intent. Even if the ad is
otherwise a good one, rate it on the negative side of the scale and use the
flag.
Implausible Meanings
If an ad or landing page assumes a meaning that is completely implausible, treat it as
Secondary Interpretation
completely wrong and choose a very negative rating. Do not use the
of Query
flag if the meaning is clearly implausible.

Refer to the following example to better understand implausible meanings:

User query: [ paris ]

The query probably refers to the city of Paris, France. If an advertiser interprets
the meaning to be plaster of paris, it is almost certainly not addressing the user
query. Use a very negative rating.

Misspelled Queries
Users often misspell queries. When evaluating a query, if it is clear what the user means, and
the misspelled version of the query has no meaning, ignore the misspelling.

Analysis is more difficult if the query appears to be a misspelling, but the misspelled version has
a unique meaning. First consider the query as the user entered it, and then consider if it may be
misspelled. If advertisers respond to misspellings, ratings may need to be adjusted.

Refer to the following example to better understand misspelled queries:

User query: [ goodnight moom ]

Goodnight Moon. I
There is a famous children’s book called t is very possible
that the user means to type [ goodnight moon ] but types [ goodnight moom ]
Goodnight Moo
instead. However, there is actually a novel titled m
. While the
novel is quite obscure, it might be what the user wants.

Advertiser Responds to Actual Spelling in Query

If the advertiser assumes that the query is correct as it stands (in the example above, assumes

the user meant [ goodnight moom]), treat the advertiser’s query interpretation as acceptable.
You would then need to decide separately how promising the ad or landing page are.

Advertiser Responds to a Plausible Correction of Spelling in Query
If the advertiser assumes that the query is misspelled and addresses a corrected version of the
n
query (in the example above, assumes that the user mistyped and meant [goodnight moo]),
judge for yourself whether this was a good assumption. If you think it was a reasonable
assumption, the ad and landing page are treated as if this were the user intent. Don’t modify
Secondary Interpretation
your scores to account for the spelling correction, and don’t use the
of Query
flag.

Advertiser Responds to a Possible but Unlikely Correction of Spelling in

Query
If the advertiser assumes a corrected spelling that you think is possible but not very likely, this is
secondary interpretation.
a An ad or landing page that addresses only a secondary
interpretation of the query is given a lower rating than if it had responded to a likely or plausible
meaning. The flag must be used. An ad or landing page
that responds to a secondary interpretation is generally rated negatively.

Implausible Spelling Correction

If the advertiser’s interpretation of the query is based on a completely unreasonable
Secondary
assumption, treat it as completely wrong and give it a very negative score. The
Interpretation of Query
flag is not used in this case.

Continuing with the previous [ goodnight moom ] example, both query interpretations are
reasonable. The only way to know this is to research the query and analyze how the advertiser
interprets it.
Queries for which a Reasonable Ad is Impossible

Sometimes a query is either so hard to interpret or so noncommercial in nature that no ad will
–
be a good match. Be careful in these casesrate the ad and landing page according to how
—
well they actually respond to the query, and do not worry about how hard it would be to show an
appropriate ad for that query. Do not give an ad positive ratings if a better ad for the query
cannot be determined or if it seems like it was a good try. Rate it positively only if it addresses
the query intent. If the query intent cannot be determined, the ad must be rated negatively. For
example, a query of [ www ] or [ when did ] is not complete enough to serve a proper ad.
Unrateable Queries
In some rare cases, a query may appear that is the result of an error in how the task was added
to the evaluation system. For example, a query may appear in the incorrect rating language, or
a query of jumbled characters may appear that, after research, has no discernible meaning. Do
not attempt to provide AC or LP ratings for queries like this. Instead, use one of the flags
provided in the Query Flag section. These flags are only present on the first page of a Search
Ads task, the Ad Creative rating section.
Approximate Query Location

Many tasks will include a map indicating the geographical region where the query was entered.
If you are unfamiliar with the location in the map, you can zoom in and out of the map to
familiarize yourself with it. Knowing the approximate location where the query was entered may
help you decide how relevant the Ad Creative or Landing Page is to a user. Some ads will be
more relevant given the location, some will be less relevant. For some ads, knowing the location
will not make any difference.
When Location Matters

Sometimes the query, the ad, or both may refer to a specific geographic location. Even when
the approximate query location is available, it can sometimes be difficult to determine how to
handle a locationspecific advertisement.

Location Mismatch Between Query and Approximate Query Location
If the query contains geographical information, you should use the information available in the
query instead of the map, especially if the two conflict. For example, if the query is [pizza in
new york], but the map indicates Los Angeles, pizza restaurants in Los Angeles are a bad
result.
Query Specifies a Location but Ad Does Not

If a query specifies a location, take it into account when evaluating the ad. Sometimes research
is required to determine whether the product or service is compatible with the location. This
research is required before you can choose an Ad Creative rating score.

User query: [ pizza Santa Monica ]

If an ad pointed to the main Round Table Pizza chain homepage, but didn’t
mention this California city, this might initially seem useful. However, if, upon
using the location finder on the site, there are no locations within a reasonable
distance of Santa Monica, this ad is probably not useful.
Ad Specifies Location but Query Does Not

If the query does not specify a location and the approximate query location is also not available,
then evaluate the ad as if the user were in an appropriate location. For example, if the query is [
pizza ] and the ad is for a pizza restaurant in Barstow, California, assume that the user was in
Barstow, California.
Neither Query Nor Ad Specify a Location

If the query does not specify a location and the ad does not either, then assume that the user
can be anywhere in the target country of the rating language. Ignore the approximate query
location, if one is provided.
Location Mismatch Between Query and Ad

Assuming there is a match between the product or service and the query and ad, take the
location proximity into account in evaluating an ad. If the ad for a given query specifies a
different and incompatible location, this makes it a worse ad.

User query: [ pizza Santa Monica ]

If the ad is for a pizza restaurant in Manhattan, this is very unpromising.
However, an ad returned for pizza in a different but nearby location, like a
neighboring town, could be useful, and this ad might not be as bad as the
previous example. Use common sense to determine if the ad exceeds a
reasonable distance for the user, and an acceptable distance may vary
depending on the query. For example, a user may be willing to travel farther to
buy a new car than to get a haircut or go to the supermarket. For certain queries,
serving an ad with a completely different location may still be promising. For
example, if a user in the United States is looking for [ vacation in Australia ], then
an ad for “vacation in New Zealand” is not necessarily a bad ad since the user is
likely to be willing to travel a long distance for a vacation.

Search Ad Rating: How Promising is this Ad?

Evaluate the Ad Creative. A promising ad is one that seems like it will give the user what he or
she wants. An unpromising ad looks like it will be disappointing, unhelpful, dangerous, or
irrelevant.

Use the slider bar to select from four possible ratings:
■ Very Promising
■ Somewhat Promising
■ Somewhat Unpromising
■ Very Unpromising
Very Promising and Somewhat Promising are positive ratings: use them for ads that look like
they’re good ads for the user to see that would be worth clicking on. Somewhat Unpromising
and Very Unpromising are negative ratings: use them for ads that look like they’re bad ads for
the user that aren’t worth clicking on. Consider the following factors while evaluating an ad:

Satisfying the User

Does the ad offer something that will satisfy the user? An ad that has nothing to do with what
the user wants is always very unpromising. An ad cannot be promising if it is not relevant.
However, being relevant is not enough to make an ad good.

Correct Meaning of the Query

Does the ad address the correct meaning of the query? An ad for a car dealership does not
address the query [ cars 2 ]that is the name of a movie. Even if it would be a good ad for
—
some other carrelated query, it is a completely terrible and unpromising ad for that query.

Clarity and General Appeal

Is the ad written in a clear, appealing way? An ad that makes sense and does not have
mistakes, hardtounderstand language, or awkward phrasing can be good; and, an ad that
looks stupid, looks like it was written by a machine, is unintentionally funny, or just does not
make sense is usually bad. Does the ad clearly state what the advertiser offers? A good ad is
easy to understand. A bad ad may be overly vague or may not communicate enough
information to conclude the ad will lead to a positive advertising experience.

Potentially Scammy Ad
Does the ad look like a scam? An ad that seems too good to be true, sleazy, or deceptive to
users is usually bad.

Promise of Additional Links

Some ads contain multiple links to different sections of the website. You do not need to click on
these links, and when you are rating the creative, you cannot click on them. However, if these
links look promising or useful, this may be a reason to increase your ad creative score. If links
look unpromising, confusing, or useless, this may be a reason to decrease the ad creative
Ads with Additional Features
score. See the section for more detailed guidance.

Advertiser is Different Merchant or Provider from Query

Sometimes a query will specify both a product or service and a particular merchant or provider.
If the ad offers the desired product or service from a merchant or provider that is not the one
specified in the query, it should not be considered a negative user experience unless there is
another issue with the ad.

Analyzing the Advertiser Visible URL

The web address (also called the visible URL) displayed in the ad can provide clues about how
promising or unpromising an ad may be. The visible url can affect your evaluation in the
following three ways:
● If you are familiar with the advertiser based on the URL displayed in the ad, you
may use your background knowledge when rating the ad. Just use the “Used Prior
Knowledge In Judging Advertiser” flag.
● If you aren't familiar with the merchant, assume the merchant is legitimate, even if
that's not how you behave in your own online activities. Important note: this only
applies if there's nothing in the URL that looks suspicious (see next bullet point).
● If the URL itself makes you suspicious, don’t hesitate to mark the ad bad. For
example, an ad for online book shopping that looks very good except that the URL
m
of the merchant is www.amazo.com is pretty suspiciousthat looks like the
merchant is trying to trick you into thinking you’re going to amazon.com. This is
likely a scammy ad, and if you think an ad is scammy, it deserves a bad rating.
(Other tricks of this sort in addition to misspellings in URLs include adding random
numbers, unexpected extensions, or subdomains to create URLs like
www.amazon22.com, windows7onet.in, or windows7.customersupportus.net)

Do not assume an ad is promising just because it contains the same words as the query.
Do more than note that the words match— machines can tell us this. We need human
judgment: tell us whether a human being will find an ad appealing. If a user is looking for [
blue pants ] an ad that says “PANTS BLUE BLUE PANTS www.bargainautoparts.com” is
likely a bad ad even though it has the words “blue pants.”

Distinguishing Between Very Promising and Somewhat Promising

If an ad looks like it will lead to a page that satisfies the user intent, it deserves a rating of
Somewhat Promising or Very Promising. A good ad deserves one of the ratings described in the
following two sections.
Very Promising
A Very Promising ad should look like it points to a page where a user can find almost exactly
what is described in the query. If the user is looking for a particular product, the ad should
appear to point to a page for that product. If the user seeks a particular kind of store, the ad
should appear to point to a store of that kind. If the landing page of a Very Promising ad does
not satisfy the user intent, it will be a surprise and a disappointment to the user.
Somewhat Promising
A Somewhat Promising ad should also appear to take the user to a page where the product he
or she is looking for can be found; however, rather than appearing to point to a page where the
user can find
exactly
what he or she wants, a Somewhat Promising ad might do one of the
following things:
● It might look like it points in the right direction but not exactly at the target. For
example, if the user seeks a specific model of camera, an ad that looks like it will
point to a reputable camera store’s main page is Somewhat Promising.
● It might look like it points to something that might satisfy the user intent but is not
exactly what he or she wanted. For example, if the user seeks a particular model of
camera, a Somewhat Promising ad might point to a slightly different but reasonably
similar model of camera.
Sometimes it is just not possible to be confident about what the user seeks. If an ad seems to
point in the right general direction but there is no way to tell exactly what the user wanted,
Somewhat Promising is the highest possible rating.
Distinguishing Between Positive and Negative Ad Creative Rating

Evaluate the Ad Creative and weigh the criteria for Very Promising/Somewhat Promising
against the criteria for Somewhat Unpromising/Very Unpromising. If you find that positive
elements and negative elements both seem applicable to the creative you’re evaluating, ask
yourself which side of the positive/negative division seems to be a more reasonable fit and
choose a rating on that side.
Distinguishing Between Somewhat Unpromising and Very

Unpromising
It is especially important to distinguish between ads that are simply bad and ads that are very
bad for the user entering the query. The following section provides additional guidance to
distinguish between the Somewhat Unpromising and Very Unpromising ad creative rating.
Somewhat Unpromising
A Somewhat Unpromising ad generally isn’t a great ad to show the user, but it is likely that
some subset of users may find it useful.
● Even if the creative is not of the exact same topic as the query, as long as there is
some clearly related task or intent, there are some users who may find the creative
appealing. One example is if the user seeks [ weight loss pills ] and the ad is for
“diet tips” or “exercise machines”. These types of ads should be rated as Somewhat
Unpromising. They don’t directly provide what the user is looking for, but could be
somewhat useful to the user so don’t deserve the lowest ratings.
● If it is not really clear whether users will find the ad useful, rate it as Somewhat
Unpromising. One example is if a user is searching for some information and the ad
asks the user to search for the same information again elsewhere. It is hard to
know in these cases whether the ad will be able to provide anything useful to the
user, since he/she is being asked to repeat the same action again possibly just to
get similar results. Please view the Google search results for the query to get an
understanding of what the user currently sees and what information he/she currently
has access to. If you believe that the ad won’t provide any additional information
from what is already presented to the user, rate it as Somewhat Unpromising.
However, if you believe that clicking the ad will provide additional useful information
to the user, don’t rate it as Somewhat Unpromising give it a higher rating. One
example that would deserve a higher rating than Somewhat Unpromising is if the
user is searching to buy a particular item and the ad is asking the user to search for
that particular item across numerous stores and merchants. Another example that
would again deserve a higher rating is a query for some specific industrial
machinery part, and an ad inviting the user to repeat the search on a search engine
devoted to machine part sales and manufacture.
● Sometimes a query specifies a location, and the ad targets a different location. For
When Location Matters
these specific examples, please refer to the section.

Very Unpromising
There are several cases where Very Unpromising is the only appropriate rating.
● Very Unpromising ads have
no
reasonable chance of satisfying the user. Try to put
yourself in the user's mindset is it possible at all that the creative offers something
useful to the user? If there is no reason at all to think that the user will find the
creative useful, rate it Very Unpromising. (Note: you might think “It’s always
possible
that someone might find
anything
useful, even though it has nothing to do
with the query.” Don’t go that far!)
● If the creative looks like a scam, or leads the user to harm, rate it as Very
Unpromising.
● MachineGenerated Ads
If the creative falls into one of the categories listed in the
section, rate it as Very Unpromising.
● If the creative promises to do the impossible, such as selling a person or city, rate it
Very Unpromising.
● Just because there is a strong term overlap between the query and creative does
not mean the ad is a good match for the query. If the user is searching for
[ homeowners insurance ] and the ad is for “medical insurance,” the user will very
likely not find the creative useful and you should rate it as Very Unpromising.

Machine‐generated Ads

Some ads are partially autogenerated to take words from the query and place them in the
creative text. There is nothing wrong with this in itself. For example, if the query is [ xbox 360
used ] and the creative says “Buy a used xBox 360 on eBay,” that’s a good ad. Unfortunately,
sometimes these machinegenerated ads turn out very badly. Very Unpromising ad creatives
may have some of the following issues:

Things offered that cannot be bought

User query: [ san diego, ca ]

An ad that says “Buy San Diego cheap on eBay” is ridiculousyou can’t buy a
city. Ads that are unintentionally ridiculous, horrible, or offensive, by suggesting
that you can buy concepts, human beings, body parts, criminal acts, or similar
things are Very Unpromising.

Part of the query removed, substantially changing the meaning

User query: [ roses lime juice ]

An ad that offers the action, “Buy roses,” has left out so much of the query that
the entire meaning has changed. By taking only part of the text of the query what
remains substantially changes the meaning.

Part of the query removed, resulting in overly general ad

User query: [ how do i remove gum from satin ]

An ad that offers “Get information on how to remove,” is nearly meaningless: too
much has been removed from the query. By taking only part of the text of the
query, the result is far too general to be promising for the user query.

Nonsensical, jumbled, or ungrammatical ad creative

User query: [ how do i remove gum from satin ]

An ad that says “Search for how do I remove gum” or “Find how do I remove gum
from satin” is awkward and ungrammatical. Ads that end up nonsensical,
jumbled, or ungrammatical because a query has been crammed into a space
where it doesn’t really belong is Very Unpromising.

Be on the lookout for these
. If you’re not paying close attention to how the ads actually look
and sound, it can be easy to think these look finebut to a user who is actually reading the text,
—
they can look laughable, annoying, or foolish, and in some cases, deeply offensive or hurtful.
Even those that just look sort of silly or awkward are very bad.

Ads with Additional Features

Some ad creatives are just text and a single link to the advertiser page. Other pages contain
additional features that mayor may not
provide something valuable to the user. Ad creatives

may contain maps, videos, images, star ratings from customers, multiple links to specific pages
on the advertiser site, and a variety of other features.

These additional features may affect the quality of an ad creative. If the special features add to
or detract from the appeal, informativeness, or usefulness of an ad, the Ad Creative score can
be raised or lowered.

For an ad that contains only text and a single link to an advertiser page, use only the previous
criteria in making a decision. For ads that contain anything in addition to text and a one link,
consider the following factors, and decide whether to raise or lower your score:
● If an ad does not deserve a score of Somewhat Promising or Very Promising based
on the previous criteria, be cautious about giving it a positive rating just because of
additional features.
● An ad that deserves a score of Somewhat Promising or Very Promising based on
the previous criteria can be given a negative rating if additional features detract from
it severely.
● Use common sense when deciding whether additional features improve or detract
from an ad enough to move it between Somewhat Promising and Very Promising
scores in each category.
● An ad that is scammy or harmful can never be improved by additional features.
● Additional feature should relate to the user intent in a sensible way. If the user is
looking for information about a current movie, a movie trailer in the ad creative
relates to the user in a sensible way, but a map to the movie studio where the film
was made does not. The trailer probably improves the experience, but the map
detracts from it.
● Where an additional feature is relevant to the user intent, it should be informative,
easy to use, and clear. If it is confusing, boring, or hard to figure out, it may either
detract from the experience or just fail to improve it.
● An ad may have multiple additional features. Consider all of them together when
determining your ad creative score.
● Raise or lower your rating by a small amount if the additional feature has little
impact on the ad creative. Raise or lower it a large amount, according to the
previous criteria, if the additional feature has a big impact.
● Do not consider the physical size of an ad creative rating it. If the size of an ad
creative causes it to display incorrectly in the ads rating interface, alert an
administrator but ignore the issue while rating it.

Search Ad Rating: Does Landing Page Satisfy User Intent?

Use the slider bar to select from the following four rating categories while determining how likely
it is that a landing page will satisfy the user intent. Only consider the user query and the landing
page, and ignore the ad creative completely.
■ Satisfaction Likely
■ Satisfaction Possible
■ Dissatisfaction Possible
■ Dissatisfaction Likely
Satisfaction Likely and Satisfaction Possible are positive ratings that satisfy the user query.
Dissatisfaction Possible and Dissatisfaction Likely are negative ratings that do not satisfy the
user query.

Only rate the landing page that opens after clicking on the Visit Landing Page button. Do
not base your score on pages that are accessible by clicking on links in the body of the ad
creative. NEVER copy and paste a link to visit the page, and NEVER manually change the
URL.

The fundamental principle of landing page evaluation is this: the user starts a search on
Google.com with a goal in mind. The user then enters a query and reviews Google’s search
results and ads. The user then clicks on the ad currently being reviewed, and that ad takes the
user to the landing page. Keep in mind that in order for a user to have a positive experience with
an advertiser landing page, he or she should be closer to the goal expressed in the query,
otherwise it is a negative experience. The section below helps frame how distance from the
user’s goal helps determine a landing page rating score.

Distance from the User’s Goal
Carefully review the Google search results page to determine the distance from the user’s goal.
Does the Landing Page move user closer to his or her goal, further from the goal, or neither
closer nor further from the goal?

If the user is
closer
to the goal, the landing page deserves a positive rating. For example, if the
user is hoping to buy a specific camera, and the landing page is a store offering that camera for
sale, the user has come closer to accomplishing his or her goal.

If the user is
further
from the goal, the landing page deserves a negative rating. If the user is
hoping to buy a specific camera, and the landing page is a store offering pet food, this is a dead
end. The user will need to go back to the search page or start a new search, so he or she is
actually further from the goal than before clicking on the ad.

If the user is
neither closer to nor further from
the goal, the landing page deserves a negative
rating. If the user is on a Google search results page and clicks on an ad that just takes them to
a page of similar search results, which overall did not provide any additional value, no progress
has been made; the user is no closer to or further from the goal than before clicking the ad.

Deciding this is not an exact science. Rely on good judgment. The following guidelines more
deeply explain how to generally rate landing pages, but they do not explain how to rate a
landing page in every situation.

Distinguishing Between Satisfaction Likely and Satisfaction

Possible
Satisfaction Possible and Satisfaction Likely are positive ratings. If the landing page offers the
user exactly or very nearly what he or she wants, use a Satisfaction Likely or Satisfaction
Possible rating.
Satisfaction Likely
To receive this rating, a landing page must offer just what the user looked for. If the user wants
car reviews, it should offer car reviews. If the user wants car reviews about a specific model, it
should offer car reviews about exactly that model. If the user wants a category of product, the
landing page should be devoted to or include that exact category of product. For a Satisfaction
Likely rating, what the user is looking for should be apparent with no additional action needed by
the user. It is permissible, however, to click on a link to get detailed information.
Satisfaction Possible
Use this rating if the page is satisfactory but does not immediately present exactly what the user
seeks. If the product or service is for sale on the site, but a search or straightforward navigation
is required to find the item, select a rating of Satisfaction Possible rather than Satisfaction
Likely. If the site offers a very plausible substitute for a particular product specified in the query,
it may receive a rating of Satisfaction Possible or lower. If the query is a search for information,
and this information can be found without too much trouble on the advertiser site but is not on
the landing page, use Satisfaction Possible. The one exception here being if the user could
have found that same information on the search results page before clicking on the ad. If that is
the case, the landing page does not deserve a positive rating.

Considering Trustworthiness
Do not give a landing page a Satisfaction Possible or Satisfaction Likely rating if you do not trust
the information found on that landing page or if you would not make a purchase from the
advertiser site. A page that offers the exact product that a user is looking for is useless unless
the user trusts it enough to actually make a purchase there. A seemingly trustworthy merchant
selling a particular camera at a particular price might deserve a better rating than a page that
clumsily aggregates a random set of products, even if the same camera at the same price is
offered on that page too. Similarly, a page offering the exact information that the user is looking
for is not useful if there is no reason to think that the information is correct. For example, if the
user seeks some medical information, a site belonging to a medical school is a good source of
trustworthy information while a blog post by an unknown person is a much more doubtful
source. Never use a rating of Satisfaction Likely or Satisfaction Possible if the page appears
scammy or harmful.

Specific Versus General: Mismatch Between Queries and Landing Pages

Sometimes when the query is for a specific product, the landing page is basically on target but
much broader or much more specific than the query.

If the landing page has the product specified in the query, but a huge number of other products
too, this may be a decent experience, but probably is not good enough to get into the
Satisfaction Likely range in most cases.

If the query is for something general, like [ camera ], the landing page might be extremely
specific. For example, a product page for a particular model of camera from a particular
manufacturer with a particular set of options. In this case, too, it might appear to be a decent
experience, but it probably is not good enough to get rated as Satisfaction Likely in most cases.

You may judge that in particular cases, the experience is better or worse than the guidelines
above would suggest. For example, if the page has a huge number of different products, but the
product in the query is clearly the most prominent and the first thing you see, you might decide it
deserves Satisfaction Likely; if it’s so buried in the other products that you don’t even realize it’s
there, you might decide it deserves a negative rating. Similarly, if the query looks general and
the landing page is for a very specific product, you might think that the product is so obviously
the best possible thing to offer for that query that it deserves Satisfaction Likely; on the other
hand, if the product is technically in the right category but very very unlikely to be what the user
wants (for example, an expensive antique camera requiring glass plates instead of film for the
query [ camera ]), you might decide that it deserves a negative rating.

Distinguishing Between Positive and Negative Landing Page

Rating

Evaluate the query and the Landing Page and weigh the criteria for Satisfaction Likely/Possible
against the criteria for Dissatisfaction Possible/Likely. If you find that positive elements and
negative elements both seem applicable to the landing page you’re evaluating, ask yourself
which side of the positive/negative division seems to be a more reasonable fit and choose a
rating on that side.
Distinguishing Between Dissatisfaction Possible and

Dissatisfaction Likely

Dissatisfaction Likely and Dissatisfaction Possible are negative ratings. If you think that the
user will have a negative experience, always use either Dissatisfaction Possible or
Dissatisfaction Likely. If you have no particular reason to think a page will interest the user,
always use either Dissatisfaction Possible or Dissatisfaction Likely.

Dissatisfaction Possible
● If the page is marginally related to the query and you think that there’s a small
chance the user would be interested, use Dissatisfaction Possible.
● If the page can eventually lead to what the user wants, but only through many clicks
or through clicks that lead to an entirely different website, use Dissatisfaction
Possible.
● If the page offers something that you think the user might be interested in, but not
what the user was looking for and not especially close to it, use Dissatisfaction
Possible. For example, if the user is looking for baseball gloves, and the landing
page offers athletic socks, there’s probably some chance that the user might be
interested. However, it’s not what the user was looking for, and not all that close to
it, so it deserves Dissatisfaction Possible.
● If the page can eventually give the user what he or she is looking for, but the
process is protracted and difficult, use Dissatisfaction Possible.

Dissatisfaction Likely
● If the page has nothing to do with the query, use Dissatisfaction Likely.
● If the query is for a product or service, and neither the product/service nor anything
close to it can be purchased from the page, use Dissatisfaction Likely.
● If the query or a word in the query has two meanings, it is clear which meaning is
intended by the user, and the advertiser responds to the wrong meaning, use
Dissatisfaction Likely. For example, [ cars 2 ] refers to a movie. A page for a car
dealership is clearly a bad landing page for this query, even if it might be a good
result for [ car sales ].
● If the page looks like a scam, you think users could be harmed by it, or it either
attempts to trick the user into downloading something by labeling a download button
in a confusing way or tries to download a file without action by the user, use
Dissatisfaction Likely.
● If the page loads but is completely unusable (for example, because some content
does not load, or page doesn’t display properly) use Dissatisfaction Likely. If
enough of the page does not load at all (for example, you encounter a 404 error),
use the Error Did Not Load flag instead of a rating.
● If the page is very bad for any other reason, use Dissatisfaction Likely.

Query Flags
Use these flags to indicate that a query is unrateable. This means that it, and the AC and LP
paired with it, are not eligible to be assigned ratings. A Search Ads query is unrateable if it has
one of the following problems:

● it is in a language other than the task language (Foreign Language)
● it is unambiguously pornographic or about sexual services (Porn)
● it is complete nonsense; research reveals no plausible meaning (Nonsense)
● it was transcribed incorrectly, using an English rather than Cyrillic keyboard for Russian
words (Russian Transcription Error)

If you use one of these flags, all of the later questions will turn gray and don’t need to be
Note that these flags are only on the first page of a Search Ads task, the Ad
answered.
Creative rating section.
Foreign Language Query

Use this flag when the query is in a language other than the language of the task. If the query
contains words or phrases in another language, but there is enough content in the task’s
language that it is understandable, do not use this rating. If the query appears to be in a foreign
language, but research reveals that the query term may be commonly used in your rating
language or is the name of a specific group, product, or person, do not use this rating.

Remember to check the language of the task, especially if you work in multiple languages. If
your rating language is English, you rate ads in English for English queries. If you rate in
another language, you will rate some tasks in that language and some tasks in English. Your
rating language is always designated at the top of the task page. Even if you speak the
language of the query, if the task is supposed to be for a different language, use this rating.

Porn Query
Use this flag only for queries that are unambiguously for pornographic content or sexual
services. Queries for racy or suggestive content, medical information, or art photos generally
shouldn’t get this rating. Queries for dating services generally shouldn’t get this rating, unless
those dating services depict nudity or specifically identify themselves as sexual rendezvous
services.

Nonsense Query
Use this flag for queries that are complete nonsense, where research reveals no plausible
meaning. As you research, take into consideration that queries that may look like nonsense
might actually turn out to be meaningful. The following are examples of queries that
do
have
meaning and
should not Nonsense Query
receive the flag:

● a misspelling
● a product code or model number
● technical specifications
● a partial web address or YouTube video ID
● a specific username or Twitter handle
● an uncommon acronym or abbreviation

Don’t assume that a query is nonsensical just because you do not immediately know what it
means. Encountering a completely nonsensical query is rare. Most queries mean something, so
you should always research the query, even if at first it seems like nonsense. Only use this
rating when there is no way for you to reasonably guess about user intent, even after
researching the query.

Russian Transcription Error

This flag applies only to raters working in Russian. If you are working in a language other than
Russian, this flag will never be applicable to your tasks, and you should not use it. If you are
working in Russian, you will receive separate instructions for determining when queries should
be considered transcription errors.

While you will not be able to assign AC, LP, or AC to LP ratings after using one of these Query
Flags, you will still need to submit the task for your answers to be recorded. You will submit your
responses directly from the first page of the task by clicking the Submit button at the bottom left
of the task.

Ad Creative Flags
If an Ad Creative meets the criteria for using one of the following flags, please use that flag. If
criteria are not met for a flag, do not use the flag.
Navigational Bullseye

Use the Navigational Bullseye flag when both these things are true:
1. The query appears to be a search for a particular website, section of a website, or web page.
2. The creative looks like it will point to the corresponding website, section of a website, or web
page.

Not every query is a search for a particular websitein fact, the vast majority are not.

The Navigational Bullseye flag should only be used where the frame of reference is similar or
compatible between query and creative. For example, with the query, [ ford explorer ], the
Navigational Bullseye would be used for creative that appears to take the user to the Ford
Explorer section of the Ford website; however, the flag would not be used if the creative
appeared to take the user to a different page on the Ford site (a page devoted to the Ford
Focus) or a general page on the Ford site (their homepage, for example).

Foreign Language

Use this flag when the creative is in a language other than the language of the task. Remember
to check the language of the task, especially if you work in multiple languages. Even if you
speak the language of the creative, if the task is supposed to be for a different language, use
this flag.

A creative should be legible in your rating language: if the creative contains words or phrases in
another language, but there is enough content in the task’s language that it is understandable,
do not use this flag and proceed with the normal creative rating.

If you use this flag, some of the later questions will turn gray and don’t need to be answered.

Unexpected Porn

Use this flag when both these things are true:

1. The query is not a search for pornographic content or sexual services. If the query has both a
pornographic interpretation and a nonpornographic interpretation, assume that the
nonpornographic interpretation is the actual user intent.
2. The creative appears to offer pornographic content or sexual services.

Use this flag only for unambiguously pornographic content or sexual services. Racy or
suggestive content with no nudity, nudity in a medical context, or art photos generally shouldn’t
get this flag. Dating services generally shouldn’t get this flag unless they depict nudity or
specifically identify themselves as sexual rendezvous services. A regular dating service may
deserve a bad rating if it doesn’t match what the query appears to be looking for, but it would
not get the flag.


Used Prior Knowledge In Judging Advertiser

Use this flag when knowledge you already had about the advertiser influenced your ratings,
either for good or bad. Use this only when your rating is different from what you think you would
have given seeing the ad for the first time with no prior experience. If a creative is clearly bad,
don’t use the flag even if you already happen to have confirmation that a bad rating is deserved.


Use this flag when the creative text indicates that the advertiser is targeting a clearly secondary
interpretation of the query. An interpretation is secondary if it’s reasonable, but there is some
other interpretation of the query that you consider much more likely. Don’t use this flag with
interpretations that are wrong or unreasonable. Don’t use this flag if you think that the query
has multiple, equally likely meanings, and the advertiser is targeting one of those meanings. Do
use the flag where the query has multiple, equally likely meanings and the advertiser targets an
obscure or lesslikely meaning. Please review the main guidelines for instructions on how to
select a scale rating when you use this flag.

Landing Page Flags
If a Landing Page meets the criteria for using one of the following flags, please use that flag. If
criteria are not met for a flag, do not use the flag.
Navigational Bullseye

Use the Navigational Bullseye flag when both these things are true:

1. The query appears to be a search for a particular website.
2. The landing page is that site.

Not every query is a search for a particular websitein fact, the vast majority are not.

Foreign Language

Use this flag when the landing page is in a language other than the language of the task, with
no obvious way of getting to a version in the language of the task. Remember to check the
language of the task, especially if you work in multiple languages. Even if you speak the
language of the page, if the task is supposed to be for a different language, use this flag.

Don’t use this flag if there is some clear way to get to a version in the target language. For
example, if you are rating a Japanese task, a landing page in English with a Japanese flag in
the corner pointing to a Japanese version of the site should not get this flag.


Unexpected Porn

Use this flag when both these things are true:

1. The query is not a search for pornographic content or sexual services. If the query has both a
pornographic interpretation and a nonpornographic interpretation, assume that the
nonpornographic interpretation is the actual user intent.
2. The landing page offers pornographic content or sexual services.

Use this flag only for unambiguously pornographic content or sexual services. Racy content with
no nudity, nudity in a medical context, or art photos generally shouldn’t get this flag. Dating
services generally don’t get this flag unless they depict nudity or specifically identify themselves
as sexual rendezvous services. A page with racy content, nudity in an art or medical context, or
dating services may deserve a negative rating if it doesn’t match what the query appears to be
looking for, but it shouldn’t get the flag.


Unexpected Download

Use this flag when any of the following happens:

1. Clicking on the Visit Landing Page button initiates an attempt to download a file.
2. Some link, button, or graphic on the landing page initiates a download when clicked, but does
not clearly indicate that it will do so. For example, a big red button that says “Enter site” or
“Check the weather,” but starts a download when clicked, deserves the flag. A similar button
that says “Get It Now” or “Click here to download” does not.

Never install downloads that a site tries to initiate in this way: it is not part of the rating process.


Error / Did Not Load

Your job when evaluating a search ads task is to evaluate content provided by the advertiser
(the ad creative and landing page). Use the Error/Did Not Load (EDNL) flag to indicate that you
cannot evaluate the landing page because there is no landing site content provided by the
advertiser. There are several reasons why you might not be able to access landing site content
provided by the advertiser, including:
● the page or site no longer exist
● the page or site are under construction
● your browser is not able to find or access the page we provided you
● your virus/malware protection software blocks you from accessing the site
● the landing page opens using a 3rdparty program (such as iTunes) that you do not
have installed

It’s not always easy to immediately determine if the EDNL flag should be used because different
things can happen when a landing page is not working properly. Here are some examples of
what you might see when no landing site content is available to evaluate:
● a completely blank page
● a generic Not Found message generated by your web browser
(example: https://www.googlenews.com/default.html)
● a generic error message generated by the advertiser’s server
(example: http://www.centraldopolidor.com.br/enceradeiras.htm)
● a generic webpage (often filled with affiliate links) shown by the hosting service in
place of the actual landing page
(example: http://genealogywise.com/?reqp=1&reqr=)
● a search results page shown by your internet service provider because the actual
landing page cannot be accessed
(example:
http://www.dnsrsearch.com/index_results.php?querybox=sdiwfkdis.com&submit=Se
arch)
● a notice that the site or page is under construction with no way to access any other
part of the landing site
(example: http://www.reidknorr.com/demos/vinta_ss/)

In all these cases you should use the EDNL flag because you cannot access any content from
the actual landing site to evaluate. In the first two examples, above, there is little or no content
to evaluate. In the last four examples, there may be content you can see, but it is either not
content from the landing page advertiser (e.g. the hosting service, browser, ISP), or the entire
advertiser site is inaccessible.

Note that a landing page could have an error on it but still have landing site content or a way to
access landing site content on the page. Here are some examples of things you might see
when there is an error on the page but advertiser content is still available to evaluate:
● a page which partially loads
● an error saying that the page could not be found, but linking to another part of the
landing site
● an error stating that the product could not be found, but page provides alternatives
or a way to search the landing site for other products
● a blank page or an error page that still has site navigation tools (usually on the top
or side)
● an error page which automatically redirects to and loads a working page on the
landing page advertiser site
● a landing page which is blocked by a registration form

If an advertiser landing page provides enough content to rate, don’t use the EDNL flag. In the
some
cases above, the flag is not used because there is at least advertiser content on the LP
upon which to base your evaluation on.

If you use this flag, some of the later questions will turn gray and won’t need to be answered.


Use this flag when the landing page content indicates that the advertiser is targeting a clearly
secondary interpretation of the query. An interpretation is secondary if it’s reasonable, but there
is some other interpretation of the query that you consider much more likely. Don’t use this flag
with interpretations that are wrong or unreasonable. Don’t use this flag if you think that the
query has multiple, equally likely meanings, and the advertiser is targeting one of those
meanings. Do use the flag where the query has multiple, equally likely meanings and the
advertiser targets an obscure or lesslikely meaning. Please review the main guidelines for
instructions on how to approach the scale rating when you use this flag.

If you have questions about this project that are not answered by the instructions above, please
review the Rater Hub first, which contains additional content about rating. If you encounter a
technical problem with this rating task, use the “Report A Problem" link in the lowerright hand
corner of the rating page.

Search Ads Evaluation General Guidelines

Uploaded by

Copyright:

Available Formats

Search Ads Evaluation General Guidelines

Uploaded by

Document Information

Original Description:

Copyright

Available Formats

Share this document

Share or Embed Document

Sharing Options

Did you find this document useful?

Is this content inappropriate?

Copyright:

Available Formats

Search Ads Evaluation General Guidelines

Uploaded by

Copyright:

Available Formats

Search Ads Evaluation: General Guidelines

Queries with Multiple Meanings

Possible but Unlikely Meanings

Advertiser Responds to Actual Spelling in Query

Advertiser Responds to a Possible but Unlikely Correction of Spelling in

Implausible Spelling Correction

Queries for which a Reasonable Ad is Impossible

Approximate Query Location

When Location Matters

Query Specifies a Location but Ad Does Not

Ad Specifies Location but Query Does Not

Neither Query Nor Ad Specify a Location

Location Mismatch Between Query and Ad

Search Ad Rating: How Promising is this Ad?

Satisfying the User

Correct Meaning of the Query

Clarity and General Appeal

Promise of Additional Links

Advertiser is Different Merchant or Provider from Query

Analyzing the Advertiser Visible URL

Distinguishing Between Very Promising and Somewhat Promising

Distinguishing Between Positive and Negative Ad Creative Rating

Distinguishing Between Somewhat Unpromising and Very

Ads with Additional Features

Search Ad Rating: Does Landing Page Satisfy User Intent?

Distinguishing Between Satisfaction Likely and Satisfaction

Specific Versus General: Mismatch Between Queries and Landing Pages

Distinguishing Between Positive and Negative Landing Page

Distinguishing Between Dissatisfaction Possible and

Foreign Language Query

Russian Transcription Error

Used Prior Knowledge In Judging Advertiser

Secondary Interpretation of Query

Landing Page Flags

Error / Did Not Load

Secondary Interpretation of Query

You might also like