Wednesday, December 5, 2012

Official: Google Panda Refresh On November 5th (Version 21)


We have a confirmed Google Panda update that happened this past Monday, November 5th.
This would be the 21st Panda refresh or update we have on record. Before that, we had version 20 on September 27th, which was about five-plus weeks ago.
Danny Sullivan confirmed the update at Search Engine Land. Typically I'd try to post about this in more real time or before they are confirmed, but I've been offline for a bit.
This refresh affects about 0.4% of queries to a degree that a regular user might notice, and about 1.1% of English-language queries in the United States.
Resources by http://searchengineland.com/google-releases-panda-update-21-138902

Friday, October 5, 2012

Google Webmaster Guidelines Get An Update

Remember when an update to Google’s Webmaster Guidelines was spotted early, before Google pulled it back down? Well, Google has now officially updated them.

“Both our basic quality guidelines and many of our more specific articles (like those on link schemes or hidden text) have been reorganized and expanded to provide you with more information about how to create quality websites for both users and Google,” explains Google’s Search Quality team.

“The main message of our quality guidelines hasn’t changed: Focus on the user,” the team adds. “However, we’ve added more guidance and examples of behavior that you should avoid in order to keep your site in good standing with Google’s search results.”

For More Details: http://support.google.com/webmasters/bin/answer.py?hl=en&answer=35769

Friday, August 17, 2012

The Emanuel Update: Google Will Penalize Sites Repeatedly Accused Of Copyright Infringement

Has someone filed a large number of DMCA “takedown” requests against your site? If so, look out. That’s the latest penalty that may cause you to rank lower in Google’s search results. It joins other penalties such as “Panda” and “Penguin.” We’re dubbing it the “Emanuel Update” in honor of Hollywood mogul Ari Emanuel, who helped prompt it.

Hollywood Hates Google

Google has had no lack of criticism from the entertainment industry over the past few years, criticism that’s also caused some content owners to hold back on doing deals with the company.
 One of the most recent flare-ups came at the D Conference earlier this year, when Ari Emanuel, co-CEO of William Morris Endeavor, groused that Google could filter out pirated content if it really wanted to. He said:
I don’t want them to censor results, but they have a bunch of smart guys there that can figure this stuff out….
Look, Google can filter and does filter for child pornography. They do that already. So stealing is a bad thing, and child pornography is a bad thing.
It’s well worth watching the video of his discussing this, which starts around 47 minutes in here. Google later responded saying things weren’t so easy. Said Google, as covered in AllThingsD:
“I think he was misinformed, very misinformed,” Google’s Susan Wojcicki responded today onstage. “We do not want to be building a business based on piracy.” As compared to child porn, which people can generally recognize on sight, Wojcicki said, “When I see content, I don’t know if you own the copyright.”

Detecting Pirates

But as it turns out, there is a way that Google can guesstimate if there’s copyright infringement happening, by making use of Digital Millennium Copyright Act “takedown” requests.
These requests are one of the ways to get content removed from Google. Anyone can file a request. It’s not proof of copyright infringement. It’s merely an allegation, and one that can be challenged. But Google evaluates each request, and if deemed valid, content is removed.
The requests are a pain to file, and they only remove an individual web page. If you’re a big entertainment company, it’s like playing Whac-A-Mole. But now, Google’s shift will change the game from a page-by-page basis to a site-by-site one. Beginning next week, a site with a lot of requests against individual pages will find all of its pages ranking lower in Google. From today’s post:
Starting next week, we will begin taking into account a new signal in our rankings: the number of valid copyright removal notices we receive for any given site. Sites with high numbers of removal notices may appear lower in our results.
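To picture the shift from page-by-page to site-by-site, here is a rough sketch that rolls individual notice URLs up into a per-site tally. Everything in it is an assumption made for illustration: the `notices_per_site` helper, the sample URLs and the idea of grouping by hostname are mine. Google has not said how the signal is actually computed.

```python
from collections import Counter
from urllib.parse import urlparse


def notices_per_site(notice_urls):
    """Count removal notices per hostname (hypothetical aggregation)."""
    return Counter(urlparse(url).netloc.lower() for url in notice_urls)


# Made-up page-level notices; each one targets a single URL.
notices = [
    "http://example-a.test/movie-1",
    "http://example-a.test/movie-2",
    "http://example-a.test/song-9",
    "http://example-b.test/clip",
]

counts = notices_per_site(notices)
for host, total in counts.most_common():
    print(host, total)
```

The point is only that notices filed against single pages can be summed into one number per site, and that number is what the new signal is described as looking at.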

Why Now?

Why not do this before? Personally, my own feeling is that Google — now a content distribution company that really wants partnerships — has finally decided it needs to deal with the embarrassing situation of pirated content showing up in its results (this happens at Bing, too, but Hollywood generally doesn’t care about that). For its part, Google says the change is only now happening because it finally has the data it needs:
Since we re-booted our copyright removals over two years ago, we’ve been given much more data by copyright owners about infringing content online. In fact, we’re now receiving and processing more copyright removal notices every day than we did in all of 2009—more than 4.3 million URLs in the last 30 days alone. We will now be using this data as a signal in our search rankings.

What’s A Valid Request?

But what if someone files a complaint that Google upholds, even if the complaint isn’t legitimate? Actually, that will happen. “Valid,” as best I can tell, simply means that someone filed the right paperwork and that Google didn’t receive a counter-challenge. From the post:
Only copyright holders know if something is authorized, and only courts can decide if a copyright has been infringed; Google cannot determine whether a particular webpage does or does not violate copyright law. So while this new signal will influence the ranking of some search results, we won’t be removing any pages from search results unless we receive a valid copyright removal notice from the rights owner. And we’ll continue to provide “counter-notice” tools so that those who believe their content has been wrongly removed can get it reinstated.
Then again, it also seems that even when the paperwork is valid, Google on occasion (about 3% of the time) still refuses to honor a request. It gives some sometimes funny examples of this here.

Who Will Get Hit (& Why YouTube Won’t)

If you want a sense of what sites are likely to be hit, Google told us that this page at its Google Transparency Report site, showing the domains with the most requests against them, is a starting place.

Google clarified, however, that just because a site is on that page doesn’t mean it will be hit with a penalty. It’s just a general guide, said Google, to what it means when it talks about sites getting a lot of notices.
It’s important to note that the page only shows removals from web search. What’s missing? From the site:
  • Requests for products other than Google Search (e.g, requests directed at YouTube or Blogger) are not included.
  • Requests submitted by means other than our web form, such as fax or written letter are not included.
YouTube itself is the target of many DMCA requests and removes content all the time. Technically, it should be prone to being downgraded by the Emanuel Update, as a site that has many DMCA requests against it. In reality, that’s unlikely to happen. To understand more about this, see our separate articles, How YouTube Will Escape Google’s New Pirate Penalty & Google: Many Popular Sites Will Escape Pirate Penalty, Not Just YouTube.

The Emanuel Penalty?

That leads back to the name. Reacting to complaints about content farms and poor quality content clogging its search results, Google released the Panda Update in February 2011. It is run periodically (on a roughly monthly basis) to decide if there is new content that should be penalized or poor sites that have improved.
This year, in April, Google released the Penguin Update, another filter. This one also runs periodically and is especially designed to go after sites that overtly spam Google.
Google hasn’t given the coming DMCA-based update a name. It doesn’t always name its updates and filters, and I’m sure it won’t in this case. So, we’ve taken the lead.
Google might argue, as it has done with Penguin, that Emanuel isn’t even a penalty at all but rather an “adjustment.” The sites hit by Emanuel won’t be penalized. They just won’t be as rewarded when the new system kicks in.
Adjustment or not, my guess is that it will feel like a penalty to the sites hit. They’ll drop from the first page of search results and effectively be invisible. Chances are (I’m checking on this), this will be a signal that’s periodically checked, so that if a site seems to have received fewer requests over time, it might see its rankings get restored.
Postscript: Google said it was too early to detail how the process will work and that it will be “adjusting as we go.”

False Accusations?

Meanwhile, the crazy world of Google’s search results gets even crazier. Now, aside from worries that people might point bad links at a site to hurt it, publishers can worry about DMCA requests, as well.
At least with the DMCA requests, they’re far harder to file and put the person doing the targeting on the record.
Public Knowledge, a digital rights group, has come out with a blog post that’s fairly well-rounded, voicing concerns but also reassurances. From the conclusion:
If Google’s new policy helps users find legitimate sources of content, protects the valid interests of copyright holders, and doesn’t penalize lawful sites, then it’s a win all around. But any new system such as this has potential dangers and unintended consequences, and can be abused. Google is undoubtedly aware of this–but it remains to be seen how it will respond to problems that arise, and whether it will continue to put the interests of users first.
The EFF, another digital rights group, is more worried. In a blog post today, it says in part:
In particular, we worry about the false positives problem. For example, we’ve seen the government wrongly target sites that actually have a right to post the allegedly infringing material in question or otherwise legally display content. In short, without details on how Google’s process works, we have no reason to believe they won’t make similar, over-inclusive mistakes, dropping lawful, relevant speech lower in its search results without recourse for the speakers.
Takedown requests are nothing more than accusations of copyright infringement. No court or other umpire confirms that the accusations are valid (although copyright owners can be liable for bad-faith accusations). Demoting search results – effectively telling the searcher that these are not the websites you’re looking for – based on accusations alone gives copyright owners one more bit of control over what we see, hear, and read.
It’s worth noting that this change won’t drop the sites from Google. It’ll just make them harder to find, almost certainly for common terms people might use, like the name of a movie or a song, perhaps for things like the name of a movie or song plus a file format. But for those who learn to dig the right way, they’ll still uncover these sites, at least pages from the sites that haven’t been formally removed by DMCA requests.

The Major Search Change Omitted From Google’s Press Event

One thing I find very disturbing. Google trotted out a major press event earlier this week to talk about all that’s new in its world of search. We got told about things like a new Google Search app for iOS and a trial of Gmail results within regular search.
But there was no time to mention Emanuel? No time to cover with the assembled press what’s arguably the biggest search news Google announced this week?
Google told me the details for the DMCA filter were only finalized this morning. Even if so, it feels like this could have been discussed at the press event, rather than pushed off to a Friday afternoon. That comes across as Google hoping that those who worry about this new system being abused, or about some type of censorship, won’t notice.
Oh, they’ve noticed. But come Monday, they’ll have moved on to the latest iPhone rumor. If Google’s going to get a black eye over this, chances are, it will be brief.

Making Hollywood Happy

Don’t get me wrong. There are some good reasons why this might be helpful. I’m not in favor of pirated content getting rewarded in search results, especially when so much of it can be linked to crappy ads and potential malware.

I sure think, like others, that Hollywood and the entertainment industry itself would solve much of the problem by making more content available. The Oatmeal’s “I Tried To Watch Game Of Thrones” comic is a great illustration of this.
But this is probably a necessary move by Google to get the entertainment dinosaurs to do more. At least they can’t just keep blaming Google rather than their archaic distribution models. The MPAA, by the way, has already praised it. From The Guardian:
The move was welcomed by the entertainment industry. Michael O’Leary, senior executive vice-president for global policy at the Motion Picture Association of America, said in a statement: “We are optimistic that Google’s actions will help steer consumers to the myriad legitimate ways for them to access movies and TV shows online, and away from the rogue cyberlockers, peer-to-peer sites and other outlaw enterprises that steal the hard work of creators across the globe.”
The RIAA is also favorable, though it headlines its statement, incorrectly, as a plan to “better prioritize” licensed music. From the statement:
Today Google has announced a potentially significant change in its search rankings that can make a meaningful difference to creators:  sites that are the subject of large numbers of copyright removal notices may be ranked lower in search results than before.  This should result in improved rankings for the licensed music services that pay artists and deliver fans the music they love. This change is an important step in the right direction – a step we’ve been urging Google to take for a long time – and we commend the company for its action….But by taking this common-sense step and treating copyright in a way that’s consistent with the search firm’s approach to other forms of activity on the Internet, Google has signaled a new willingness to value the rights of creators.  That is good news indeed.  And the online marketplace for the hundreds of licensed digital services embraced by the music business is better today than it was yesterday.
One thing’s for certain. Hollywood, including Ari Emanuel, will be hearing about Google’s move to make this change over and over again in the weeks to come, directly from Google, as content deals continue to be hammered out.
By the way, if you’re really trying to understand all this algorithm, update and filter stuff, see our Search Engine Land’s Guide To SEO and Periodic Table Of SEO Ranking Factors pages. Also see the articles below for more background on some of the topics I’ve covered.


Friday, June 8, 2012

History of Past Panda Updates

•    Panda 3.6 --- on April 27th
•    Panda 3.5 --- on April 19th
•    Panda 3.4 --- on March 23rd
•    Panda 3.3 --- on about February 26th
•    Panda 3.2 --- on about January 15th
•    Panda 3.1 --- on November 18th
•    Panda 2.5.3 --- on October 19/20th
•    Panda 2.5.2 --- on October 13th
•    Panda 2.5.1 --- on October 9th
•    Panda 2.5 --- on September 28th
•    Panda 2.4 --- in August
•    Panda 2.3 --- on around July 22nd
•    Panda 2.2 --- on June 18th or so
•    Panda 2.1 --- on May 9th or so
•    Panda 2.0 --- on April 11th or so
•    Panda 1.0 --- on February 24th

Friday, March 23, 2012

Alleged POSITIVE ON-Page SEO Google Ranking Factors

The term "Keyword" below refers to the "Keyword Phrase", which can be one word or more.
Green rows confirmed by Google patent

Note / Patent Claim # | Factor # | POSITIVE ON-Page SEO Factors | Brief Note
50 | - | KEYWORDS | Google patent - Topic extraction. For keyword selection, try Google Ad Words - Google Trends
HOT | 1 | Keyword in URL | First word is best, second is second best, etc.
HOT | 2 | Keyword in Domain name | Same as in page-name-with-hyphens

Note / Patent Claim # | Factor # | POSITIVE ON-Page SEO Factors | Brief Note
- |  | Keywords - Header |
HOT | 3 | Keyword in Title tag | Keyword in Title tag - close to beginning. Title tag 10 - 60 characters, no special characters.
- | 4 | Keyword in Description meta tag | Shows theme - less than 200 chars. Google no longer "relies" upon this tag, but will often use it.
- | 5 | Keyword in Keyword metatag | Shows theme - less than 10 words. Every word in this tag MUST appear somewhere in the body text. If not, it can be penalized for irrelevance. No single word should appear more than twice. Otherwise, it may be considered spam. Google purportedly no longer uses this tag, but others do.
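The header-tag rules of thumb above are easy to turn into a quick self-check. The sketch below is only an illustration: the `check_head_tags` function and its warning strings are made up here, and the limits are the alleged ones from the table (title 10 - 60 characters, description under 200 characters, keywords tag under 10 words, no word more than twice), not anything Google has published.

```python
def check_head_tags(title, description, keywords):
    """Return warnings for the title, description and keywords tags.

    The thresholds are the alleged rules of thumb from the table above.
    """
    warnings = []
    if not 10 <= len(title) <= 60:
        warnings.append("title should be 10-60 characters")
    if len(description) >= 200:
        warnings.append("description should be under 200 characters")

    # Treat commas as separators so "pet, dogs" and "pet dogs" count alike.
    words = keywords.replace(",", " ").lower().split()
    if len(words) >= 10:
        warnings.append("keywords tag should be under 10 words")
    repeated = sorted({w for w in words if words.count(w) > 2})
    if repeated:
        warnings.append("repeated more than twice: " + ", ".join(repeated))
    return warnings


good = check_head_tags(
    "Pet supplies for small dogs",
    "Food, toys and beds for small dogs.",
    "pet, dogs, toys",
)
bad = check_head_tags("Pets", "x" * 250, "dog dog dog")
print(good)
print(bad)
```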

Note / Patent Claim # | Factor # | POSITIVE ON-Page SEO Factors | Brief Note
- |  | Keywords - Body |
- | 6 | Keyword density in body text | 5 - 20% - (all keywords/ total words). Some report topic sensitivity - the keyword spamming threshold % varies with the topic.
- | 7 | Individual keyword density | 1 - 6% - (each keyword/ total words)
HOT | 8 | Keyword in H1, H2 and H3 | Use Hx font style tags appropriately
- | 9 | Keyword font size | "Strong is treated the same as bold, italic is treated the same as emphasis" . . . Matt Cutts July 2006
- | 10 | Keyword proximity (for 2+ keywords) | Directly adjacent is best
- | 11 | Keyword phrase order | Does word order in the page match word order in the query? Try to anticipate query, and match word order.
- | 12 | Keyword prominence (how early in page/tag) | Can be important at top of page, in bold, in large font
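The two density figures in factors 6 and 7 are simple ratios: each keyword's count over total words, and the sum of those over total words. Here is a minimal sketch of that arithmetic. It assumes single-word keywords and a crude regex tokenizer, both of which are simplifications chosen for this example, and the `keyword_density` name is hypothetical.

```python
import re


def keyword_density(text, keywords):
    """Return (overall %, {keyword: %}) for single-word keywords."""
    words = re.findall(r"[a-z0-9']+", text.lower())
    total = len(words)
    per_keyword = {}
    for kw in keywords:
        count = words.count(kw.lower())
        per_keyword[kw] = 100.0 * count / total if total else 0.0
    # Overall density = all keyword occurrences / total words.
    overall = sum(per_keyword.values())
    return overall, per_keyword


# Ten words in total: "dog" appears three times, "beds" once.
body = "dog beds and dog toys for the small dog owner"
overall, per_keyword = keyword_density(body, ["dog", "beds"])
print(overall, per_keyword)
```

On this tiny sample both figures land well above the alleged 5 - 20% and 1 - 6% ranges, which is what keyword stuffing looks like in numbers.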

Note / Patent Claim # | Factor # | POSITIVE ON-Page SEO Factors | Brief Note
- |  | Keywords - Other |
- | 13 | Keyword in alt text | Should describe graphic - Do NOT fill with spam. (Was part of Google Florida OOP - tripped a threshold - may still be in effect to some degree as a red flag, when summed with all other on-page optimization - total page optimization score - TPOS).
- | 14 | Keyword in links to site pages (anchor text) | Links out anchor text use keyword?

Note / Patent Claim # | Factor # | POSITIVE ON-Page SEO Factors | Brief Note
- |  | NAVIGATION - INTERNAL LINKS |
SITE | 15 | To internal pages- keywords? | Link should contain keywords. The filename "linked to" should contain the keywords. Use hyphenated filenames, but not long ones - two or three hyphens only.
SITE | 16 | All Internal links valid? | Validate all links to all pages on site. Use a free link checker. I like this one.
SITE | 17 | Efficient - tree-like structure | TRY FOR two clicks to any page - no page deeper than 4 clicks
SITE | 18 | Intra-site linking | Appropriate links between lower-level pages
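The "no page deeper than 4 clicks" guideline in factor 17 can be checked with a breadth-first walk over the site's internal links. This is a sketch under stated assumptions: the `click_depths` helper and the sample `site` map are invented for illustration, and a real site would need a crawler to build the link map first.

```python
from collections import deque


def click_depths(links, home):
    """Breadth-first search: fewest clicks from the home page to each page."""
    depths = {home: 0}
    queue = deque([home])
    while queue:
        page = queue.popleft()
        for target in links.get(page, []):
            if target not in depths:
                depths[target] = depths[page] + 1
                queue.append(target)
    return depths


# Hypothetical link map: page -> pages it links to.
site = {
    "/": ["/dogs", "/cats"],
    "/dogs": ["/dogs/beds"],
    "/dogs/beds": ["/dogs/beds/large"],
    "/dogs/beds/large": ["/dogs/beds/large/orthopedic"],
    "/dogs/beds/large/orthopedic": ["/dogs/beds/large/orthopedic/memory-foam"],
}

depths = click_depths(site, "/")
too_deep = sorted(page for page, depth in depths.items() if depth > 4)
print(too_deep)
```

Pages that never show up in `depths` are not reachable from the home page at all, which is worth flagging separately.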

Note / Patent Claim # | Factor # | POSITIVE ON-Page SEO Factors | Brief Note
54 | - | NAVIGATION - OUTGOING LINKS |
55 | 19 | To external pages- keywords? | Google patent - Link only to good sites. Do not link to link farms. CAREFUL - Links can and do go bad, resulting in site demotion. Unfortunately, you must devote the time necessary to police your outgoing links - they are your responsibility.
56 | 20 | Outgoing link Anchor Text | Google patent - Should be on topic, descriptive
61, 62 | 21 | Link stability over time | Google patent - Avoid "Link Churn"
- | 22 | All External links valid? | Validate all links periodically.
- | 23 | Less than 100 links out total | Google says limit to 100, but readily accepts 2-3 times that number. ref 2k
- | 121 (added) | Linking to Authority | Some say this gives a boost - Others say that is absurd. However, it certainly is the opposite of linking to trash, which WILL hurt you.
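Policing outgoing links starts with knowing how many a page has. The sketch below collects external links using Python's standard `html.parser`. The `LinkCounter` class, the sample markup and the host names are assumptions made for this example, and the 100-link ceiling is the alleged figure from factor 23.

```python
from html.parser import HTMLParser
from urllib.parse import urlparse


class LinkCounter(HTMLParser):
    """Collect hrefs that point at a host other than our own."""

    def __init__(self, own_host):
        super().__init__()
        self.own_host = own_host
        self.external = []

    def handle_starttag(self, tag, attrs):
        if tag != "a":
            return
        href = dict(attrs).get("href")
        if not href:
            return
        host = urlparse(href).netloc
        # Relative links have no host, so they count as internal.
        if host and host != self.own_host:
            self.external.append(href)


page = (
    '<a href="http://other.test/a">x</a>'
    '<a href="/local">y</a>'
    '<a href="http://mysite.test/b">z</a>'
    '<a href="http://third.test/">w</a>'
)
counter = LinkCounter("mysite.test")
counter.feed(page)
over_limit = len(counter.external) >= 100
print(len(counter.external), over_limit)
```

The collected `external` list is also the natural input for a periodic validity check of the kind factor 22 recommends.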

Note / Patent Claim # | Factor # | POSITIVE ON-Page SEO Factors | Brief Note
- |  | OTHER ON-Page Factors |
- | 24 | Domain Name Extension (Top Level Domain - TLD) | .gov sites seem to be the highest status. .edu sites seem to be given a high status. .org sites seem to be given a high status. .com sites excel in encompassing all the spam/ crud sites, resulting in the need for the highest scrutiny/ action by Google. Perhaps one would do well with the new .info domain class. - Nope. Spammers jumped all over it - no safe haven there. Not so much, now - .info sites can rank highly.
- | 25 | File Size | Try not to exceed 100K page size (however, some subject matter, such as this page, requires larger file sizes). Smaller files are preferred <40K (lots of them).
- | 26 | Hyphens in URL | Preferred method for indicating a space, where there can be no actual space. One or two= excellent for separating keywords (i.e., pet-smart, pets-mart). Four or more= BAD, starts to look spammy. Ten = Spammer for sure, demotion probable?
6, 7, 12, 13 | 27 | Freshness of Pages | Google patent - Changes over time. Newer the better - if news, retail or auction! Google likes fresh pages. So do I.
8, 9 | 28 | Freshness - Amount of Content Change | New pages - Ratio of old pages to new pages
27 | 29 | Freshness of Links | Google patent - May be good or bad. Excellent for high-trust sites. May not be so good for newer, low-trust sites.
- | 30 | Frequency of Updates | Frequent updates = frequent spidering = newer cache
- | 31 | Page Theming | Page exhibit theme? General consistency?
- | 32 | Keyword stemming | Stem, stems, stemmed, stemmer, stemming, stemmist, stemification
- | 33 | Applied Semantics | Synonyms, CIRCA white paper
- | 34 | LSI | Latent Semantic Indexing - Speculation, no proof
- | 35 | URL length | Keep it minimized - use somewhat less than the 2,000 characters allowed by IE - less than 100 is good, less is even better
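The hyphen and URL-length guidelines in factors 26 and 35 reduce to counting characters. Below is a small sketch of both checks. The function names and rating labels are invented for this example, three hyphens is marked "unrated" because the table does not say how it scores, and counting hyphens in both host and path is my own reading of "Hyphens in URL".

```python
from urllib.parse import urlparse


def rate_hyphens(url):
    """Rate a URL by hyphen count, following the alleged thresholds above."""
    parsed = urlparse(url)
    hyphens = (parsed.netloc + parsed.path).count("-")
    if hyphens >= 10:
        return "spammy"
    if hyphens >= 4:
        return "bad"
    if hyphens == 3:
        return "unrated"  # the table gives no verdict for exactly three
    if hyphens >= 1:
        return "excellent"
    return "none"


def url_length_ok(url, limit=100):
    """True when the URL is shorter than the 'less than 100 is good' mark."""
    return len(url) < limit


short_url = "http://example.test/pet-smart"
print(rate_hyphens(short_url), url_length_ok(short_url))
```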

-

OTHER ON-SITE Factors

5

36

Site Size - Google likes big sites --

Larger sites are presumed to be better funded, better organized, better constructed, and therefore better sites. Google likes LARGE sites, for various reasons, not all positive. This has resulted in the advent of machine-generated 10,000-page spam sites - size for the sake of size. Google has caught on and dumped millions of pages, or made them supplemental.

4

37

Site Age

Google patent - Old is best. Old is Golden.

3

38

Age of page vs. age of site

Age of page vs. age of other pages on site
Newer pages on an older site will get faster recognition.

Note: For ALL the POSITIVE On-Page factors listed above, PAGE RANK can OVERRIDE them all. So can Google-Bombing.