Thursday 3 January 2013

Google Can Now Index… Flash!

Google Can Now Index… Flash!

As you now, a Flash movie as the index page of a site has always been amejor problem with search engine optimization. There’s simply no content for the search engines to index.

So when Ilearn that Google can index the contents of Macromedia Flash movies, I was astonished. It seemed this remarkable discovery had gone unnoticed in the SEO community, especially in light of Google’s upcoming IPO.

But as you probably know, Google has always been the first to index different types of content: PDF files, .doc files, etc. Google has also made amazing inroads in being able to index dynamic content.

And now they’re the first major search engine to index Flash – another significant step forward in the SEO industry.

So why has Flash presented such problems in the past?

Background of Macromedia Flash Movies and SEO

With a Flash movie as the main page of a site , the web site owner is giving up crucial text necessary to prove to the search engines that the main page is abut a perticular topic. Without that text,the search engines hav nothing to index. Therefore, the main page rarely close well in therankings, unless off-page factors such as link popularity or link reputation are sufficentto carry the page on there own.

In the past, legitimate work arounds have been few and far between. This made thing extremely difficult for business who wanted to create a rich user experience with a Flash home page, such as web design firms, photography studios, graphic design firms, and so forth.

So, these businesses often sacrificed rankings for the user experience, since they could rarely have both while stillfollowing all of the guidelines set forth by the search engines.

Introducing… Michael Marshall

When I learned that Google is indexcingFlash from Michel Marshall, creator of Theme Master (http://www.theme-master.com) and chat/ forum moderator for our online search engine marketing courses (http://www.onlinewebtraning.com), and when I learned of the facinating discoveries he’d made, I immediately wanted to interview him for an article.

So let’s take at look at what Michael has discovered about Google and Flash.

Question:

Michael, how do we know that Google is now indexing the contents of Flash files? Is there a wway that we can search the index just for Flash?

Michael Marshall:

Yes, You can enter your search term in Google, and along with that search term, use the filetype operator and restrict your search to the file extension “swf”. This will search for your search term only in Macromedia Flash files. You should see [Flash] just.

For example:

“Best Free Banner Exchange Market” filetype:swf

Question:

How can we extract the text found in a Flash file to see what Google sees?

Michael Marshall:

Macromedia has a Flash Search Engine SDK (http://www.macromedia.com/software/flash/download/search_engine/) that will give us just what we need. That SDK (Softwarre Development Kit)includes an application named ‘swf2html’. Swf2html extracts text and links form a Macromedia Flas SWF file, and returns the data to sidout or as an HTML document. Swf2html is provided as a compiled application and as a static library for linked library implimentation. For complete functionality, see the file Readme .htm included in the SDK.

Question:

Do you have an examle of a Flash file that we can see, as well as an example of the text that the Micromedia tool extracted from the Flash file?

Michael Marshall:

Yes. I have an example of each. If you look at the extreted output in Web page form, you will see that it is not very pretty. Nevertheless, you’hv got lots of SEO-worthly content there, and that’s what we are most concerned with. You should visit the Flash prresentation itself, mouse over the text , and click the links in the presentation so you can be familiar with the Flash presentation. You can compare where certain text appears in the Flash presentation and where it is found in the extracted text.

Example of Flash file:

http://www.internet-marketing-analysts.com/flash_sample.html

Example of extracted text:

http://www.internet-marketing-analysts.com/extracted_text.html (Note: This Flash example is based on one of Michael’s own products. However, I chose to use it for two reasons: 1) because of the many different types of Flash involved; and, 2) because it is a text-heavy Flash example, as opposed to many other examples of Flash that Icould have chosen to use. Added Note: Be sure to highlight the entire page of extercted text by clicking on Ctrl A.) In the output file, you’ll notice that some text seems to be repeated on multiple lines and one portion of it even appears invisible since the font color comes out white. This is just a side effect of the conversion/extraction tool and is not really invisible text or spaming.

In other words, you’re doing nothing wrong when this happens – it’s simply due to the tool itself and not spamming or true invisible text.

Question:

But how do we know that’s how Google sees it?

Michael Marshall:

A simple test will show us how much of the text in a Flash presentation can be seen (or extracted) by Google. Perform an exact search (and use the filetype operator) on some text which appears at the top of the html output from Macromedia’s tool, and then perform a simillar search for text that appears at the bottom. Try similar search on text that appears in the middle as well if you really want to be sure. This is a good spot check to see what Google is grabbing from the Flash file. Since we can’t know exactly what Google uses to read the Flash file, the micromedia tool is only an approximation, and this spot check is always the best measure.

Question:

How much of your Flash movie does Google see? In other words, how deep into the Flash file dose the spider go?

Michael Marshall:

In my experience testing the macromedia tool, I have found that Google sees all the text that the tool can extract including all links… everything from top to bottom.

Question:

You mentioned that when certain types of motion in a Flash movie are associated with text, the resulting extracted extracted output will contain duplication occurrences of the text.

Those techies among us will know what that means, but for those non-techies (like me), does this mean that we need to be careful about using certain types of animation, because it could result in duplicate content, therefore creating the possibility of spam or problems with our SEO efforts?

Michael Marshall:

Yes. The type of animation you apply to text in your Flash presentation has impact on how that text gets extracted. You wouldn’t want your keyword density or theme focus to get thrown off by mistake due to applying the wrong type of animation to certain text.

Question: When viewing the source of the HTML output extracted from your Flash file (see that source code found at the bottom of this page: http://www.internet-marketing-analysts.com/Google-Flash_tutorial/), there’s no title tag. What text does Google pull as the title tag in the search results?

Michael Marshall:

In my experiences, I have found that the first line of text in the extracted output gets by Google as the title tag in the search results. You may want to use swf2html and spot checkand modify your flash presentation until you get the desired result. In addition, the description in the search rsults is created dynamically (according to the user’s query) from snippets of text inside the Flash presentation as extracted by Google.

Question:

Do you have any other tips for optimizing Flash files?

Michael Marshall:

Yes. I would recommend that people read my more technical tutorial for more details on optimizing Flash file. (See below) One thing I would add is the problem that might be encountered by Flash presentations which use dynamic content pulled from a database, xml file, etc. bsed on user input. Such content is not part of the xml file itself and, therefore, will not be indexible by Google.

Question:

What about Flash banners? Will Google also index the contents of Flash banners?

Michael Marshall:

Yes. Any Flash presentation, whether full-page or banner size, can be indexed by Google. I have found many instantences of both. For the More Technical SEOs…

No comments:

Post a Comment