Optimizing Your Blog’s Indexable HTML Content for SEO

[This is part of the The Blogger’s Essential Guide to Search Engine Optimization Series.]

One of the best ways to understand how search engines work is to understand what they can and cannot index – in other words, what they can and cannot see.

We talked about some of the challenges and limitations of search engines in the previous post but now it’s time to focus on what they were built to do: Crawl your awesome content, store the information for future use and retrieval, and then produce the results when someone looks for the content that your specific blog provides.

We’re going to get a bit technical but nothing that should necessarily scare you – in fact, you’ll be amply supplied to learn what is most important for you as a blogger by the time this post (and this series) is complete.

Sure, you’re not a programmer nor a software developer nor have you created the blog application that you so fondly use every single day – but you’re a tried and true practitioner and you can benefit greatly with a little website construction 101.

Ready?

Blog Pages for Humans and Search Engines:

Becoming a blog architect isn’t as hard as you might think – in fact, simply knowing the basics of how a search engine crawls your site can help you optimally structure it more efficiently, right?

It’s like knowing the rubric, the map so to speak, of how to build that IKEA bookshelf that you bought without thinking:

Instructions make all the difference, right?

By knowing just a bit you can build that shelf faster and more efficiently with the instructions in-hand, right?

Developing an understanding of indexation by search will help you optimally structure your content for search engines – but make sure you’re also doing it (and seriously considering) the structure for people too. In this way you’ll develop a search engine-friendly blog that’ll return you pageviews and hits for as long as your blog lives.

I would like that, wouldn’t you?

Let’s Do Some HTML:

Search engines (all search engines) crawl your content looking for your content in it’s HTML-form, or text format. As we mentioned previously, images, flash, applets, java, iframes, plugins, and more are invisible to search engines and their bots that crawl your content. Sure, the content is still there but it’s not “there” for search engines.

What you need to make sure, then, is that you do all that you can to provide the search engines with content that can be crawled easily in the form of HTML text on each blog page.

There are some advanced ways that you can make those limited elements more open to being crawled:

  1. Images (.gif, .jpg, .png) can be given “alt attributes” via HTML or be replaced by text via CSS styling.
  2. Flash, java, applets, videos, audio, and more can have the content repeated via plain text on the page. In the case of video and audio the use of transcripts is vitally important.

So how do you know what is in HTML format and what is not? You can use a few tools to help you accomplish this very basic test:

1. Google Cache

Click the 'Cached' button.

You can easily see what the search engines see if you google your own blog and click the “Cached” button.

Then click the “Text-only Version” on the right:

And then you’ll see what search engines see:

What do you see when you do a test on your blog? Is everything there? Is there anything that doesn’t look quite right?

This very basic test can very quickly help you not only know what search engines see but also what you can do to optimize your blog architecture in such a way that will help search engines index your content better and more effectively.

2. SEO Browser

SEO Browser can also show you what a search engine sees as well. The “Simple” tool will show you the basics while the “Advanced” tool (after registration) can give you access to a few more results, like so:

More data via Advanced.

The same difference a Google Cache but it does give you another look so it’s worthy of a bookmark.

One neat thing that I discovered just doing this activity is that one of my “more tags” was wrapped in an H3 tag when it shouldn’t have been:

Wrong!

It should look like this:

Correct!

I quickly adjusted and corrected this. Wow, what a simple (and perfect) example of how this tool helped me catch an error.

But these tools can provide much more information (and strategy) than just correcting a more tag, right? Here was the offending code:

Whoops!

So over the next few blog posts I’ll dive into how we can specifically optimize elements of your blog, blog pages, and blog posts so that it’s can be better indexed by search engines.

See you in a bit and make sure you Subscribe to stay up to date with the next few blog posts!

[This is part of the The Blogger’s Essential Guide to Search Engine Optimization Series.]

  • Calvin Koepke

    Nice! I always wondered about that shtuff.

    • http://john.do John Saddington

      sure thing calvin!

  • Dewitt Robinson

    Learning.

    • http://john.do John Saddington

      constantly. ;)

  • http://noahsdad.com Rick Smith

    Man..this makes for great reading at 4 am! Thanks for posting all of this awesomeness!

    • http://john.do John Saddington

      why… are you up… so late……

  • http://www.tillhecomes.org Jeremy Myers

    Nice tips. I have learned to insert my “more” tag from within the HTML tab as the visual tab likes to place it in between h2 and h3 tags.

    • http://john.do John Saddington

      you sir are a champ!

  • Zimbrul

    Woww, this is one piece of knowledge and you can now look inside your site like a surgeon. Very useful indeed.

    • http://john.do John Saddington

      sure thing zimbrul!

      what does your gravatar mean?

  • http://thoughtsaboutnothing.com Kyle Reed

    this is good stuff

    • http://john.do John Saddington

      apparently you were caught by spam.

      • http://thoughtsaboutnothing.com Kyle Reed

        huh?

        • http://thoughtsaboutnothing.com Kyle Reed

          oh never mind I got you know.
          Man I didn’t even ask for a billion dollars or help for my dying grandmother.

          • http://john.do John Saddington

            ???

  • Joshua Chase

    Great post John. I have a question, in the screenshot of where you performed a search on your site, it displays your Site title and description, and below that it pulls links for you r min categories it looks like. Did you do something special with your sitemap.xml for that?

    • http://john.do John Saddington

      no, this happens overtime through google. if your site gets indexed well enough it’ll provide these shortlinks.

  • http://benrwoodard.com Ben

    I think this is some sort of Jedi SEO training.

    I’d love to see a Star Wars 7 with Luke becoming a blogging Jedi with mad SEO skills. Wouldn’t that be awesome?! He could compete against all the other Black Hat SEO spamming dark side.

    Yep, I need some more coffee.

    Good stuff John.

    • http://john.do John Saddington

      thanks ben. … and luke becoming a jedi seoer… is… a funny thought.