views:

98

answers:

2

Hi, I have built a Twitter client that's only one page: index.html. When being used, it draws in content from JSON feeds to populate various columns. The issue is hardly any traffic is coming in from Google except for searches on the site's domain name. This is because Google only crawls index.html when it's blank and before the user enters any Twitter username to be looked up.

How can I have the Google spider, when attempting to crawl index.html crawl instead of just index.html (or in addition) also say index.html#CNN (ie site.com/#CNN) or index.html#whitehouse whereby all content from these pages will also be crawled and thus be accessible to users searching on the terms therein.

+2  A: 

I don't think Google will ever include an fragment (#) in its index. You should provide an alternate URL, like site.com/CNN

Greg
Problem here is that I don't want such a page being named as such, as it would not reflect the page's content as soon as the user moves away from CNN. But, it may be the only choice.
Adrian33
A: 
  • Use a site map to illustrate the options for Google.
  • Embed hidden links so that Google knows to look there?
  • Use a separate page to list some "examples". Could be a hybrid for your users plus Google...
Frank V
the site map might work, but there is already one link on the home page to site.com/#testuser (some "examples" exist, but only in a div that draws content from a text file when loaded in a browser (not crawler?)) Content from site.com/#testuser is not crawled.
Adrian33