ansaurus

Question

static pages in Django sitemap framework

Answer 1

A:

How a sitemap is used is dictated by the search engine. Some will only index what you have in the sitemap, while others will use it as a starting point and crawl the entire site based on cross-linking.

As for including non-generated pages, we created a subclass of django.contrib.sitemaps.Sitemap that reads a plain-text file with one URL per line. Something like:

import datetime
import re

from django.contrib.sitemaps import Sitemap


class StaticSitemap(Sitemap):
    priority = 0.8
    # Evaluated once, when the class is defined (i.e., at process start).
    lastmod = datetime.datetime.now()

    def __init__(self, filename):
        self._urls = []
        try:
            f = open(filename, 'r')  # text mode, so the str regexes below work
        except IOError:
            return  # no readable file, so the sitemap stays empty

        with f:
            for x in f:
                x = re.sub(r"\s*#.*$", '', x)  # strip comments
                x = x.strip()  # clean leading/trailing whitespace
                if not x:
                    continue  # ignore blank lines
                x = x.replace(' ', '%20')  # convert spaces
                if not x.startswith('/'):
                    x = '/' + x
                self._urls.append(x)

    def items(self):
        return self._urls

    def location(self, obj):
        # Each item is already a root-relative URL path.
        return obj

You can invoke it with something like this in your main sitemap routine:

sitemap['static'] = StaticSitemap(settings.DIR_ROOT + '/sitemap.txt')

And our sitemap.txt file looks something like this:

# One URL per line.
# All paths start from root - i.e., with a leading /
# Blank lines are OK.

/tour/
/podcast_archive/
/related_sites/
/survey/
/youtube_videos/

/teachers/
/workshops/
/workshop_listing_info/

/aboutus/
/history/
/investment/
/business/
/contact/
/privacy_policy/
/graphic_specs/
/help_desk/
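
To make the line handling concrete, here is a minimal sketch of the same cleanup rules applied to a few sample lines, without needing Django. The helper name normalize_line is hypothetical, and the "contact/" and "/help desk/" lines are made up for illustration (they are not in our real file); the rules themselves mirror what the class above does.

```python
import re


def normalize_line(line):
    """Return a cleaned URL path, or None for comment-only or blank lines."""
    line = re.sub(r"\s*#.*$", "", line)  # strip trailing comments
    line = line.strip()                  # drop surrounding whitespace
    if not line:
        return None                      # nothing left: skip this line
    line = line.replace(" ", "%20")      # encode embedded spaces
    if not line.startswith("/"):
        line = "/" + line                # make the path root-relative
    return line


# Illustrative input only; the last two lines are invented edge cases.
sample = [
    "# One URL per line.\n",
    "\n",
    "/tour/\n",
    "contact/   # missing leading slash\n",
    "/help desk/\n",
]
urls = [u for u in map(normalize_line, sample) if u is not None]
print(urls)  # ['/tour/', '/contact/', '/help%20desk/']
```

So comment-only and blank lines drop out, a missing leading slash gets added, and embedded spaces are percent-encoded before the path is handed to the sitemap.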

Peter Rowell 2010-07-22 20:29:43

I'm really sorry for taking too long to respond, just totally forgot about it. I don't really like this solution, but it's acceptable. I myself used urlresolver but it's quite messy as well. So I'm still in doubt.

Vladimir Shulyak 2010-09-05 19:18:50

Not to worry. I wasn'

Peter Rowell 2010-09-05 23:04:14

I wasn't in love with it either, but when we did it (Summer of 2007) it seemed like a quick way to get it working.

Peter Rowell 2010-09-05 23:05:15
