views:

80

answers:

4

tl;dr: I want to load an XML file once and reuse it over and over again.

I have a bit of JavaScript that makes an Ajax request to a PHP page, which collects and parses some XML and returns it for display (say there are 4,000 nodes and the PHP paginates the results into chunks of 100, giving 40 "pages" of data). If someone clicks on one of those other pages (besides the one that initially loads), another request is made: the PHP loads that big XML file, grabs that subset of indexes (like records 200-299), and returns them for display. My question is: is there a way to load that XML file only once and just reuse it over and over?

The process on each Ajax request (sketched below) is:
- load the XML file (simplexml_load_file())
- parse out the bits needed (with xpath)
- use LimitIterator to grab the specific set of indexes I need
- return that set
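
Roughly, the current code looks like this (a simplified sketch; the paths, the XPath expression, and the page size are placeholders):

$xml   = simplexml_load_file('/path/to/source.xml'); // load the whole file on every request
$nodes = $xml->xpath('//record');                    // parse out the bits needed

$perPage = 100;
$page    = isset($_GET['page']) ? (int)$_GET['page'] : 0;

// grab just the indexes for this page (e.g. page 2 => records 200-299)
$records = new LimitIterator(new ArrayIterator($nodes), $page * $perPage, $perPage);
foreach ($records as $record) {
    // ... format this record for the response
}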

What I'd like to happen when someone requests a different paginated result:
- use LimitIterator on the data I loaded in the previous request (reparse if needed)
- return that set

It seems (it is, right?) that hitting the XML file every time is a huge waste. How would I go about grabbing it and persisting it so that different pagination requests don't have to reload the file every time?

+1  A: 

I believe the closest thing you are going to get is Memcached.

Although, I wouldn't worry about it, especially if it is a local file; include-like operations are fairly cheap.
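
For instance, a minimal sketch of what a Memcached-backed cache could look like (this assumes the pecl/memcached extension and a server on localhost; get_the_results_chunk() is a hypothetical helper standing in for your existing load/parse/slice step):

$mc = new Memcached();
$mc->addServer('127.0.0.1', 11211);

$page  = isset($_GET['page']) ? (int)$_GET['page'] : 0;
$chunk = $mc->get('xml_page_' . $page);

if ($chunk === false) {
    // cache miss: load, parse and slice as usual, then cache for five minutes
    $chunk = get_the_results_chunk($page); // hypothetical helper
    $mc->set('xml_page_' . $page, $chunk, 300);
}

echo $chunk;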

Chacha102
Well, it is simplexml_load_file() that is doing the loading, which is not quite like an include, and it seems that loading many thousands of records would be less than cheap. I'll see what Memcached has to offer.
rg88
A: 

Could you load it into $_SESSION data? Or would that blow out memory due to the size of the chunk?
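
Something like this, perhaps (a rough sketch; note that SimpleXMLElement objects can't be serialized into the session, so you'd store plain strings or arrays, and the path and node names here are placeholders):

session_start();

if (!isset($_SESSION['records'])) {
    $xml = simplexml_load_file('/path/to/source.xml');
    // flatten to plain strings so the data survives session serialization
    $_SESSION['records'] = array_map('strval', $xml->xpath('//record'));
}

$page    = isset($_GET['page']) ? (int)$_GET['page'] : 0;
$records = array_slice($_SESSION['records'], $page * 100, 100);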

niggles
+2  A: 

Just have your server do the reading and parsing of the file, paginated based on the user's input. The results can be cached on the server far more quickly than the client could download and cache the entire XML document. Use PHP, Perl, ASP or what have you to paginate the data prior to displaying it to the user.

drlouie - louierd
+1  A: 

To the question "hitting the XML file every time is a huge waste", the answer is yes, if you have to parse that big XML file every time. As I understand it, you want to save the chunk the user is interested in so that you don't have to do that every time. How about a very simple file cache? No extension required; fast, simple to use and maintain. Something like this:

function echo_results($start)
{
    // IMPORTANT: make sure that $start is a valid number
    $cache_file = '/path/to/cache/' . $start . '.xml';
    $source     = '/path/to/source.xml';

    // serve the cached chunk only if it is newer than the source file
    if (file_exists($cache_file)
     && filemtime($cache_file) > filemtime($source))
    {
        readfile($cache_file);
        return;
    }

    // cache miss or stale cache: rebuild the chunk and store it
    $xml = get_the_results_chunk($start);
    file_put_contents($cache_file, $xml);

    echo $xml;
}

As an added bonus, you use the source file's last modification time so that you automatically ignore cached chunks that are older than their source.

You can even save it compressed and serve it as-is if the client supports gzip compression (IOW, 99% of browsers out there), or decompress it on the fly otherwise.
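
For example, a rough sketch (assuming the zlib extension; $cache_file and $xml as in the function above):

// when building the cache, also store a gzipped copy
file_put_contents($cache_file . '.gz', gzencode($xml));

// when serving, pick the delivery method based on the client
$accepts = isset($_SERVER['HTTP_ACCEPT_ENCODING']) ? $_SERVER['HTTP_ACCEPT_ENCODING'] : '';
if (strpos($accepts, 'gzip') !== false) {
    header('Content-Encoding: gzip');
    readfile($cache_file . '.gz');   // serve the compressed bytes as-is
} else {
    readgzfile($cache_file . '.gz'); // decompress on the fly
}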

Josh Davis
But would $xml = get_the_results_chunk($start); file_put_contents($cache_file, $xml); be much of an improvement over just my original simplexml_get_file()? I'm not sure it would be.
rg88
I don't know what your simplexml_get_file() function does. By caching the chunk, you save 99% of the processing the next time it is requested. If you prefer, you can also precache all 40 chunks at once, every time the source document changes.
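Something along these lines, as a rough sketch (assuming the 4,000 records and 100-per-page figures from the question):

// rebuild every cached chunk in one pass whenever the source changes
for ($start = 0; $start < 4000; $start += 100) {
    file_put_contents('/path/to/cache/' . $start . '.xml',
                      get_the_results_chunk($start));
}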
Josh Davis