External memory merge sort | ansaurus

tags:

views:

27

answers:

1

Q:

External memory merge sort

Can anyone point me to a good reference on External Memory Mergesort? I've read the wiki page but am having trouble understanding it exactly. An animation might help but I can't seem to find one.

Basically, I know that you have a certain number of blocks on disk, and you can fit a certain number of blocks in memory. Lets say you have 32 blocks on disk and 4 blocks in memory. In the first pass you read 4 blocks into memory at a time, sort them in memory, and write them back out do disk. So at this point you have 8 sorted runs of 4 blocks. How does the merging work? Since I have 4 blocks in memory (assume I have one more for output) I think I should be able to merge 4 of those 8 runs at a time, and then merge the next 4 runs. And then in the last pass I want to merge the whole thing. But don't you have to read each block from disk each time? So how does this not become a n^2 solution?

A:

I think I get it now. When you merge you still have to read all of the blocks on disk (let's call that B) but you don't have to read them B times. You only read them B log B times.

JPC 2010-10-12 01:23:03

related questions

CLR Profiler - Attaching to existing process

What's the maximum amount of RAM I can use in a Windows box?

.Net 2.0 - How efficient are Generic Lists?

Process Memory Size - Different Counters

C++ Memory management

Using MySQLi - which is better for closing queries

How Does A Stack Overflow Occur and How Do You Prevent It?

C Memory Management

How does .net managed memory handle value types inside objects?

FLVPlayback component memory issues

memset() causing data abort

Is Visual C++ memory managed by the Dot Net framework

Preventing Memory Leaks with Attached Behaviours

How to dispose a class in .net?

best way to persist data in .NET Web Service

Reading Other Process' Memory in Mac OS / BSD

Replicating load related crashes in non-production environments

Secure Memory Allocator in C++

Best way to wrap rsync progress in a gui?

Some kind of task manager for JavaScript in Firefox 3?

Of Memory Management, Heap Corruption, and C++

Understanding reference counting with Cocoa / Objective C

Setting Objects to Null/Nothing after use in .NET

Heap corruption under Win32; how to locate?

Anatomy of a "Memory Leak"