ansaurus

Question

Unix directory inodes - fragmentation, and dumping directory contents

Answer 1

+2 A:

For question 1, external fragmentation normally causes an overhead of about 2x or so,¹ plus you have internal fragmentation from allocation granularity. Neither of these comes close to explaining your observation.

So, I don't think it is normal steady-state fragmentation.

The most obvious speculation is that 1.5MB is the high-water mark; at one time it really did have either 1.5MB bytes of entries or 1.5MB/2 bytes of entries with expected fragmentation.

Another speculation is that the 50% rule is being defeated by a non-Markovian allocation. Imagine that I name files with "tmp%d", so, tmp1, tmp2, ... tmp1000, tmp1001, ...

The problem here is that rm tmp1 doesn't make room for tmp1001. This is obviously a wild guess.

Q2: There isn't a good way to read the raw directory. AFAIK, you would need to either hack the kernel or use debugfs to change the inode type, read it, then change it back, or use debugfs to read the inode, get the block numbers, then read the blocks. A functional debugging approach is probably more reasonable.

You can address the performance issue by making sure that indexing is enabled. See tune2fs.

¹Knuth's fifty percent rule: in the steady state, 50% of ops are allocations, 50% are frees, 50% of free blocks merge, then holes are 50% of allocations, and 50% of the space is wasted. (Aka, 100% overhead.) This is considered "normal". Malloc has the same problem.

DigitalRoss 2009-10-27 01:43:20

Good answer, thanks. We had a large temporary increase of # files at one point, so your "obvious speculation" has some merit (and unfortunately it was not obvious to me until you pointed it out. :))

Jonathan Swartz 2009-10-28 12:16:09

ansaurus

tags:

views:

answers:

Unix directory inodes - fragmentation, and dumping directory contents

related questions