ansaurus

Question

K&R interface for reading directories: superfluous DIR structure?

Answer 1

+3 A:

From K&R:

Regrettably, the format and precise contents of a directory are not the same on all versions of the system. So we will divide the task into two pieces to try to isolate the non-portable parts. The outer level defines a structure called a Dirent and three routines opendir, readdir, and closedir to provide system-independent access to the name and inode number in a directory entry.

So the reason is portability. They want to define an interface that can survive on systems that have different stat structs or nonstandard open() and close(). They go on to build a bunch of reusable tools around it, which don't even care if they're on a Unix-like system. That's the point of wrappers.

Maybe it's not used because they started out by defining their data structures (with a Dirent inside DIR) but ended up not using it. Keeping data structures grouped like that is good design.

Nathon 2010-09-17 14:02:46

Portability was also my first guess but after thinking it over I don't see how `DIR` could contribute to this. The only relevant information it can pass to `readdir()` is the file descriptor.I still don't see the use of the `Dirent` component in `DIR`. Regardless of the system, any implementation of `readdir()` can have a static `Dirent` to which it can return a pointer, so this should not be a portability issue.It is true that `dirwalk()` accesses the contents of a `Dirent`, but this is the static one from `readdir()`, not the one contained in `DIR`.Am I missing something?

qfab 2010-09-17 14:43:50

Hey, it looks like you're right. My guess is that they started out by defining their data structures (with a `Dirent` inside `DIR`) but ended up not using it. Grouping related data together in structs is good juju. A good exercise would be to rewrite the code to make use of `DIR.d` instead of having `readdir()`'s callers have their own `Dirent` pointers.

Nathon 2010-09-17 15:08:10

Yes, this is a plausible explanation. But considering that the book was published over 20 years ago (2nd edition), it is strange that something like this is not mentioned in the [errata](http://cm.bell-labs.com/cm/cs/cbook/2ediffs.html).

qfab 2010-09-17 16:18:04

Strange, yes. But not unheard of. The list of errata you linked was last updated in October of 2006, which means they found an error 18 years after it was published. Plus, it isn't actually broken code; just some wasted memory.

Nathon 2010-09-17 17:16:43

That is true, the code is not broken, but it is quite confusing for a beginner like me.

qfab 2010-09-17 21:44:08

@Nathon: Could you edit your answer so that it contains the assumption you made in the comments? I would then set it as my accepted answer.

qfab 2010-09-17 21:48:55

Answer 2

A:

It is so they don't have to allocate memory for the Dirent structure that is returned by readdir. This way they can reuse the Dirent between subsiquent calls to readdir.

Burton Samograd 2010-09-17 15:47:59

But they don't have to allocate memory anyway because `readdir()` stores the `Dirent` as a static variable.

qfab 2010-09-17 16:33:02

@qfab: Yes but that's a really bad design. A hypothetical improved implementation would put the buffer inside the `DIR` structure so that simultaneous reading of multiple directories would not clobber the data (and so it would be thread-safe as long as you don't use a single `DIR` object from more than one thread at a time). I expect modern implementations do this; mine certainly does.

R.. 2010-09-17 16:59:37

ansaurus

tags:

views:

answers:

K&R interface for reading directories: superfluous DIR structure?

related questions