What are your favorite programming-related academic papers?

views:

4733

answers:

+68 Q:

What are your favorite programming-related academic papers?

What are your favorite programming-related / cs academic published papers?

It could be a functional pearl, a programming-language paper like those often cited on lambda-the-ultimate.org, etc. Really, anything vaguely related to programming. Please also explain why you like the paper.

+13 A:

I have in mind a somewhat forgotten paper: Structured Programming with go to Statements, Donald Knuth, ACM Computing Surveys, Vol 6, No. 4, Dec. 1974.

It's actually a rebuttal to the "GOTO considered harmful" creed (saying that "GOTO is sometimes okay, really"), a view that is now out of fashion (to say the least!), but it also says some important things about the practice of programming. Most famously, it is the source of the quote "Premature optimization is the root of all evil". :-)
[It's also instructive to read the quote in its original context, to see what Knuth did not mean.]

Quite related is Knuth's Computer Programming as an Art, Knuth’s Turing Award lecture (1974), printed in Communications of the ACM, Volume 17, Issue 12, Dec. 1974, which is available e.g. here.

ShreevatsaR 2008-12-10 23:56:58

I just found out that the excellent Mark Jason Dominus (mjd, http://blog.plover.com/) also considers this paper (Structured programming with go to Statements) "my single all-time favorite computer science paper": http://blog.plover.com/prog/Hoare-logic.html#fn3

ShreevatsaR 2009-08-25 21:46:30

+9 A:

Oh there are so many fantastic papers. Purely Functional Data Structures is great. I'm still reading it, but so far it has been an excellent read. The implementations of the data structures are short and beautiful.

Jules 2008-12-11 00:04:03

Chris turned his thesis into a book. It's a great book.

Norman Ramsey 2008-12-11 05:07:04

+3 A:

By far: Functional Reactive Animation by Conal Elliott and Paul Hudak. It was the paper that got me interested in functional reactive programming, but more importantly it is a fantastic exposition of the design methodology called Semantic Design, which has had a profound impact on the way I think about software.

Pickler Combinators is another one of my favorites, for its minimal elegance in solving a practical problem.

luqui 2008-12-11 00:32:01

To be clear, functional reactive animation is since outdated with much research in the FRP community, but it is still my favorite paper on the subject.

luqui 2008-12-11 00:33:47

The FRP paper was voted the most influential paper from ICFP 1997, I believe.

Norman Ramsey 2008-12-11 05:17:05

+8 A:

G'day,

While not purely academic Edsger Dijkstra's paper "The Humble Progammer", his Turing Lecture from 1972, is an excellent paper to read. My favourite quote is

The competent programmer is fully aware of the strictly limited size of his own skull; therefore he approaches the programming task in full humility, and among other things he avoids clever tricks like the plague.

HTH

cheers,

Rob

Rob Wells 2008-12-11 00:43:14

+1 Dijkstra's archive @ utexas is worth combing through slowly

just somebody 2009-12-13 18:06:15

+1 A:

It might not seem programming related, but it really is something that programmers should be aware of: http://www.apa.org/journals/features/psp7761121.pdf ("Unskilled and Unaware of It: How Difficulties in Recognizing One's Own Incompetence Lead to Inflated Self-Assessments")

Oftentimes, we only see our own knowledge, and the corresponding ignorance of someone else in our own field. We mock the "PhD Professor" who can't remember how to get their email. What we fail to see is our own ignorance in their and other fields. Once we approach other people, our customers, knowing that they have knowledge that we don't, and we have knowledge that they don't, and that our goal is to help them do their jobs better, we can communicate without arrogance and condescension.

thursdaysgeek 2008-12-11 01:46:32

Interesting read.

Paul Nathan 2008-12-11 04:59:37

A very good paper to put (anonymously) in someones inbox... Pure evil.

2009-02-18 10:02:43

Sure, but they probably won't understand why it was in their box!

thursdaysgeek 2009-02-20 03:53:45

+8 A:

Big Ball of Mud

Abstract:

While much attention has been focused on high-level software architectural patterns, what is, in effect, the de-facto standard software architecture is seldom discussed. This paper examines this most frequently deployed of software architectures: the BIG BALL OF MUD. A BIG BALL OF MUD is a casually, even haphazardly, structured system. Its organization, if one can call it that, is dictated more by expediency than design. Yet, its enduring popularity cannot merely be indicative of a general disregard for architecture.

These patterns explore the forces that encourage the emergence of a BIG BALL OF MUD, and the undeniable effectiveness of this approach to software architecture. What are the people who build them doing right? If more high-minded architectural approaches are to compete, we must understand what the forces that lead to a BIG BALL OF MUD are, and examine alternative ways to resolve them.

A number of additional patterns emerge out of the BIG BALL OF MUD. We discuss them in turn. Two principal questions underlie these patterns: Why are so many existing systems architecturally undistinguished, and what can we do to improve them?

Why do I like it? It's an irreverent but useful look at the software development world as it actually exists.

Darron 2008-12-11 01:49:56

+12 A:

Here are the papers that come to mind:

Automata via Macros. A functional pearl in Scheme that illustrates the power of macros in conjunction with tail-call optimizations.
Beautiful concurrency. An accessible presentation of Software Transactional Memory (STM), showcasing the power of the type system in Haskell.
Stochastic Lambda Calculus and Monads of Probability Distributions. This paper shows that probability distrbutions form a monad, and it can be nicely implemented in languages like F#, Scala, and of course Haskell.
From Variadic Functions to Variadic Relations: A miniKanren Perspective. A paper that can be seen as a condensed advanced version of the Reasoned Schemer. The appendix has a full implementation of a miniKanren, a Prolog-like language.
A tutorial on the universality and expressiveness of fold. If you're not totally fluent with fold, you'll be after reading this paper.
Monadic parsing in Haskell. A functional pearl which introduces monads for parsing.
Functional Programming with Bananas, Lenses, Envelopes, and Barbed Wire. Presentation of all sorts of common constructs of functional programming.

namin 2008-12-11 03:22:00

+1 A:

The page rank citation ranking system

It describes how Google works.

Google uses it for web documents, but it's useful for pretty much any directed graph structure. For non directed graph's it's subject to manipulation, but if you "trust" the nodes in the graph, then it will work for non directed graphs too.

One interesting application of it to programing would be to index libraries in source code as a ranking system in a code / api documentation search.

It would be really cool, for example, to build an index of something like source forge (or code plex), using "page rank" between types and methods as a relevance metric.

The easiest language to do it for would be Java (because Java namespaces use reverse domain names by convention).

I wrote a prototype of something like this when I worked at Microsoft. It indexed VB source code, api documentation, and compiled .net assemblies. It also did stemming of identifier names, recogonzing camel casing, pascal casing, and underscores, and split things up into multiple groups.

It got pretty good results.

I was pretty new though (I had only been there a couple weeks), I worked on a different team (the VB team), so I wasn't very successful in convincing the MSDN help folks to adopt it. They ended up just using Windows Live Search.

In any case....

Another use for it would be for Stack Overflow.

You could, for example, treat stack overflow as a large graph, with edges from people to documents, and from documents to people. If you then weighted the edges based on reputation, you could compute page rank over the graph, and then use the page rank to sort search results.

This would produce different results than what Google does (because the edges in normal page rank are not weighted), but I think for the purposes of searching Stack Overflow questions it would yield better results than Google.

I'm willing to bet that most links into stack overflow from the "outside" are to the main page, not to individual questions.

This means from Google's point of view, most stack overflow questions and answers are pretty much equivalent.

But, if you calculated page rank, using "reputation" to weight edges, you would get relevance results that reflected the values of the Stack Overflow community.

In any case... they paper is really good. You should read it.

Scott Wisniewski 2008-12-11 03:58:05

Most google searches don't go to the main page. I'm consistantly finding SO questions popping up in search results

Elizabeth Buckwalter 2010-05-21 21:12:36

I didn't say most Google Searches go to the main page.I was saying that the Google Results may not be as good as they could be, because Google doesn't take reputation into account.They results may very well be "good enough".I was just expressing a way that I thought they could be "better".

Scott Wisniewski 2010-05-21 22:06:58

+7 A:

Gotta throw in Leslie Lamport's Time, Clocks, and the Ordering of Events in a Distributed System. Not so much for the clock sync part, but for the distributed event ordering. First paper I know of to talk about what causality really means in a distributed system. None of today's big webapps would really work without that :-).

tgamblin 2008-12-11 04:01:52

+3 A:

of course, the answer would be any computer generated scientific paper!

but joking aside, I'm (like many other probably) really intrigued by the whole P != NP problem.

The first time I heard about (this was in my first bachelor year at VUB) it had a huge influence on me. I still can remember how the problem got my attention for several weeks... I kept reading and looking up information about, even foolishly tried to come up with some clever algorithms on my own. Good times it were back then!

here is an example of such a paper, but there are many, many others just like it

Sven 2008-12-11 04:02:33

+2 A:

I really enjoyed A History of Erlang by Joe Armstrong, Erlang's creator. It's a really interesting inside look at the creation of a new (and exciting) programming language.

mipadi 2008-12-11 04:28:23

+18 A:

For this audience I think I have to go with John Hughes's seminal paper Why Functional Programming Matters because it is densely packed with great ideas and because the topic of the paper is programming. I like the paper because any programmer can pick it up and read it and most will come away excited by the ideas. Also, there are lots of examples in the paper, so you can immediately pick up some kind of Haskell implementation and start trying the ideas for yourself right away.

For the academics in the crowd, I'll also give a shout-out to Tony Hoare's great paper Proof of Correctness of Data Representations. It's intensely mathematical, but it is the canonical reference for a critically important and poorly understood technique: abstract data types. If we all understood about abstraction functions and representation invariants, the world would be a far better place. Heck, I'd just settle for getting people to document their representation invariants and leaving me to figure out the abstraction functions for myself. (Some of Tony's best papers have been collected into a book called Essays in Computing Science. Tony is one of my heroes and this collection has some fabulous chapters including Tony's wonderful Turing lecture on 'The Emperor's New Clothes; a talk he gave to a lay audience on what programming is; a proof of a simple program, finding the kth largest element in an array; and many other nice things. Even if you're not mathy you can pick up this book and skip the mathy chapters and still enjoy it a lot.)

Norman Ramsey 2008-12-11 05:15:15

The John Hughes paper link is dead. Here's a new one: [Why Functional Programming Matters (PDF)](http://www.netsis.com.tr/goksel/IleriProgramlama/Functional.pdf "Why Functional Programming Matters by John Hughes")

Ergwun 2010-09-23 03:51:30

+7 A:

Don't mean to get a little too classic on you, but I've gotta say Turing's On Computable Numbers, with an Application to the Entscheidungsproblem has got to be, at least I would consider, one of the most significant (even if only historically) papers in CS. And Charles Petzold's The Annotated Turing does an excellent job of illustrating the significance, as well as making the mathematics understandable, even to a mildly-mathematical programmer such as myself. In fact, I'm pretty sure Jeff mentioned it in one of the Stackoverflow podcasts.

JayRan 2008-12-11 07:58:16

+3 A:

Trace Trees [PDF] by Andreas Gal, Michael Bebenita, Mason Chang, Michael Franz

Used by Mozilla Tracemonkey javascript interpreter, and contributed to the "javascript interpreter wars."

ansaurus

tags:

views:

answers:

What are your favorite programming-related academic papers?

related questions