CloneDR is a tool for finding exact and near-miss clones, i.e. blocks of code created by copy-and-paste editing.
It can handle systems of millions of lines of code.
It uses precise language grammars to pick out language structures (identifiers, expressions, statements, blocks, functions, classes, packages, ...) that have been copied, and to determine the points of variation across each set of clones (any of those structures can be parameters!).
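To make the idea of grammar-based detection with "points of variation" concrete, here is a toy sketch in Python. It is not CloneDR's algorithm or API; it only uses Python's standard `ast` module, treats whole functions as the clone unit, and only lets identifiers and constants vary. The names `shape_and_leaves`, `find_clones`, and `SAMPLE` are made up for this illustration.

```python
import ast
from collections import defaultdict


def shape_and_leaves(node):
    """Return (shape, leaves) for a syntax tree.

    shape  : the tree structure with identifiers and constants abstracted away
    leaves : the concrete identifier/constant values, in traversal order
    """
    leaves = []

    def walk(n):
        if isinstance(n, ast.Name):          # variable reference
            leaves.append(n.id)
            return "ID"
        if isinstance(n, ast.arg):           # function parameter
            leaves.append(n.arg)
            return "ID"
        if isinstance(n, ast.Constant):      # literal value
            leaves.append(repr(n.value))
            return "CONST"
        children = ", ".join(walk(c) for c in ast.iter_child_nodes(n))
        return f"{type(n).__name__}({children})"

    return walk(node), leaves


def find_clones(source):
    """Group functions with identical shape and list where they differ.

    Returns a list of (function_names, varying_leaves) pairs; each entry of
    varying_leaves is a tuple holding one value per function in the group.
    """
    groups = defaultdict(list)
    for node in ast.walk(ast.parse(source)):
        if isinstance(node, ast.FunctionDef):
            shape, leaves = shape_and_leaves(node)
            groups[shape].append((node.name, leaves))

    clones = []
    for members in groups.values():
        if len(members) < 2:
            continue                         # no duplicate of this shape
        names = [name for name, _ in members]
        columns = zip(*(leaves for _, leaves in members))
        # Keep only positions where the members disagree; drop repeats so a
        # consistently renamed variable is reported once.
        varying = list(dict.fromkeys(c for c in columns if len(set(c)) > 1))
        clones.append((names, varying))
    return clones


# Hypothetical input: two near-miss clones and one unrelated function.
SAMPLE = '''
def area_rect(w, h):
    return w * h + 1

def area_tri(b, h):
    return b * h + 2

def unrelated(x):
    return x - 1
'''

if __name__ == "__main__":
    for names, varying in find_clones(SAMPLE):
        print(names, varying)
```

On the sample input, `area_rect` and `area_tri` end up in one clone set, with the renamed parameter and the differing constant reported as the points of variation. A real detector additionally has to match subtrees at every level, tolerate inserted or deleted statements, and scale to millions of lines, none of which this sketch attempts.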
CloneDR operates on a wide variety of languages (C, C++, C#, Java, PHP, COBOL, Python, Ada, Fortran, ...).
The website has a number of sample clone detection reports from a variety of those languages.
EDIT OCT 2010: EGL and Visual Basic (VBScript, VB6, VB.NET) added.