ansaurus

Question

How can I make sure all my Python code "compiles"?

Answer 1

A:

I think what you are looking for is code test line coverage. You want to add tests to your script that will make sure all of your lines of code, or as many as you have time to, get tested. Testing is a great deal of work, but if you want the kind of assurance you are asking for, there is no free lunch, sorry :( .

Adam Luter 2009-06-22 12:39:25

He is not looking for code to pass tests. He already said, "in 6 months when the otherwise nice code finally gets run, it might simply crack due to some typo." Tests check whether the code does "the right thing" for some finite input set, not whether it uses valid syntax throughout (what the OP wants)

Matthew Flaschen 2009-06-22 12:41:44

It won't pass many tests if it has typos. If your coverage touches every line of code (not every logic path), you'll be reasonably sure that it will work reliably.

S.Lott 2009-06-22 13:01:00

-1. I'm sorry Adam, the question suggests such QA efforts as rather unrealistic, hence the answer is of little help.

sharkin 2009-06-22 13:31:44

"every line of code" doesn't buy you nearly as much as people think. Trivial example posted in my answer.

Matthew Flaschen 2009-06-22 13:34:18

While "every line of code" is not isomorphic to "perfect", it covers as many bases as a C++ compiler covers. C++ code can compile and be full of holes that don't surface until the program is abused in production. A simple set of unit tests will give you tremendous confidence at very, very low cost. Python is so easy to write that the incremental cost of a few unit tests is still (often) cheaper to develop than C++.

S.Lott 2009-06-22 13:41:17

It does not cover as many bases. C++ programs have their own issues, but any C++ compiler will catch these kinds of errors (undeclared variable). Unit tests are very valuable, but they're not enough (for any language).

Matthew Flaschen 2009-06-22 13:49:04

Compiler is very valuable, but it's not enough (for any language).

S.Lott 2009-06-22 14:30:59

I respectfully disagree, Matthew. I meant line coverage tests, which do not focus on testing functionality, but rather try to trigger every line of code. In your example for your answer, your typo is on it's own logical line, just not on it's own physical line. Line test coverage *would* find this. I think the tools you pointed out will help find mistakes too, though. The point remains that there is still no free lunch.R.A., I don't suggest you don't use these tools, nor do I suggest you do line-coverage tests. Rather I just state you are at an impasse given resources and requirements.

Adam Luter 2009-06-22 16:07:18

There is no agreed upon definition of logical line. I fail to see why you think he shouldn't use these tools. I think he does have the necessary resources.

Matthew Flaschen 2009-06-22 17:16:05

S. Lott, obviously the compiler isn't enough either.

Matthew Flaschen 2009-06-22 17:17:09

I specifically said he should (albiet, with a double negative). Anyway, please don't mince words, line-coverage tests would work, if you'd like: remove the word 'line'.

Adam Luter 2009-06-23 12:55:17

Answer 2

+19 A:

Look at PyChecker and PyLint.

Here's example output from pylint, resulting from the trivial program:

print a

As you can see, it detects the undefined variable, which py_compile won't (deliberately).

in foo.py:

************* Module foo
C:  1: Black listed name "foo"
C:  1: Missing docstring
E:  1: Undefined variable 'a'


...

|error      |1      |1        |=          |

Trivial example of why tests aren't good enough, even if they cover "every line":

bar = "Foo"
foo = "Bar"
def baz(X):
    return bar if X else fo0

print baz(input("True or False: "))

EDIT: PyChecker handles the ternary for me:

Processing ternary...
True or False: True
Foo

Warnings...

ternary.py:6: No global (fo0) found
ternary.py:8: Using input() is a security problem, consider using raw_input()

Matthew Flaschen 2009-06-22 12:39:34

Good recommendation of pychecker and pylint. pyflakes is also good because it's very fast, and the svn trunk version will catch unused local variables. As for testing "every line," I think you should at least test every "path." That would have caught your blow-up example.

Ryan Ginstrom 2009-06-22 13:54:59

Works great Matthew, thanks!

sharkin 2009-06-22 13:58:16

It is impossible to identify (and then test) every possible logic path, because that is equivalent to the halting problem (http://en.wikipedia.org/wiki/Code_coverage).

Matthew Flaschen 2009-06-22 14:03:02

You can certainly ensure that almost every piece of code is hit at least once. You can't test every logic path for even a slightly complex program, which is why I put "path" in quotes. Making sure every piece of code was hit in your example would have caught the error.

Ryan Ginstrom 2009-06-22 15:17:20

"You can certainly ensure that almost every piece of code is hit at least once." That really doesn't seem to mean anything in particular. Almost? Piece of code?

Matthew Flaschen 2009-06-22 15:40:05

+1 because these tools definitely help. But line-coverage would find the typo above if you test logical-lines, not physical-lines. Also, I am confused as to why PyChecker is complaining about 'a' but not 'fo0'?

Adam Luter 2009-06-22 16:09:45

Adam, it detects fo0 for me. And I don't think there's a solid definition for logical lines.

Matthew Flaschen 2009-06-22 16:45:18

Answer 3

A:

Your code actually gets compiled when you run it, the Python runtime will complain if there is a syntax error in the code. Compared to statically compiled languages like C/C++ or Java, it does not check whether variable names and types are correct – for that you need to actually run the code (e.g. with automated tests).

Alex Morega 2009-06-22 12:49:04

-1. It seems there actually are tools around to discover errors like that, hence not needing to actually run the code to discover them.

sharkin 2009-06-22 13:36:18

Answer 4

+1 A:

If you are using Eclipse with Pydev as an IDE, it can flag many typos for you with red squigglies immediately, and has Pylint integration too. For example:

foo = 5
print food

will be flagged as "Undefined variable: food". Of course this is not always accurate (perhaps food was defined earlier using setattr or other exotic techniques), but it works well most of the time.

In general, you can only statically analyze your code to the extent that your code is actually static; the more dynamic your code is, the more you really do need automated testing.

Kiv 2009-06-22 13:09:00

Answer 5

+1 A:

Others have mentioned tools like PyLint which are pretty good, but the long and the short of it is that it's simply not possible to do 100%. In fact, you might not even want to do it. Part of the benefit to Python's dynamicity is that you can do crazy things like insert names into the local scope through a dictionary access.

What it comes down to is that if you want a way to catch type errors at compile time, you shouldn't use Python. A language choice always involves a set of trade-offs. If you choose Python over C, just be aware that you're trading a strong type system for faster development, better string manipulation, etc.

Imagist 2009-06-30 03:26:18

ansaurus

tags:

views:

answers:

How can I make sure all my Python code "compiles"?

related questions