views:

456

answers:

6

PEP 8 says:

  • Imports are always put at the top of the file, just after any module comments and docstrings, and before module globals and constants.

On occation, I violate PEP 8. Some times I import stuff inside functions. As a general rule, I do this if there is an import that is only used within a single function.

Any opinions?

EDIT (the reason I feel importing in functions can be a good idea):

Main reason: It can make the code clearer.

  • When looking at the code of a function I might ask myself: "What is function/class xxx?" (xxx being used inside the function). If I have all my imports at the top of the module, I have to go look there to determine what xxx is. This is more of an issue when using from m import xxx. Seeing m.xxx in the function probably tells me more. Depending on what m is: Is it a well-known top-level module/package (import m)? Or is it a sub-module/package (from a.b.c import m)?
  • In some cases having that extra information ("What is xxx?") close to where xxx is used can make the function easier to understand.
+2  A: 

As long as it's import and not from x import *, you should put them at the top. It adds just one name to the global namespace, and you stick to PEP 8. Plus, if you later need it somewhere else, you don't have to move anything around.

It's no big deal, but since there's almost no difference I'd suggest doing what PEP 8 says.

Javier Badia
Actually, putting `from x import *` inside a function will generate a SyntaxWarning, at least in 2.5.
Rick Copeland
+8  A: 

There are two occasions where I violate PEP 8 in this regard:

  • Circular imports: module A imports module B, but something in module B needs module A (though this is often a sign that I need to refactor the modules to eliminate the circular dependency)
  • Inserting a pdb breakpoint: import pdb; pdb.set_trace() This is handy b/c I don't want to put import pdb at the top of every module I might want to debug, and it easy to remember to remove the import when I remove the breakpoint.

Outside of these two cases, it's a good idea to put everything at the top. It makes the dependencies clearer.

Rick Copeland
I agree it makes dependencies clearer with regards to the module as a whole. But I believe that it can make the code less clear at the function level to import everything at the top. When you're looking at a the code of a function you might ask yourself: "What is function/class xxx?" (xxx is used inside the function). And you have to look at the very top of the file to see where xxx comes from. This is more of an issue when using from m import xxx. Seeing m.xxx tells you more - at least if there is no doubt as to what m is.
codeape
+2  A: 

One thing to bear in mind: needless imports can cause performance problems. So if this is a function that will be called frequently, you're better off just putting the import at the top. Of course this is an optimization, so if there's a valid case to be made that importing inside a function is more clear than importing at the top of a file, that trumps performance in most cases.

If you're doing IronPython, I'm told that it's better to import inside functions (since compiling code in IronPython can be slow). Thus, you may be able to get a way with importing inside functions then. But other than that, I'd argue that it's just not worth it to fight convention.

As a general rule, I do this if there is an import that is only used within a single function.

Another point I'd like to make is that this may be a potential maintenence problem. What happens if you add a function that uses a module that was previously used by only one function? Are you going to remember to add the import to the top of the file? Or are you going to scan each and every function for imports?

FWIW, there are cases where it makes sense to import inside a function. For example, if you want to set the language in cx_Oracle, you need to set an NLS_LANG environment variable before it is imported. Thus, you may see code like this:

import os

oracle = None

def InitializeOracle(lang):
    global oracle
    os.environ['NLS_LANG'] = lang
    import cx_Oracle
    oracle = cx_Oracle
Jason Baker
I agree with your maintenence issue. Refactoring code can be a bit problematic. If I add a second function that uses a module previously used only by one function - I either move the import to the top, or I break my own general rule by importing the module in the second function as well.
codeape
+2  A: 

I've broken this rule before for modules that are self-testing. That is, they are normally just used for support, but I define a main for them so that if you run them by themselves you can test their functionality. In that case I sometimes import getopt and cmd just in main, because I want it to be clear to someone reading the code that these modules have nothing to do with the normal operation of the module and are only being included for testing.

Daniel Lew
+6  A: 

Here are the four import use cases that we use

  1. import (and from x import y and import x as y) at the top

  2. Choices for Import. At the top.

    import settings
    if setting.something:
        import this as foo
    else:
        import that as foo
    
  3. Conditional Import. Used with JSON, XML libraries and the like. At the top.

    try:
        import this as foo
    except ImportError:
        import that as foo
    
  4. Dynamic Import. So far, we only have one example of this.

    import settings
    module_stuff = {}
    module= __import__( settings.some_module, module_stuff )
    x = module_stuff['x']
    

    Note that this dynamic import doesn't bring in code, but brings in complex data structures written in Python. It's kind of like a pickled piece of data except we pickled it by hand.

    This is also, more-or-less, at the top of a module


Here's what we do to make the code clearer:

  • Keep the modules short.

  • If I have all my imports at the top of the module, I have to go look there to determine what a name is. If the module is short, that's easy to do.

  • In some cases having that extra information close to where a name is used can make the function easier to understand. If the module is short, that's easy to do.

S.Lott
Keeping modules short is of course a very good idea. But to get the benefit of always having "import information" for functions available, the maximum module length would have to be one screen (probably 100 lines max). And that would probably be too short to be practical in most cases.
codeape
I suppose you could take this to a logical extreme. I think there might be a balance point where your module is "small enough" that you don't need fancy import techniques to manage complexity. Our average module size is -- coincidentally -- about 100 lines.
S.Lott
+4  A: 

In the long run I think you'll appreciate having most of your imports at the top of the file, that way you can tell at a glance how complicated your module is by what it needs to import.

If I'm adding new code to an existing file I'll usually do the import where it's needed and then if the code stays I'll makes things more permanent by moving the import line up to the top of the file.

One other point, I prefer to get an ImportError exception before any code is run -- as a sanity check, so that's another reason to import at the top.

I use pyChecker to check for unused modules.

Peter Ericson