ansaurus

Question

Answer 1

+1 A:

Are you actually passing an open file to the function? Maybe try printing type(file) and type(line), as there's something fishy here -- with an open file as the argument, I just can't reproduce your problem! (There are other bugs in your code but none that would cause that exception). Oh btw, as best practice, DON'T use names of builtins, such as file, for your own purposes -- that causes incredible amounts of confusion!

Alex Martelli 2009-05-26 04:17:26

Alright, I added full source to my post and changed the parameter name from 'file' to 'inFile' and after trying the type() command,type(inFile) = <class 'io.BufferedReader'>type(line) = <class 'bytes'>Am I passing the file incorrectly to the function?

Justen 2009-05-26 04:40:15

Answer 2

+1 A:

Use the built-in function open() instead of codecs.open().

You're running afoul of the difference between non-Unicode (Python 3 bytes, Python 2 str) and Unicode (Python 3 str, Python 2 unicode) string types. Python 3 won't convert automatically between non-Unicode and Unicode like Python 2 will. Using codecs.open() without an encoding parameter returns an object which yields bytes when you read from it.

Also, your countLOC function won't work:

for letter in range(comment):
    if not letter.whitespace:
        LOC += 1
        break

That for loop will iterate over the numbers from zero to one less than the position of '#' in the string (letter = 0, 1, 2...); whitespace isn't a method of integers, and even if it were, you're not calling it.

Also, you're never incrementing LOC if the line doesn't contain #.

A "fixed" but otherwise faithful (and inefficient) version of your countLOC:

def countLOC(inFile):
    LOC = 0  
    for line in inFile:
        if line.isspace():
            continue
        comment = line.find('#')
        if comment > 0:
            for letter in line[:comment]:
                if not letter.isspace():
                    LOC += 1
                    break
        else:
            LOC += 1
    return LOC

How I might write the function:

def count_LOC(in_file):
    loc = 0  
    for line in in_file:
        line = line.lstrip()
        if len(line) > 0 and not line.startswith('#'):
            loc += 1
    return loc

Miles 2009-05-26 05:16:39

I figured out my error with the letter.whitespace, forgot to make letter the index of the string. And I know I didn't add to the LOC counter if a '#' wasn't found, I just didn't get that far because of the previous error. Thanks for the code though, I'm having a hard time writing "pythonically" coming from c++. Question on lstrip() -- why use that instead of just strip()?

Justen 2009-05-26 05:25:15

lstrip and strip should give the same results; lstrip should do (slightly) less work so I went with that one.

Miles 2009-05-26 05:44:36

ah okay. Well thanks again for the code, wasn't aware of the str.startswith method, should come in handy. Another question concerning python 3.0, how come line.lstrip is a valid command, but I have to type line = line.lstrip for it to actually work as intended?

Justen 2009-05-26 06:07:05

Strings are immutable (the value of a particular string object can't be changed). String methods don't modify the string—they return a new one with the changes applied.

Miles 2009-05-26 17:36:15

ansaurus

tags:

views:

answers:

Having trouble with str.find()

related questions