ansaurus

Question

How do I determine an open file's size in Python?

Answer 1

+4 A:

Perhaps not what you want, but I'll suggest it anyway.

import os
a = os.path.getsize("C:/TestFolder/Input/1.avi")

Alternatively, for a file that is already open, you can use os.fstat. It takes an integer file descriptor, not a file object, so you have to call the fileno method on the file object:

a = open("C:/TestFolder/Input/1.avi")
b = os.fstat(a.fileno()).st_size

Dominic Bou-Samra 2009-12-08 14:38:24

Answer 2

+2 A:

The most reliable approach would be to create a wrapper class that checks the file's size when you open it, tracks write and seek operations, computes the current size from those operations, and prevents writes from exceeding the size limit.

Bartosz 2009-12-08 14:41:27
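
The wrapper described in the answer above could look roughly like the following Python 3 sketch. The class name SizeLimitedFile, the max_size parameter, and the choice of ValueError are all hypothetical and not from the original answer. It assumes a binary file opened in "wb" or "r+b" mode.

```python
import os
import tempfile


class SizeLimitedFile:
    """Hypothetical wrapper around a binary file that tracks and caps its size."""

    def __init__(self, path, mode="wb", max_size=1024):
        # Intended for "wb" or "r+b"; append mode ignores seek positions on write.
        self._f = open(path, mode)
        self.max_size = max_size
        # Check the real size once at open time, then track it in memory.
        self.size = os.fstat(self._f.fileno()).st_size

    def write(self, data):
        end = self._f.tell() + len(data)
        if end > self.max_size:
            raise ValueError("write would exceed the size limit")
        self._f.write(data)
        # Overwriting in the middle of the file does not grow it.
        self.size = max(self.size, end)

    def seek(self, offset, whence=os.SEEK_SET):
        return self._f.seek(offset, whence)

    def close(self):
        self._f.close()


# Usage with a throwaway file in a temporary directory.
with tempfile.TemporaryDirectory() as tmp:
    f = SizeLimitedFile(os.path.join(tmp, "demo.bin"), max_size=10)
    f.write(b"12345678")   # size is now 8
    f.seek(2)
    f.write(b"ab")         # overwrite: size stays 8
    size_after_overwrite = f.size
    try:
        f.write(b"too long for the limit")
        limit_enforced = False
    except ValueError:
        limit_enforced = True
    f.close()
```

Using the write position rather than a running total means that overwriting existing bytes after a seek does not inflate the tracked size.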

Answer 3

+1 A:

Or, if the file is already open:

>>> fsock = open('/etc/hosts', 'rb').read()
>>> len(fsock)
444

That's the size of the file in bytes. Note that read() loads the entire file into memory, so this is only practical for small files.

jathanism 2009-12-08 14:42:18

Answer 4

+1 A:

os.fstat(file_obj.fileno()).st_size should do the trick. I think it will return the number of bytes written. You can always call flush beforehand if you are concerned about buffering.

D.Shawley 2009-12-08 14:44:52
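
As a small sketch of the flush-then-fstat suggestion above (Python 3): the helper name flushed_size is hypothetical, and a temporary file stands in for the real one.

```python
import os
import tempfile


def flushed_size(file_obj):
    """Flush Python's write buffer, then ask the OS for the file size."""
    file_obj.flush()
    return os.fstat(file_obj.fileno()).st_size


with tempfile.TemporaryFile() as f:
    f.write(b"x" * 10)
    # Without a flush, the data is usually still in Python's buffer,
    # so the OS may report a smaller size (often 0) at this point.
    size_before_flush = os.fstat(f.fileno()).st_size
    size_after_flush = flushed_size(f)
    print(size_before_flush, size_after_flush)
```

The value before the flush depends on the buffer size and the platform, so only the value after the flush should be relied on.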

Answer 5

+1 A:

I'm not familiar with Python, but doesn't the stream object (or whatever you get when opening a file) have a property that contains the current position of the stream?

Similar to what you get with the ftell() C function, or Stream.Position in .NET.

Obviously, this only works if you are positioned at the end of the stream, which you are if you are currently writing to it.

The benefit of this approach is that you don't have to close the file or worry about unflushed data.

Isak Savo 2009-12-08 14:47:45
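
In Python the equivalent of ftell() is the tell() method on the file object. The following sketch (Python 3, binary mode assumed) shows both the case from the answer above, where the position is already at the end, and a general variant that seeks to the end and restores the position afterwards. The helper name size_via_seek is hypothetical.

```python
import os
import tempfile


def size_via_seek(file_obj):
    """Return the size of a seekable file without disturbing its position."""
    current = file_obj.tell()
    file_obj.seek(0, os.SEEK_END)   # jump to the end of the file
    size = file_obj.tell()
    file_obj.seek(current)          # restore the original position
    return size


with tempfile.TemporaryFile() as f:
    f.write(b"hello world")
    # While writing sequentially, the position equals the size,
    # and tell() accounts for data that has not been flushed yet.
    size_while_writing = f.tell()

    f.seek(3)
    size_from_middle = size_via_seek(f)
    position_restored = f.tell() == 3
```

For files opened in text mode, tell() returns an opaque number rather than a byte count, so this is best used on binary-mode files.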

Answer 6

+3 A:

You could start with something like this:

class TrackedFile(file):
    def __init__(self, filename, mode):
        self.size = 0
        super(TrackedFile, self).__init__(filename, mode)
    def write(self, s):
        self.size += len(s)
        super(TrackedFile, self).write(s)

Then you could use it like this:

>>> f = TrackedFile('palindrome.txt', 'w')
>>> f.size
0
>>> f.write('A man a plan a canal ')
>>> f.size
21
>>> f.write('Panama')
>>> f.size
27

Obviously, this implementation doesn't work if you aren't writing the file from scratch, but you could adapt your __init__ method to handle initial data. You might also need to override some other methods: writelines, for instance.

This works regardless of encoding, as strings are just sequences of bytes.

>>> f2 = TrackedFile('palindrome-latin1.txt', 'w')
>>> f2.write(u'A man a plan a canál '.encode('latin1'))
>>> f3 = TrackedFile('palindrome-utf8.txt', 'w')
>>> f3.write(u'A man a plan a canál '.encode('utf-8'))
>>> f2.size
21
>>> f3.size
22

jcdyer 2009-12-08 15:17:58

+1: That is a really clever idea. I like it!

jathanism 2009-12-08 15:50:29

That's not actually true. If you use ASCII, ISO 8859 and UTF-8, the result will be the same, but the on-disk size will not be.

e-satis 2009-12-09 17:25:33

No. It works for other encodings too, if you use actual strings. Answer modified to demonstrate.

jcdyer 2009-12-09 17:32:33

The trick is that you can't just write unicode objects and rely on the OS's encoding.

jcdyer 2009-12-09 17:37:40
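
The point of the exchange above can be checked directly. This is a minimal Python 3 sketch, where str holds characters and bytes holds the encoded data. Counting characters only matches the on-disk size for single-byte encodings, so a size tracker has to count the encoded bytes.

```python
text = "A man a plan a canál "

char_count = len(text)                      # characters, not bytes
latin1_size = len(text.encode("latin-1"))   # one byte per character here
utf8_size = len(text.encode("utf-8"))       # "á" takes two bytes in UTF-8

print(char_count, latin1_size, utf8_size)   # prints: 21 21 22
```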
