ansaurus

Question

Reading a file in Python while logging data in screen

Answer 1

+1 A:

I think Option 1 is totally feasible because you can easily have Python "tail" the logfile in a read-only pipe so that no harm is done to it while screen is still writing to it. While tailing the file, you can perform a specified action any time a new log event is detected in the log file.

If you are curious and would like to see some working code, a personal project of mine utilizes this functionality. The project is called thrasher-logdrop and the guts are logdrop.py. The basic flow is:

Tail a file with do_tail()
Watch for log events with tail_lines()
Perform an action on events with handle_line()

jathanism 2010-09-14 18:24:49

Answer 2

+1 A:

I'd say option 2 is the way to go. You have complete control over what you do with each byte of input, as you receive it. You can have a very simple Python script which simply writes the data to disk as it reads it. Your plotting code can run in an entirely separate process created by fork()ing the first. To get the data from one to the other, you can either (a) have the first process also write to a socketpair() or other IPC mechanism; or (b) configure the output file object to be line-buffered -- causing it to explicitly sync after every full line is written -- and monitor it for new content in the second process.

The problem with option 1 is that you have no control over screen's buffering behavior. You can monitor its logfile for new content, but your logging code needs to be prepared to handle both incomplete lines and large chunks of data all at once. Depending on the exact buffering behavior, you might not even see any data at all until the screen process exits!

llasram 2010-09-14 18:56:07

Answer 3

+3 A:

Both option 1 and 2 will work, but oh boy, in the name of all things good, avoid using threads for this! You'll end up with the worst of both worlds: locking problems, and an exception in the graphing thread will kill the whole program (including the logging thread) anyway. As someone else mentioned, using two separate processes for this is fine. screen is a bit of an odd choice of tools for this purpose, as is writing code by hand in python. I'd just rewrite the talk2controller script as this trivial one:

stty -F /dev/tty.KeySerial1 19200 raw
cat </dev/tty.KeySerial1 >logfile

(You could also use >>logfile if you want each run of the script to append to the file, rather than rewriting it from scratch.)

The other question is about whether it's okay to have a program reading from the file as long as someone else is writing to it. A more specific version of this question is: what if a line of the log is half-written at the time you try to read it?

The answer is: you're allowed to do this, but you're right, you can't guarantee that a line won't be half-written at the time you read it. (If you write your own replacement for cat or screen you could actually make this guarantee by always writing to the file using os.read() instead of sys.stdout.write() or print.)

However, that guarantee isn't needed anyway. You only need to be careful when reading the file and you'll never have a problem. Essentially, an incomplete line is just one that doesn't end with a \n newline character. Thus:

for line in open('logfile'):
    if not line.endswith('\n'): break
    ...handle valid line...

Since the \n character is the last thing written by each line of the log, you know for sure that if you read a \n character, everything before it was written correctly.

apenwarr 2010-09-17 00:26:32

ansaurus

tags:

views:

answers:

Reading a file in Python while logging data in screen

Background

Questions

related questions