ansaurus

Question

How to replicate tee behavior in python when using subprocess?

Answer 1

+2 A:

This is a straightforward port of tee to Python.

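import sys

# Every command-line argument names a file to copy stdin into.
sinks = [open(name, "w") for name in sys.argv[1:]]
# Echo to the terminal as well (the real tee echoes to stdout instead).
sinks.append(sys.stderr)

while True:
    chunk = sys.stdin.read(1024)  # returns "" at end of input
    if not chunk:
        break
    for sink in sinks:
        sink.write(chunk)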

I'm running on Linux right now but this ought to work on most platforms.

Now for the subprocess part, I don't know how you want to 'wire' the subprocess's stdin, stdout and stderr to your stdin, stdout, stderr and file sinks, but I know you can do this:

import subprocess
callee = subprocess.Popen(["python", "-i"],
                          stdin=subprocess.PIPE,
                          stdout=subprocess.PIPE,
                          stderr=subprocess.PIPE)

Now you can access callee.stdin, callee.stdout and callee.stderr like normal files, enabling the above "solution" to work. If you want to get the callee.returncode, you'll need to make an extra call to callee.poll() (or to callee.wait(), which blocks until the process exits).

Be careful when writing to callee.stdin: if the process has already exited when you do that, an error may be raised (on Linux, I get IOError: [Errno 32] Broken pipe).
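As a rough sketch of that wiring, assuming only the child's stdout needs copying and that line-by-line granularity is enough, something like this forwards output while the process is still running. The `tee_stdout` name and its `log_path` parameter are made up for illustration.

```python
import subprocess
import sys


def tee_stdout(cmd, log_path):
    """Run cmd, copying its stdout to our stdout and to log_path as it arrives."""
    with open(log_path, "w") as log:
        callee = subprocess.Popen(cmd, stdout=subprocess.PIPE,
                                  universal_newlines=True)
        # readline() returns as soon as the child emits a full line,
        # so output is forwarded gradually instead of all at once at exit.
        for line in iter(callee.stdout.readline, ""):
            for sink in (sys.stdout, log):
                sink.write(line)
                sink.flush()
        callee.stdout.close()
        return callee.wait()  # exit status of the child
```

Something like `tee_stdout(["python", "tester.py"], "out.log")` would then behave roughly like `python tester.py | tee out.log`. Note that the child may still buffer its own output when it detects a pipe, in which case lines arrive in bursts regardless of how the parent reads.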

badp 2010-06-08 13:09:44

This is suboptimal on Linux, since Linux provides a dedicated [`tee(f_in, f_out, len, flags)`](http://linux.die.net/man/2/tee) API, but that's not the point, right?

badp 2010-06-08 13:44:25

I updated the question. The problem is that I could not find how to use subprocess to get the data from the two pipes gradually, rather than all at once at the end of the process.

Sorin Sbarnea 2010-06-08 15:15:41

@Sorin, what if you replaced `read(1024)` with `read(1)`?

badp 2010-06-09 05:53:39

I know that your code should work, but one small requirement breaks the entire logic: I want to be able to distinguish between stdout and stderr. That means I have to read from both of them, but I do not know which one will get new data next. Please take a look at the example code.

Sorin Sbarnea 2010-06-09 07:06:50

@Sorin, that means you'll have to use two threads: one reads `stdout`, the other reads `stderr`. If you are going to write both to the same file, you can acquire a lock on the sinks when you start reading and release it after writing a line terminator. :/
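A minimal sketch of that two-thread idea, assuming line-oriented output. The `tee_both` helper and the `stdout:`/`stderr:` tags written to the shared log are my own invention for illustration, not something from the question.

```python
import subprocess
import sys
import threading


def tee_both(cmd, log_path):
    """Run cmd; echo both streams and log them, tagged, to one shared file."""
    lock = threading.Lock()  # one writer at a time, so lines never interleave

    def pump(pipe, terminal, label, log):
        # readline() blocks only this thread, so the other pipe keeps flowing.
        for line in iter(pipe.readline, ""):
            with lock:
                terminal.write(line)
                terminal.flush()
                log.write("%s: %s" % (label, line))
        pipe.close()

    with open(log_path, "w") as log:
        callee = subprocess.Popen(cmd, stdout=subprocess.PIPE,
                                  stderr=subprocess.PIPE,
                                  universal_newlines=True)
        threads = [
            threading.Thread(target=pump,
                             args=(callee.stdout, sys.stdout, "stdout", log)),
            threading.Thread(target=pump,
                             args=(callee.stderr, sys.stderr, "stderr", log)),
        ]
        for thread in threads:
            thread.start()
        for thread in threads:
            thread.join()
        return callee.wait()
```

The lock only guarantees that each line is written whole. The relative order of stdout and stderr lines still depends on which thread happens to run first.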

badp 2010-06-09 14:02:58

Using threads for this does not sound too appealing to me; maybe we'll find something else. It's strange that this is such a common issue, yet nobody has provided a complete solution for it.

Sorin Sbarnea 2010-06-09 17:35:02

@badp I tried the threads approach but it doesn't work. I updated the question to include the new example.

Sorin Sbarnea 2010-06-30 14:56:36

@Sorin The output you have posted _is_ ordered. You had `line1 line3 line5 line7 line9` on stderr, `line0 line2 line4 line6 line8` on stdout. Sure, in that run the `stderr` thread happened to get output first, which meant you had `line1 line0 line3 line2 line5 line4...` instead of `line0 line1 line2 line3 line4 line5...` -- but you didn't get `line0 line3 line5 line1 line2...` or `line4 line2 line1 line0 line6...` or `line0 liline1 line3 linne2 line3e5...`. I'm afraid that for a program that has to accept arbitrary input this kind of nondeterminism is unavoidable, if not even necessary.

badp 2010-06-30 16:48:31

Answer 2

+1 A:

If you don't want to interact with the process you can use the subprocess module just fine.

Example:

tester.py

import os
import sys

# List the current directory on stdout...
for name in os.listdir('.'):
    print(name)

# ...and send one message to stderr.
sys.stderr.write("Oh noes, a shrubbery!")
sys.stderr.flush()
sys.stderr.close()

testing.py

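import subprocess

p = subprocess.Popen(['python', 'tester.py'], stdout=subprocess.PIPE,
                     stdin=subprocess.PIPE, stderr=subprocess.PIPE,
                     universal_newlines=True)  # text rather than bytes

# communicate() waits for the process to exit, then returns all output at once.
stdout, stderr = p.communicate()
print(stdout, stderr)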

In your situation you can simply write stdout/stderr to a file first. You can also send input to your process's stdin with communicate, though I wasn't able to figure out how to continually interact with the subprocess.
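For instance, a sketch of the write-to-a-file-first idea, assuming you are happy to inspect the files only after the process has finished. The `run_to_files` name and its parameters are hypothetical.

```python
import subprocess


def run_to_files(cmd, out_path, err_path, input_text=None):
    """Run cmd with stdout/stderr going straight to files; return its exit code."""
    with open(out_path, "w") as out, open(err_path, "w") as err:
        p = subprocess.Popen(cmd, stdin=subprocess.PIPE, stdout=out,
                             stderr=err, universal_newlines=True)
        # communicate() writes input_text to the child's stdin, closes it,
        # and waits for the child to exit.
        p.communicate(input_text)
    return p.returncode
```

Because real file objects are handed to `Popen`, the child writes to the files directly and nothing passes through the parent, which also means nothing is echoed to the screen along the way.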

Wayne Werner 2010-06-08 13:36:11

This doesn't show you error messages in STDERR in context of STDOUT, which can make debugging shell-scripts etc nearly impossible.

RobM 2010-07-01 09:42:41

Meaning...? In this script anything delivered through STDERR is printed to the screen along with STDOUT. If you're referring to return codes, just use `p.poll()` to retrieve them.

Wayne Werner 2010-07-01 12:44:07

Answer 3

+1 A:

Try this:

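import sys

class TeeFunction(object):
    """File-like object that duplicates every write to two sinks."""

    def __init__(self, var1, var2):
        self.var1 = var1
        self.var2 = var2

    def __del__(self):
        # Close the sinks, but never the interpreter's own streams.
        # Compare against sys.__stdout__/sys.__stderr__, because sys.stderr
        # itself points at this object once it has been installed.
        for sink in (self.var1, self.var2):
            if sink is not sys.__stdout__ and sink is not sys.__stderr__:
                sink.close()

    def write(self, text):
        self.var1.write(text)
        self.var2.write(text)

    def flush(self):
        self.var1.flush()
        self.var2.flush()

log = "log.txt"  # placeholder file name; the original snippet left `log` undefined
stderrsav = sys.stderr
out = open(log, "w")
sys.stderr = TeeFunction(stderrsav, out)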

Catalin Festila 2010-06-11 08:54:44

This is exactly the approach I was about to suggest. Also worth adding some of the file data-descriptors, like `closed`.

RobM 2010-07-01 09:37:27

Just tried it, `subprocess.Popen` calls `fileno()`, triggering an exception.
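A small sketch of why that happens, assuming CPython's usual behaviour of asking any non-special `stdout`/`stderr` argument for its `fileno()`. The `WriteOnly` class and `popen_accepts` helper are hypothetical stand-ins, not code from the answer above.

```python
import subprocess
import sys


class WriteOnly(object):
    """Hypothetical minimal sink: write() and flush(), but no fileno()."""

    def write(self, text):
        sys.stderr.write(text)

    def flush(self):
        sys.stderr.flush()


def popen_accepts(sink):
    """Return True if Popen takes sink as stderr, False if it raises."""
    try:
        p = subprocess.Popen([sys.executable, "-c", "pass"], stderr=sink)
    except AttributeError:
        # Popen asked the object for fileno() and it has none.
        return False
    p.wait()
    return True
```

The child process needs an OS-level file descriptor to write to, so a pure-Python object cannot be handed over directly. It can still be fed by the parent, which reads from a `subprocess.PIPE` and calls `write()` on the object itself.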

RobM 2010-07-01 11:40:16

Answer 4

A:

In the end I had to implement a tee() command in Python myself.

You can get it from here http://github.com/ssbarnea/tendo/blob/master/tendo/tee.py

Currently it allows you to do things like:

 tee("python --v") # works just like os.system()

 tee("python --v", "log.txt") # file names

 tee("python --v", file_handle)

 import logging
 tee("python --v", logging.info) # receive a method

The only current limitation is that it is not able to differentiate between stderr and stdout, meaning that it will merge both of them.
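To illustrate what merging the two streams means in `subprocess` terms, here is a sketch (not the code from the link above) that folds stderr into stdout with `stderr=subprocess.STDOUT` and hands each line to a callable. The `tee_merged` name is hypothetical.

```python
import subprocess


def tee_merged(cmd, sink):
    """Hypothetical helper: pass each output line of cmd to the callable sink."""
    p = subprocess.Popen(cmd, stdout=subprocess.PIPE,
                         stderr=subprocess.STDOUT,  # fold stderr into stdout
                         universal_newlines=True)
    for line in iter(p.stdout.readline, ""):
        sink(line.rstrip("\n"))
    p.stdout.close()
    return p.wait()
```

With a single pipe there is only one stream to read, so no threads are needed, but once the lines arrive there is no way to tell which of them came from stderr.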

Sorin Sbarnea 2010-08-06 11:31:33
