tags:

views: 1061

answers: 7

How do I monitor the peak memory consumed by a process in Linux? This is not a program I can internally modify to measure peak memory usage.

I do not really want detailed measurements, nor do I want them to slow my program down excessively, so valgrind or anything similarly heavyweight is not what I am looking for. And as in earlier posts [http://stackoverflow.com/questions/774556/peak-memory-usage-of-a-linux-unix-process], time -v doesn't seem to report memory on my machine...

I can just run top or ps and extract the memory figure for my process ID with a simple script. However, my process runs for about 20-30 minutes, so I want to log the samples and take the max. I can tolerate coarse-grained samples, every minute or so. Specifically, how do I: 1. fork this simple mem-measure script in zsh? 2. kill it when the process under test finishes?
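(A minimal sketch of that pattern, with hypothetical names: `myprog` stands for the process under test, and the one-minute interval is the coarse sampling mentioned above. Running the program in the background and sampling from the foreground loop sidesteps the kill question entirely, since the loop ends on its own when the process dies.)

#!/bin/zsh
./myprog &                 # the process under test (hypothetical name)
testpid=$!

max=0
while kill -0 $testpid 2>/dev/null; do   # loop while the process is alive
    rss=$(awk '/VmRSS/{print $2}' /proc/$testpid/status 2>/dev/null)
    if [[ -n $rss && $rss -gt $max ]]; then max=$rss; fi
    sleep 60                             # coarse one-minute samples
done
echo "peak RSS: ${max} kB"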

A: 

Actually, what I said before:

"""

try

/usr/bin/time -v yourcommand

that should help. If you use only "time", bash will run its built-in version (which does not have "-v")

"""

does not work (returns 0).

I made the following perl script (that I called smaps):

#!/usr/bin/perl
use 5.010;
use strict;
use warnings;
my $max = 0;
# Poll until the process exits (the open fails once /proc/<pid>/ is gone).
while( open my $f, '<', "/proc/$ARGV[0]/smaps" ) {
  local $/; $_ = <$f>;
  close $f;
  # Sum the Rss of every mapping to get the total resident set (kB).
  my $rss = 0;
  $rss += $1 while /Rss:\s*(\d+)/g;
  $max = $rss if $rss > $max;
  open my $g, '>', '/tmp/max';
  say $g $max;
  close $g;
  sleep 1;  # once a second is plenty, and avoids a busy loop
}

And then I call it (for instance, to watch qgit's memory usage):

bash -c './smaps $$ & exec qgit'

Use single quotes so the "daughter" shell interprets $$ (which will be the PID of qgit after the exec, since exec replaces the shell without forking). This answer I did test :-D

HTH

Massa
Output in 2 pieces... still not useful :(

 User time (seconds): 16.50
 System time (seconds): 0.47
 Percent of CPU this job got: 100%
 Elapsed (wall clock) time (h:mm:ss or m:ss): 0:16.96
 Average shared text size (kbytes): 0
 Average unshared data size (kbytes): 0
 Average stack size (kbytes): 0
 Average total size (kbytes): 0
 Maximum resident set size (kbytes): 0
 Average resident set size (kbytes): 0
 Major (requiring I/O) page faults: 0
 Minor (reclaiming a frame) page faults: 255315
 Voluntary context switches: 1211
 Involuntary context switches: 1232
 Swaps: 0
 File system inputs: 0
 File system outputs: 68472
 Socket messages sent: 0
 Socket messages received: 0
 Page size (bytes): 4096

badkya

Memory usage fields do not change even if I run it for longer...

badkya
A: 

You could use a munin-node plugin to do this, but it's a little heavyweight. http://munin.projects.linpro.no/

Michael Sofaer
+3  A: 

Just use top with -n to iterate a specified number of times and -d to set the delay between updates; add -b (batch mode) so the output can be piped. You can then grab only the lines relevant to your process by grepping for its PID, like:

top -b -n 30 -d 60 | grep <process-id>

Read the top manual page for more information:

man top

Of course, you can also grab just the column you need by using awk.
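For example, something along these lines (the PID is a placeholder, and the RES column being the sixth field assumes the default top layout; adjust if yours differs):

pid=1234   # hypothetical PID of the process under test
top -b -n 30 -d 60 -p "$pid" | awk -v p="$pid" '$1 == p {print $6}'   # RES column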

licorna
A: 

/proc/pid/smaps, like /proc/pid/maps, only gives information about virtual memory mappings, not actual physical memory usage. top and ps give the RSS, which (depending on what you want to know) may not be a good indicator of memory usage.

One good bet, if you're running on a Linux kernel later than 2.6.28.7, is to use the Pagemap feature. There's a discussion of this and source for some tools at www.eqware.net/Articles/CapturingProcessMemoryUsageUnderLinux.

The page-collect tool is intended to capture the memory usage of ALL processes, so it probably imposes a greater CPU burden than you want. It should be easy to modify, however, to capture data for only a specific process ID; that would reduce the overhead enough that you could run it every few seconds. I haven't tried it, but I think the page-analyze tool should run unchanged.

EQvan

+1  A: 

Valgrind with massif should not be too heavy, but I'd still recommend using /proc. You can easily write your own monitoring script. Here is mine, for your convenience:

#!/bin/bash

ppid=$$
maxmem=0

"$@" &
pid=$(pgrep -P ${ppid} -n -f "$1") # $! may work here but not later
while [[ -n ${pid} ]]; do
    #mem=$(ps v | grep "^[ ]*${pid}" | awk '{print $8}')
    #the previous does not work with MPI
    mem=$(awk '/VmRSS/{print $2}' /proc/${pid}/status 2>/dev/null)
    if [[ ${mem} -gt ${maxmem} ]]; then
        maxmem=${mem}
    fi
    sleep 1
    savedpid=${pid}
    pid=$(pgrep -P ${ppid} -n -f "$1")
done
wait ${savedpid} # the job has already finished; wait just collects its exit status
exitstatus=$?    # exit status of wait == exit status of "$@"
echo -e "Memory usage for $@ is: ${maxmem} KB. Exit status: ${exitstatus}\n"
Davide
A: 

This depends on which kind of memory you want to monitor.

Monitoring the M.a.p.d figure below, summed over all of a process's mappings (per process, not per thread), lets you track the malloc'ed physical memory each process uses.

You could write a C program to make this even faster, but awk seemed like the minimal tool for the purpose.

  • M.a anonymous mapped memory
    • .p private
      • .d dirty == malloc/mmapped heap and stack allocated and written memory
      • .c clean == malloc/mmapped heap and stack memory once allocated, written, then freed, but not reclaimed yet
    • .s shared
      • .d dirty == there should be none
      • .c clean == there should be none
  • M.n named mapped memory
    • .p private
      • .d dirty == file mmapped written memory private
      • .c clean == mapped program/library text private mapped
    • .s shared
      • .d dirty == file mmapped written memory shared
      • .c clean == mapped library text shared mapped

I prefer to collect the numbers as follows, which gives the real figures with the least overhead. Summing them up breaks down what ps reports as RSS into more accurate components, so the numbers are less confusing.

M.a.p.d:

 awk '/^[0-9a-f]/{if ($6=="") {anon=1}else{anon=0}} /Private_Dirty/{if(anon) {asum+=$2}else{nasum+=$2}} END{printf "sum=%d\n",asum}' /proc/<pid>/smaps

M.a.p.c:

 awk '/^[0-9a-f]/{if ($6=="") {anon=1}else{anon=0}} /Private_Clean/{if(anon) {asum+=$2}else{nasum+=$2}} END{printf "sum=%d\n",asum}' /proc/<pid>/smaps

M.n.p.d:... and so on
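(Presumably the named variants just read the nasum accumulator that the commands above already compute; for instance, for M.n.p.d, by analogy and untested beyond that:)

awk '/^[0-9a-f]/{if ($6=="") {anon=1}else{anon=0}} /Private_Dirty/{if(!anon) {nasum+=$2}} END{printf "sum=%d\n",nasum}' /proc/<pid>/smaps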

holmes
A: 

Rather than polling /proc a billion times a second, why not just process the output from strace?

http://tstarling.com/blog/2010/06/measuring-memory-usage-with-strace/
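A rough sketch of that approach (`myprogram` is a placeholder; strace's `-e trace=memory` class selects the memory-management syscalls, and reconstructing the running total from the log is what the linked post describes):

# Log brk/mmap/munmap instead of sampling /proc:
strace -e trace=memory -o mem.log ./myprogram
# mem.log now lists each allocation syscall with its size; replaying it
# yields the running total and hence the peak.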

Tim Starling