ansaurus

Question

Answer 1

+2 A:

Since Mnesia queries are just Erlang functions, I would imagine you can profile them the same way you would profile your own Erlang code. http://www.erlang.org/doc/efficiency%5Fguide/profiling.html#id2266192 has more information on the Erlang profiling tools available.

Update: As a test, I ran this at home on a test Mnesia instance. Using fprof to trace a Mnesia qlc query returned the output sampled below, so it definitely includes more information than just the query call.

....
{[{{erl_lint,pack_errors,1},                      2,    0.004,    0.004}],     
 { {lists,map,2},                                 2,    0.004,    0.004},     %
 [ ]}.

{[{{mnesia_tm,arrange,3},                         1,    0.004,    0.004}],     
 { {ets,first,1},                                 1,    0.004,    0.004},     %
 [ ]}.

{[{{erl_lint,check_remote_function,5},            2,    0.004,    0.004}],     
 { {erl_lint,check_qlc_hrl,5},                    2,    0.004,    0.004},     %
 [ ]}.

{[{{mnesia_tm,multi_commit,4},                    1,    0.003,    0.003}],     
 { {mnesia_locker,release_tid,1},                 1,    0.003,    0.003},     %
 [ ]}.

{[{{mnesia,add_written_match,4},                  1,    0.003,    0.003}],     
 { {mnesia,add_match,3},                          1,    0.003,    0.003},     %
 [ ]}.

{[{{mnesia_tm,execute_transaction,5},             1,    0.003,    0.003}],     
 { {erlang,erase,1},                              1,    0.003,    0.003},     %
 [ ]}.

{[{{mnesia_tm,intercept_friends,2},               1,    0.002,    0.002}],     
 { {mnesia_tm,intercept_best_friend,2},           1,    0.002,    0.002},     %
 [ ]}.

{[{{mnesia_tm,execute_transaction,5},             1,    0.002,    0.002}],     
 { {mnesia_tm,flush_downs,0},                     1,    0.002,    0.002},     %
 [ ]}.

{[{{mnesia_locker,rlock_get_reply,4},             1,    0.002,    0.002}],     
 { {mnesia_locker,opt_lookup_in_client,3},        1,    0.002,    0.002},     %
 [ ]}.

{[ ],
 { undefined,                                     0,    0.000,    0.000},     %
 [{{shell,eval_exprs,6},                          0,   18.531,    0.000},      
  {{shell,exprs,6},                               0,    0.102,    0.024},      
  {{fprof,just_call,2},                           0,    0.034,    0.027}]}.
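A trace like the one above can be produced along these lines. This is a minimal sketch, not the exact commands used for the output shown: the module name `profile_query`, the output file name `query.analysis`, and the query itself (a plain scan of a table passed in as `Tab`) are assumptions made for illustration.

```erlang
-module(profile_query).
-export([run/1]).
-include_lib("stdlib/include/qlc.hrl").

%% Profile a QLC query over the Mnesia table Tab with fprof.
%% The table name and the query are placeholders; substitute your own.
%% Returns the number of rows the query produced.
run(Tab) ->
    Query = fun() ->
                    qlc:e(qlc:q([Rec || Rec <- mnesia:table(Tab)]))
            end,
    %% Run the transaction under fprof tracing
    %% (the trace is written to "fprof.trace" by default).
    {atomic, Rows} = fprof:apply(mnesia, transaction, [Query]),
    %% Turn the raw trace into profile data held by the fprof server.
    ok = fprof:profile(),
    %% Write the analysis, sorted by own time, to a file.
    ok = fprof:analyse([{dest, "query.analysis"}, {sort, own}]),
    length(Rows).
```

The resulting file lists callers and callees per function in the same layout as the sample above.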

Jeremy Wall 2009-12-04 23:11:36

As Gordon already suggested, these tools would only yield something like "you're spending too much time in mnesia:match_object and qlc:eval", while I guess a real answer would be "you are joining against a non-indexed column here and there, making your Mnesia queries slow".

Zed 2009-12-04 23:35:55

Not really. qlc:eval and mnesia:match_object call other Erlang functions. I'm sure you could get more information than just that you're spending most of your time in mnesia:match_object. I'll see if I can try it out to be sure, though.

Jeremy Wall 2009-12-05 00:55:26

So did this fprof trace give you any hints on how you should alter your schema?

Zed 2009-12-05 08:48:54

Zed is sort of right. I am plowing through my fprof output now :( I did 'work out' that I was using mnesia:match_object/3 when I should have had mnesia:index_match_object/4, but it's not a query analyzer :(

Gordon Guthrie 2009-12-05 09:38:31
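The difference between the two calls named in the comment above can be sketched as follows. The `person` record, its `city` index, and the module name `index_lookup` are hypothetical and only serve the illustration; they are not taken from the original question.

```erlang
-module(index_lookup).
-export([setup/0, by_city_match/1, by_city_index/1]).

%% Hypothetical record; id is the key, city carries a secondary index.
-record(person, {id, name, city}).

%% Start Mnesia and create a RAM table with an index on city.
setup() ->
    ok = mnesia:start(),
    {atomic, ok} = mnesia:create_table(person,
        [{attributes, record_info(fields, person)},
         {index, [city]}]),
    ok.

%% Generic pattern match: the caller does not say which index to use.
by_city_match(City) ->
    Pattern = #person{city = City, _ = '_'},
    mnesia:transaction(
      fun() -> mnesia:match_object(person, Pattern, read) end).

%% Index-driven match: the third argument names the indexed
%% attribute position explicitly.
by_city_index(City) ->
    Pattern = #person{city = City, _ = '_'},
    mnesia:transaction(
      fun() ->
              mnesia:index_match_object(person, Pattern,
                                        #person.city, read)
      end).
```

Both functions return the same rows; how much they differ in cost depends on the table size and on which indices exist, which is something a profile can confirm but not suggest on its own.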

Answer 2

+3 A:

I hung back because I don't know much about either Erlang or Mnesia, but I know a lot about performance tuning, and from the discussion so far it sounds pretty typical.

These tools (fprof etc.) sound like most tools that get their fundamental approach from gprof, namely instrumenting functions, counting invocations, sampling the program counter, etc. Few people have examined the foundations of that practice for a long time. Your frustrations sound typical for users of tools like that.

There's a method that is less-known that you might consider, outlined here. It is based on taking a small number (10-20) of samples of the state of the program at random times, and understanding each one, rather than summarizing. Typically, this means examining the call stack, but you may want to examine other information as well. There are different ways to do this, but I just use the pause button in a debugger. I'm not trying to get precise timing or invocation counts. Those are indirect clues at best. Instead I ask of each sample "What is it doing and why?" If I find that it is doing some particular activity, such as performing the X query where it's looking for y type answer for the purpose z, and it's doing it on more than one sample, then the fraction of samples it's doing it on is a rough but reliable estimate of what fraction of the time it is doing that. Chances are good that it is something I can do something about, and get a good speedup.

Here's a case study of the use of the method.

Mike Dunlavey 2009-12-08 03:18:57

Very interesting comment. Not entirely sure how to do it in Erlang though, but it does suggest a number of ways to do things. We have quite a complex set of data access routines that do some (strange) stuff, and we know we have a big data access layer rewrite to do, so I will bear this technique in mind. Thanks.

Gordon Guthrie 2009-12-08 17:03:32

@Gordon: I look for a debugger with a pause or ctrl-break button. There are tools like `pstack` that let you get stackshots from outside. There are profiling tools that sample the call stack, but they don't necessarily do it in a useful way, i.e. sampling on wall-clock time, and letting you analyze individual representative samples.

Mike Dunlavey 2009-12-08 17:08:28

Mike Dunlavey 2009-12-08 17:17:51

Answer 3

+2 A:

Mike Dunlavey's suggestion reminds me of redbug, which allows you to sample calls in production systems. Think of it as an easy-to-use erlang:trace that doesn't give you enough rope to hang your production system.

Using something like this call should give you lots of stack traces to identify where your mnesia transactions are called from:

redbug:start(10000,100,{mnesia,transaction,[stack]}).

It's not possible to get call durations for these traces, though.

If you have organized all Mnesia lookups into modules that export an API to perform them, you could also use redbug to get a call frequency for specific queries only.

Christian 2009-12-13 20:45:22

Thanks, I will look into it...

Gordon Guthrie 2009-12-13 22:20:59

@Christian: That's helpful. My theory is that you don't need to know call durations. Rather you need to know what % of time each call is active (what % of stack samples it is on, assuming samples are taken at "random" times) because that tells you what fraction of overall time that call is responsible for, that you could save if you could remove it.

Mike Dunlavey 2009-12-14 21:10:13
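The stack-sampling idea discussed in this thread can be approximated in plain Erlang without a debugger. The sketch below is an assumption about how one might do it, not something proposed by the posters: it reads a process's stack with `erlang:process_info/2` at random intervals and reports, for each function, the fraction of samples it appeared on. The module and function names are hypothetical, and the sampled stack is truncated to the VM's backtrace depth, so deep call chains are only partly visible.

```erlang
-module(stack_sample).
-export([sample/3, fractions/1]).

%% Take up to N stack samples of Pid, sleeping a random
%% 1..MaxGapMs milliseconds before each one.
%% Each sample is a list of {Module, Function, Arity} frames.
sample(_Pid, 0, _MaxGapMs) ->
    [];
sample(Pid, N, MaxGapMs) when N > 0 ->
    timer:sleep(rand:uniform(MaxGapMs)),
    case erlang:process_info(Pid, current_stacktrace) of
        {current_stacktrace, Stack} ->
            Frames = [{M, F, A} || {M, F, A, _Loc} <- Stack],
            [Frames | sample(Pid, N - 1, MaxGapMs)];
        undefined ->
            []  % the process has exited; stop sampling
    end.

%% For each function, compute the fraction of samples whose stack
%% contains it, sorted from most to least frequent.
fractions([]) ->
    [];
fractions(Samples) ->
    Total = length(Samples),
    Count = fun(MFA, Acc) ->
                    maps:update_with(MFA, fun(C) -> C + 1 end, 1, Acc)
            end,
    Counts = lists:foldl(
               fun(Stack, Acc) ->
                       %% usort so a recursive function counts once per sample
                       lists:foldl(Count, Acc, lists:usort(Stack))
               end, #{}, Samples),
    Fractions = [{MFA, C / Total} || {MFA, C} <- maps:to_list(Counts)],
    lists:reverse(lists:keysort(2, Fractions)).
```

For example, `stack_sample:fractions(stack_sample:sample(Pid, 20, 500))` takes 20 samples of `Pid` over a few seconds. The individual samples are worth reading one by one, as suggested above, before looking at the summary.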

Profiling Mnesia Queries

Expanded In The Light Of Discussion
