ansaurus

Question

Can anyone explain how the oracle "hash group" works?

Answer 1

+1 A:

Hash group (and hash joins, as well as other operations such as sorts etc.) can use either optimal (i.e. in-memory), one-pass or multi-pass methods. The last two methods use TEMP storage and is thus much slower.

By increasing the number of possible items you might have exceeded the number of items that will fit in memory reserved for this type of operations.

Try looking at v$sql_workarea_active whilst the query is running, to see if this is the case. Or look at v$sql_workarea for historical information. It will also give you an indication of how much memory and/or temp space is needed for the operation.

If turns out to be the actual problem - try increasing the pga_aggregate_target initialization parameter, if possible. The amount of memory available for optimal hash/sort operations is usually around a 5% fraction of the pga_aggregate_target.

See the Performance Tuning Guide for more detail.

CaptainPicard 2008-09-30 22:38:12

Answer 2

+2 A:

"'m guessing that it was creating hash entries for each unique x,y,probability value and then summing probability for each unique x,y value" -- almost certainly so, since that is what the query requires.

You can check for the likelihood of a query requiring temporary dfisk space to complete a sort or group-by (etc) by using the explain plan.

explain plan for
select x,y,sum(probability) from .... group by x,y
/

select * from table(dbms_xplan.display)
/

If the optimizer can correctly deduce from statistics the approximate unique number of combinations of x and y then there's a pretty good chance that in the TempSpc column of the output of the second query it will show you just how much disk space (if any) will be required to complete the query (no column = no disk space requirement).

Way too much information here: http://download.oracle.com/docs/cd/B19306_01/appdev.102/b14258/d_xplan.htm#i999234

If the temp space usage is high then as CaptP says, it may be time for some memory tweakage. On databases that perform a lot of sorts and aggregations it is common to specify a higher PGA target than an SGA target.

David Aldridge 2008-10-01 03:05:25

Answer 3

A:

Is your *PGA_AGGREGATE_TARGET* set to zero by any chance? It's unlikely that it's the HASH GROUPBY on its own that caused the issue, it's probably something before it or after it. Downgrade your *OPTIMIZER_FEATURES_ENABLE* to 10.1.0.4 and rerun the query - you'll see that now you'll get a SORT GROUPBY which should pretty much always be outperformed by a HASH GROUPBY, unless your PGA sizing is set to MANUAL and your hash work area is undersized.

Andrew from NZSG 2008-10-01 03:21:15

Hmmm, there's also a hidden parameter to disable hash group by ... can't remember the name right now though ...

David Aldridge 2008-10-01 04:21:22

ansaurus

tags:

views:

answers:

Can anyone explain how the oracle "hash group" works?

related questions