ansaurus

Question

Oracle bug? SELECT returns no dupes, INSERT from SELECT has duplicate rows

Answer 1

A:

Several options occur to me.

The dupes you see were already in the destination table ??
If in your Select, you reference the table you are Inserting into, ( ? ), then The Insert is interacting with the select in your combined

Insert ... Select ... From ...

In such a way (cartesian products ?) as to create the duplicates

Charles Bretana 2009-12-08 17:18:52

Point 2 seems to imply that the SELECT could "see" records that the INSERT is inserting, which would not be the case under Oracle's read consistency model.

David Aldridge 2009-12-08 17:21:06

Thanks for the reply. Unfortunately neither applies in this case. The select is going into an empty table. All of the tables in the query are on the database link. The only thing passed into the query is a date (e.g. INSERT INTO target SELECT * FROM view WHERE view.datecol = dQueryDate)

Joe Harris 2009-12-08 17:25:35

Answer 2

A:

I can't help but think that maybe you are experiencing a side-effect from something else related to the table. Are there any triggers which may be manipulating data?

Adam Hawkes 2009-12-08 17:33:41

Hmm, good question. I don't have access to the DDL on the other side but I'll find out. There are definitely no triggers on the table being inserted into.

Joe Harris 2009-12-08 17:40:47

Answer 3

A:

How did you determine that there are no dupes in the original table?

As others have noted this seems to be the simpledst explanation for this strange behaviour.

IronGoofy 2009-12-08 19:26:06

There are very strong keys in the original *tables*. What I'm seeing is duplicates of those keys, but only when inserted into another table.

Joe Harris 2009-12-08 19:35:58

Is there a target table for each originating table? Or are you "massaging" the data in between? If there is a view (or views) in between, what's the definition of that?

IronGoofy 2009-12-08 20:10:33

Answer 4

+2 A:

One thing that comes to mind is that generally an optimizer plan for a SELECT will prefer a FIRST_ROWS plan to give rows back to the caller early, but an INSERT...SELECT will prefer an ALL_ROWS plan as it is going to have to deliver the full dataset. I'd check the query plans using DBMS_XPLAN.DISPLAY_CURSOR (using the sql_id from V$SQL).

I have a semi-complex view running over a DB link; 4 inner joins over large-ish tables and 5 left joins over mid-size tables. ... All of the tables in the query are on the database link

Again, a potential trouble-spot. If all the tables in the SELECT were on the other end of the DB link, the whole query would be sent to the remote database and the resultset returned. Once you throw the INSERT in, it is more likely that the local database will take charge of the query and pull all the data from the child tables over. But that may depend on whether the view is defined in the local database or the remote database. In the latter case, as far as the local optimizer is concerned there is just one remote object and it gets data from that, and the remote database will do the join.

What happens if you just go to the remote DB and do the INSERT on a table there ?

Gary 2009-12-08 21:34:50

@Gary Nice pointer. I was able to get to the bottom of this by using "EXPLAIN PLAN FOR {my query};" and "SELECT * FROM TABLE(dbms_xplan.display);". The explain that *actually gets used* for the INSERT is very different from the SELECT.{Note added to original question}

Joe Harris 2009-12-09 10:59:52

Answer 5

A:

Check your JOINs carefully. Potentially you have no duplicates in the individual tables, but underspecified joins can cause inadvertant CROSS JOINs so that your result set has duplicates due to multiplicity and, when inserted, this violates a uniqueness constraint in your destination table.

What I do in this case is to nest the query in a view or CTE and try to detect the duplicates straight from the SELECT:

WITH resultset AS (
    -- blah, blah
)
SELECT a, b, c, COUNT(*)
FROM resultset
GROUP BY a, b, c
HAVING COUNT(*) > 1

Cade Roux 2009-12-08 21:41:15

Answer 6

A:

I would suggest getting a plan on the query you are running and looking for a CARTESIAN JOIN in there. This could indicate a missing condition that is causing duplicated rows.

WW 2009-12-09 00:34:57

Answer 7

A:

Diagnosing the problem :

Why not create a table that will hold all columns from your view and insert the complete view resultset ?

Then you will be able to see if your view is indeed only returning unique rows.

Steve De Caux 2009-12-09 08:46:47

Answer 8

A:

AS @Pop has already suggested this behaviour could happen if you are using a different login in SQLPlus to the login when your insert is running. (That is if the other login has a table/view/synonym with the same name)

hamishmcn 2009-12-09 09:02:59

ansaurus

tags:

views:

answers:

Oracle bug? SELECT returns no dupes, INSERT from SELECT has duplicate rows

related questions