ansaurus

Question

How do I prevent the loading of duplicate rows in to an Oracle table?

Answer 1

+2 A:

Use Oracle MERGE statement. Some explanations here.

Cătălin Pitiș 2009-10-19 13:14:57

Answer 2

+2 A:

You dint inform about what release of Oracle you have. Have a look at there for merge command.

Basically like this

---- Loop through all the rows from a record temp_emp_rec
MERGE INTO hr.employees e
     USING temp_emp_rec t
     ON (e.emp_ID = t.emp_ID)
     WHEN MATCHED THEN
    --- _You can update_
    UPDATE
     SET first_name = t.first_name,
          last_name = t.last_name
    --- _Insert into the table_
    WHEN NOT MATCHED THEN
    INSERT (emp_id, first_name, last_name)
    VALUES (t.emp_id, t.first_name, t.last_name);

Guru 2009-10-19 13:15:16

Answer 3

+5 A:

What do you mean by "duplicate"? If you have a column which defines a unique row you should setup a unique constraint against that column. One typically creates a unique index on this column, which will automatically setup the constraint.

EDIT: Yes, as commented below you should setup a "bad" file for SQL*Loader to capture invalid rows. But I think that establishing the unique index is probably a good idea from a data-integrity standpoint.

Adam Hawkes 2009-10-19 15:32:28

A very good point - I should have mentioned though that I am loading up to 50 million rows per day and therefore want to use SQL*Loader to carry out the data load. I believe that SQL*Loader will fail the entire file if it contains duplicates which violate a unique index.

2009-10-21 11:11:29

You can tell SQL*Loader what to do with rejected rows. Try specifying a 'badfile' parameter on the command line, with a suitably high 'errors' parameter.

Hobo 2009-10-21 11:42:12

@Adam - sorry, that was directed at ginsoakedboy, not you. I reckon a combination of a unique index and suitable SQL*Loader parameters is the way to go.

Hobo 2009-10-21 15:51:59

Thanks - I will try that out

2009-10-21 16:51:09

Answer 4

+1 A:

I would use integrity constraints defined on the appropriate table columns.

This page from the Oracle concepts manual gives an overview, if you also scroll down you will see what types of constraints are available.

carpenteri 2009-10-20 11:20:33

A good approach to be sure, but in order to meet my performance needs (50 million rows/day) I am using SQL*Loader to load the rows. I think that SQL*Loader will fail entire files if they contain duplicates if I add such an index, which isn't acceptable for my application.

2009-10-21 11:16:22

ansaurus

tags:

views:

answers:

How do I prevent the loading of duplicate rows in to an Oracle table?

related questions