ansaurus

Question

Answer 1

A:

So what you currently have is something like this ...

DESCRIPTION            SUPPLIER_CODE  COST_EX_TAX
Widget                 X23                  42.00 
Brass gadget           X23                  42.00 
Flange                 X42                  23.00 
Flange, steel          X42                  23.00

... and what you want is ...

DESCRIPTION            SUPPLIER_CODE  COST_EX_TAX
Brass gadget, Widget   X23                  42.00 
Flange, Flange, steel  X42                  23.00

This still doesn't seem like the right approach. That concatenated DESCRIPTION seems wrong to me. However you know your data and your customer's requirements better than I do.

APC 2010-01-29 05:37:29

I think maybe I didn't explain properly (not sure how else to phrase it in thread title..)What I'm looking to do is to alter the rows with the new data, then delete duplicates leaving a single row for each.. Does that make more sense? How would I describe this process?

Richo 2010-01-29 05:44:27

Oh, and the reason for the dupes is that I'm importing data from a customers lists into our system, for their old purpose, the customers data format worked but it's not suitable for our model.

Richo 2010-01-29 05:48:59

That's exactly what I'm after!However I think I may have sorted it with bkm's solution.

Richo 2010-01-29 06:47:00

Have it sorted now, did some manual kludgery with the oid's by hand to translate from a bunch of rows each containing a single array to being a logical array with a stack of oid's in it in vim.Thanks again for your patience.

Richo 2010-01-29 07:16:25

@Richo - you're welcome. Thanks for the points, I'm not sure that I actually provided you with a solution. Your data sample does make the scenario a lot clearer - although I *still* think the data model looks wrong ;)

APC 2010-01-29 07:21:29

While you didn't personally solve my problem, the problem is now solved, and I don't think it'd be solved without your help; so I'd argue you were key to my solving it.I agree the data model isn't amazing, but unfortunately 'tell the customers their data is dodgy and unworkable' is not an option, so here I am.Thanks for your time. Have a good one.

Richo 2010-01-29 07:39:57

Answer 2

+2 A:

The following approach will work

Identify only the duplicate rows and store them in a new table.
Delete duplicate rows from parent table
Concatenate the description column in the table containing only duplicate rows. Concatenate using a group by clause.
Insert all rows from the result of step 3 into the original table.

bkm 2010-01-29 05:53:29

Thanks bkm. I can do the move into the temp table, I'm a bit vague on how it actually changes my problem though? I was attempting to do it in place with a group by.. and not really getting anywhere. More specifically, I'm not sure how moving the rows to a table that has fewer (but not 0) rows I'm not interested in helps?Do you have any links?

Richo 2010-01-29 05:58:10

Oh wait, I think I just got it:1) Create temp table2) Insert only the refined rows generated by the cat + group by into temp table3) Hose all dupes in original table4) Reinsert only clean rows from temp table?Thanks, I'll try that.

Richo 2010-01-29 06:01:34

ansaurus

tags:

views:

answers:

Rename data, then delete duplicates

related questions