Hello,

I have written an importer which copies data from a flat table into several other tables, mapping them according to a given XML file. This is for a shop database, where each product can have several properties and each property can exist in several languages, so the data quickly adds up to quite a lot.

There are over 50,000 rows as of right now. My current import code looks like this:

string query = "SELECT * FROM " + tableDataProducts + " ORDER BY "
            + productIdField;

        DataSet importData = new DataSet();
        Hashtable data = new Hashtable();

        db.DoSelectQuery(query, ref importData, tableDataProducts);

        foreach (DataRow row in importData.Tables[0].Rows) {
            foreach (MapEntry e in mapping[tableObjPropertyValue]) {
                string value = row[e.ImportXmlAttributeName].ToString();

                if (value.Equals("null",
                            StringComparison.OrdinalIgnoreCase)
                        || value.Length < 1)
                    continue;

                data.Clear();

                data.Add("ProductSN", productIdToSn[row[
                    productIdField].ToString()]);
                data.Add("ObjPropertyGroupID", "0");
                data.Add("ObjPropertyID", e.ObjPropertyID);
                data.Add("LanguageID", e.LanguageID);
                data.Add("Value", value);

                db.DoPreparedInsertQuery(tableObjPropertyValue, data);
            }
        }

As can be seen, I first read the data from the flat import table. I then iterate over each row, which represents a single product, and for each product I iterate over the property mapping, copying each property into a Hashtable called data. Null and empty values are skipped.

After all columns are copied into the hashtable, I insert the row.

Currently, this approach only processes around 700 rows per minute, which results in this import taking approximately one hour. How can I optimize this?

[EDIT]

Here is a simplified version of the XML, as the actual XML is way too big to show here:

<?xml version="1.0" encoding="utf-8" standalone="yes"?>
<DATAPACKET Version="2.0">
<METADATA>  
<FIELDS>
   <FIELD FieldName="source_id" DisplayLabel="source_id" FieldType="String" FieldClass="TField"/>
   <FIELD FieldName="data_field" DisplayLabel="data_field" FieldType="Unknown" FieldClass="TField"/>
</FIELDS>
</METADATA>
<ROWDATA>
   <ROW source_id="data_1" data_field="some string"/>
   <ROW source_id="data_2" data_field="another string"/>
</ROWDATA>
</DATAPACKET>

This XML is imported into a single table, with each FIELD becoming a column. There is a mapping XML which looks as follows:

<?xml version="1.0" encoding="utf-8" standalone="yes"?>
<DATAPACKET Version="2.0">
<METADATA>  
<FIELDS>
   <FIELD FieldName="source_id" DisplayLabel="source_id" FieldType="String" FieldClass="TField"/>
   <FIELD FieldName="target" DisplayLabel="target" FieldType="Unknown" FieldClass="TField"/>
</FIELDS>
</METADATA>
<ROWDATA>
   <ROW source_id="data_1" target="products::id"/>
   <ROW source_id="data_2" target="products::name"/>
</ROWDATA>
</DATAPACKET>

The target attribute contains the target table and column in the following format: target='table::column'.
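For illustration, splitting such a target into its parts could look like this (a simplified sketch; the helper name is made up and not part of my actual code):

// Hypothetical helper: splits a mapping target such as "products::name"
// into its table and column parts.
static void ParseTarget(string target, out string table, out string column)
{
    string[] parts = target.Split(new[] { "::" }, StringSplitOptions.None);

    if (parts.Length != 2)
        throw new FormatException("Expected 'table::column', got: " + target);

    table = parts[0];   // e.g. "products"
    column = parts[1];  // e.g. "name"
}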

A: 

Bulk operations are lightning fast in SQL. If you can translate your XML document to a series of SQL queries, that could dramatically improve performance.
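For example, the mapping XML from the question could be translated into one INSERT ... SELECT per target table along these lines (a sketch only; the element names follow the XML above, while the method name, flatTable parameter, and use of LINQ to XML are my own assumptions):

using System;
using System.Collections.Generic;
using System.Linq;
using System.Xml.Linq;

// Build one "INSERT ... SELECT" statement per target table,
// so the copying runs entirely inside the database.
static IEnumerable<string> BuildInsertQueries(string mappingXmlPath, string flatTable)
{
    XDocument doc = XDocument.Load(mappingXmlPath);

    // Each ROW maps a flat-table column (source_id) to "table::column".
    var mappings = doc.Descendants("ROW")
        .Select(r => new {
            Source = (string)r.Attribute("source_id"),
            Target = ((string)r.Attribute("target"))
                .Split(new[] { "::" }, StringSplitOptions.None)
        })
        .GroupBy(m => m.Target[0]); // group by target table

    foreach (var table in mappings)
    {
        string columns = string.Join(", ",
            table.Select(m => m.Target[1]).ToArray());
        string sources = string.Join(", ",
            table.Select(m => m.Source).ToArray());

        yield return "INSERT INTO " + table.Key + " (" + columns + ")"
            + " SELECT " + sources + " FROM " + flatTable;
    }
}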

Edit: I don't grok what you're trying to do, but it seems to me you start with a flat table, and end with a bunch of other tables. Why not do it like this:

insert into Product
(id, name)
select source_id, data_field
from FlatTable

This is pretty fast, at the cost of being less flexible than XML mappings.

Andomar
I've added a simplified XML example.
Michael Barth
Could you add a simplified row of the "flat" table? Is there a special meaning to the "data_1" labels, for example does "data_1" correspond to "source_id" because source_id is the first field?
Andomar
The flat table has one row for every field of the first XML (in this case, it has "source_id" and "data_field" columns). The source_id is used by the mapping to identify the column in the flat table; that column's "data_field" value is then copied over into the target (which holds the table and column info for the mapping).
Michael Barth
Alas, that's not possible, since there are some dependencies (e.g. ObjPropertyValue needs an ObjPropertyID, which is generated automatically while filling the table ObjProperty; my program relates those) and other values need to be modified (some IDs are generated by concatenating other IDs) or detected and inserted while importing (the MimeType based on the file extension).
Michael Barth
A: 

Okay, two things: First, I've changed the insert-per-row approach to caching around 1,000 rows and inserting them with a single MySQL INSERT (see multiple inserts).
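The batching looks roughly like this (a simplified sketch, not my actual code; the execute step at the end stands in for the database wrapper):

using System;
using System.Collections.Generic;
using System.Text;

// Sketch of the batching idea, using MySQL's multi-row VALUES syntax:
// INSERT INTO t (a, b) VALUES (...), (...), ...
static void InsertBatched(string table, string[] columns,
                          List<string[]> rows, int batchSize)
{
    for (int offset = 0; offset < rows.Count; offset += batchSize)
    {
        StringBuilder sql = new StringBuilder();
        sql.Append("INSERT INTO ").Append(table)
           .Append(" (").Append(string.Join(", ", columns))
           .Append(") VALUES ");

        int end = Math.Min(offset + batchSize, rows.Count);
        for (int i = offset; i < end; i++)
        {
            if (i > offset)
                sql.Append(", ");

            // NOTE: values are spliced in directly for brevity; real code
            // should escape or parameterize them.
            sql.Append("('").Append(string.Join("', '", rows[i])).Append("')");
        }

        // Execute sql.ToString() via the database wrapper here:
        // one round trip per batch instead of one per row.
    }
}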

Second, and probably most important: I had lots of duplicates per product, which accumulated into a big, bloody mess taking an hour to import. After eliminating those duplicates before the import, I'm down to 10 seconds for the same action...

One should check the result of one's SELECTs for duplicates before importing them. In this case I wanted to select every product exactly once, but I actually selected each product with every language version (meaning I got four versions of each product which are basically the same, just in another language).
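A cheap guard against this kind of duplication, assuming the product ID uniquely identifies a product, is to track the IDs already seen (a sketch reusing importData and productIdField from the question, not my actual fix):

// Skip rows whose product ID was already processed, e.g. when the
// same product shows up once per language version.
// Requires System.Collections.Generic and System.Data.
HashSet<string> seen = new HashSet<string>();

foreach (DataRow row in importData.Tables[0].Rows)
{
    string id = row[productIdField].ToString();

    if (!seen.Add(id))
        continue; // duplicate product, already imported

    // ... existing per-row import code ...
}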

Michael Barth