I'm writing a PHP script which imports data from tables in two different databases into another one. I've got it working OK with sample data, except now I've moved to using data more closely resembling its final use: 25+ million records per table, and growing daily. Obviously, efficiency is a bit of a concern.

Here's how it currently works. I copy the table structure, adding a couple of extra fields to maintain key integrity:

other1.someTable (field1, field2, field3) Pk = [field1, field2]
other2.someTable (field1, field2, field3) Pk = [field1, field2]
mydb.someTable   (id, source, field1, field2, field3)
    Pk = id, Unique key = [source, field1, field2]

And here's the SQL. It has an ON DUPLICATE KEY UPDATE clause because this import needs to be done regularly, updating the data in "mydb". Thankfully, records won't be deleted from the "other" databases (I think!).

INSERT INTO mydb.someTable (source, field1, field2, field3)
SELECT 1, field1, field2, field3 FROM other1.someTable
ON DUPLICATE KEY UPDATE field1 = field1, field2 = field2, field3 = field3;

INSERT INTO mydb.someTable (source, field1, field2, field3)
SELECT 2, field1, field2, field3 FROM other2.someTable
ON DUPLICATE KEY UPDATE field1 = field1, field2 = field2, field3 = field3;

My question is this: Is this the best possible way to do this? Are there any other methods which might be faster, considering there are going to be millions and millions of records, totaling many gigabytes of data per table?

+2  A: 

Are you sure there are no duplicate IDs? Or, if there are, are you always going to overwrite them with data from the second database?

Additionally, do you do any processing on the data you obtain from DB1 / DB2 prior to inserting / updating it into the 3rd database?

If the answers are "yes" to the first question and "no" to the third, it will likely be a lot faster to use LOAD DATA INFILE. Select data from DB1 and DB2 and load them in sequence.
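
For example, the export / import could look roughly like this (a sketch only; the file path, delimiters and the use of REPLACE to handle duplicate keys are assumptions, and note that REPLACE deletes and re-inserts matching rows rather than updating them in place):

-- export from the source database to a flat file (path is a placeholder)
SELECT 1, field1, field2, field3
INTO OUTFILE '/tmp/other1_someTable.csv'
FIELDS TERMINATED BY ',' ENCLOSED BY '"'
LINES TERMINATED BY '\n'
FROM other1.someTable;

-- bulk-load it into the target table; REPLACE resolves unique-key clashes
LOAD DATA INFILE '/tmp/other1_someTable.csv'
REPLACE INTO TABLE mydb.someTable
FIELDS TERMINATED BY ',' ENCLOSED BY '"'
LINES TERMINATED BY '\n'
(source, field1, field2, field3);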

ChssPly76
Unfortunately the answers to the 1st and 3rd questions are "no" and "yes".
nickf
So what kind of processing do you need to do then? As far as duplicate IDs go, if you're using your `source` column to distinguish, you can very much keep doing that.
ChssPly76
Mostly trimming data, sometimes changing types (string to date, string to int, etc.)
nickf
Fair enough. Is there some way for you to determine which records have been added / changed in DB1 / DB2 since the last sync (something like a `lastUpdated` timestamp on the record, for example)? That would (presumably) reduce the number of records that have to be copied over. Beyond that, use batch updates if you can (not sure what you're using to process the data on the back end) and, if using InnoDB, be sure to commit / restart the transaction every 1000 records or so (you can play with that number a bit to see what gives the best performance).
ChssPly76
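
For illustration, the incremental variant might look something like this; the lastUpdated column and the @lastSync marker are hypothetical (they don't exist in the schema above), and VALUES() is used to pick up the incoming value on duplicates:

-- hypothetical incremental sync: only copy rows touched since the last run
INSERT INTO mydb.someTable (source, field1, field2, field3)
SELECT 1, field1, field2, field3
FROM other1.someTable
WHERE lastUpdated > @lastSync
ON DUPLICATE KEY UPDATE field3 = VALUES(field3);
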
A: 

Well, in your ON DUPLICATE KEY UPDATE clause there is no need to update field1 and field2, as they are the key and have already been matched.
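
If the intent is to refresh field3 with the incoming value on a duplicate, one way to do that (a sketch, assuming that is in fact what you want) is MySQL's VALUES() function:

INSERT INTO mydb.someTable (source, field1, field2, field3)
SELECT 1, field1, field2, field3 FROM other1.someTable
ON DUPLICATE KEY UPDATE field3 = VALUES(field3);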

The other question is: do you care if source 1 sets field3 to one value and then source 2 sets it to another, and again tomorrow, and the day after? Is this something you need to know happened?

Don
The data from the "other" databases won't overwrite each other, since the "source" column makes each row unique per DB.
nickf
A: 

Have you considered using federated tables?
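
Roughly, that would mean defining a local table backed by the remote server's copy and then running the INSERT ... SELECT against it (column types, host name and credentials below are placeholders, and the definition must match the remote table):

CREATE TABLE mydb.fed_someTable (
  field1 INT NOT NULL,
  field2 INT NOT NULL,
  field3 VARCHAR(255),
  PRIMARY KEY (field1, field2)
) ENGINE=FEDERATED
CONNECTION='mysql://user:password@other1_host:3306/other1/someTable';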

Damir Sudarevic
doesn't that only help if you have multiple servers?
nickf