A couple of years ago I wrote a small utility to move data from an Oracle database to a Postgres database. I wrote it in Java with JDBC because I wanted Java to handle the data-type conversions when binding values to the prepared statement that performs the inserts. The original version assumed that the table and column names were the same in both databases; later versions accepted a mapping file to handle name differences. The utility was a big hit in my organization, but unfortunately it does not scale: it maxes out at about a million rows moved per hour. We now have tables with 30+ million rows, and nobody is willing to wait 30 hours for their data to transfer.
The method below is the heart of the utility and the reason it does not scale. It is executed once for each column of each row, so it gets called num_rows * num_cols times. Profiling shows this method consuming 58% of the execution time, with the getObject() and findColumn() calls alone accounting for 53%!
public void setPlaceholderValue(int placeHolderNum, ResultSet rs,
                                String oracleColumnName, PreparedStatement stmt)
        throws SQLException {
    int columnIndex = rs.findColumn(oracleColumnName);
    int columnType = rs.getMetaData().getColumnType(columnIndex);
    try {
        if (rs.getObject(oracleColumnName) != null) {
            switch (columnType) {
                case Types.VARCHAR:   stmt.setString(placeHolderNum, rs.getString(columnIndex)); break;
                case Types.INTEGER:   stmt.setInt(placeHolderNum, rs.getInt(columnIndex)); break;
                case Types.DATE:      stmt.setDate(placeHolderNum, rs.getDate(columnIndex)); break;
                case Types.FLOAT:     stmt.setFloat(placeHolderNum, rs.getFloat(columnIndex)); break;
                case Types.NUMERIC:   stmt.setBigDecimal(placeHolderNum, rs.getBigDecimal(columnIndex)); break;
                case Types.TIMESTAMP: stmt.setTimestamp(placeHolderNum, rs.getTimestamp(columnIndex)); break;
                default: throw new SQLException("The result set column type " + columnType
                        + " was not recognized; see java.sql.Types at http://java.sun.com/j2se/1.5.0/docs/api/");
            }
        } else {
            stmt.setNull(placeHolderNum, columnType);
        }
    } catch (SQLException e) {
        System.out.println("SQLException: " + e.getMessage() + " for record id=" + rs.getLong("id"));
        throw e;  // rethrow after logging the offending record
    }
}
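For what it's worth, the obvious refactor is to resolve each column's index and type once per table, before the row loop, instead of once per cell, so findColumn() and getColumnType() drop out of the hot path, and to pass the getObject() lookup an index rather than a name. A rough, untested sketch of what I mean (the columnIndexes/columnTypes arrays are hypothetical and would be built once per table from the metadata):

// Built once per table, before iterating rows:
//   columnIndexes[i] = rs.findColumn(oracleColumnNames[i]);
//   columnTypes[i]   = rs.getMetaData().getColumnType(columnIndexes[i]);
public void setPlaceholderValue(int placeHolderNum, ResultSet rs,
                                int columnIndex, int columnType,
                                PreparedStatement stmt) throws SQLException {
    // Index-based lookup: no per-cell name resolution, no metadata calls
    Object value = rs.getObject(columnIndex);
    if (value == null) {
        stmt.setNull(placeHolderNum, columnType);
        return;
    }
    switch (columnType) {
        case Types.VARCHAR:   stmt.setString(placeHolderNum, rs.getString(columnIndex)); break;
        case Types.INTEGER:   stmt.setInt(placeHolderNum, rs.getInt(columnIndex)); break;
        case Types.DATE:      stmt.setDate(placeHolderNum, rs.getDate(columnIndex)); break;
        case Types.FLOAT:     stmt.setFloat(placeHolderNum, rs.getFloat(columnIndex)); break;
        case Types.NUMERIC:   stmt.setBigDecimal(placeHolderNum, rs.getBigDecimal(columnIndex)); break;
        case Types.TIMESTAMP: stmt.setTimestamp(placeHolderNum, rs.getTimestamp(columnIndex)); break;
        default: throw new SQLException("Unrecognized column type " + columnType + "; see java.sql.Types");
    }
}

The row loop could also call stmt.addBatch() per row and stmt.executeBatch() every few thousand rows rather than executing each insert individually, but I have not measured how much that buys.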
Even with a refactor like the sketch above, I'm not sure the transfer time comes down far enough. I think the column-by-column approach simply does not scale.
Can anyone suggest a better way of doing this? Language is not an issue; I can do it with anything that can handle the job. Ideally, I would like to see a transfer rate of at least 10 million records per hour.
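For example, one direction I have been eyeing but have not tried is bypassing INSERTs entirely and streaming rows into Postgres with COPY via the JDBC driver's CopyManager. A minimal sketch, assuming the PostgreSQL JDBC driver; csvRows is a hypothetical Reader that emits one properly escaped CSV line per source row:

import java.io.Reader;
import java.sql.Connection;
import org.postgresql.PGConnection;
import org.postgresql.copy.CopyManager;

// Stream rows into Postgres with COPY instead of per-row INSERTs.
void bulkLoad(Connection pgConn, String tableName, Reader csvRows) throws Exception {
    CopyManager copyManager = ((PGConnection) pgConn).getCopyAPI();
    long rowsLoaded = copyManager.copyIn(
            "COPY " + tableName + " FROM STDIN WITH CSV", csvRows);
    System.out.println("Loaded " + rowsLoaded + " rows into " + tableName);
}

The catch is that I would then be responsible for the CSV formatting and escaping myself, which is exactly the work I originally wanted JDBC to handle for me.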