views:

479

answers:

3

I have a Java program that in some circumstances must update a large number of records in a database (e.g. 100,000).

It does this by creating a PreparedStatement and using the addBatch technique. Here is the snippet:

connection.setAutoCommit(false);
PreparedStatement ps = connection.prepareStatement(
        "UPDATE myTable SET colName=? where id=?");

for (...) { // this loop can be 100,000 iterations long
    colValue = ...
    id = ...
    ps.setString(1, colValue);
    ps.setString(2, id);
    ps.addBatch();
}

ps.executeBatch();
connection.commit();

Is this the best (fastest) way to update 100,000 records in JDBC?

Could anybody suggest a better way?

A: 

You should use Spring Batch operations with the JdbcTemplate
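
For reference, a minimal sketch of what that could look like using JdbcTemplate's batchUpdate with a BatchPreparedStatementSetter (the MyTableBatchUpdater and Row names are made up for illustration; transaction handling is left to Spring's usual mechanisms):

import java.sql.PreparedStatement;
import java.sql.SQLException;
import java.util.List;

import javax.sql.DataSource;

import org.springframework.jdbc.core.BatchPreparedStatementSetter;
import org.springframework.jdbc.core.JdbcTemplate;

public class MyTableBatchUpdater {

    private final JdbcTemplate jdbcTemplate;

    public MyTableBatchUpdater(DataSource dataSource) {
        this.jdbcTemplate = new JdbcTemplate(dataSource);
    }

    // "Row" is a hypothetical holder for the values the question's loop computes.
    public void updateAll(final List<Row> rows) {
        jdbcTemplate.batchUpdate(
            "UPDATE myTable SET colName=? WHERE id=?",
            new BatchPreparedStatementSetter() {
                public void setValues(PreparedStatement ps, int i) throws SQLException {
                    Row row = rows.get(i);
                    ps.setString(1, row.colValue);
                    ps.setString(2, row.id);
                }
                public int getBatchSize() {
                    return rows.size(); // number of statements in the batch
                }
            });
    }

    public static class Row {
        public String colValue;
        public String id;
    }
}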

javaExpert
But he doesn't want to use Spring
Dan
But maybe he should, Spring is all the rage these days
javaExpert
+2  A: 

Try this as a benchmark:

  1. Use the built-in SQL tools to do a bulk extract of the entire table. All rows. All columns.

  2. Drop (or rename) the table.

  3. Use a simple flat-file read/write to create a new file with the updates applied.

  4. Use the bulk-load utility that comes with your database to rebuild the entire table from the extracted file.

  5. Add indexes after the reload.

You may find that this is faster than any SQL solution. We stopped using UPDATEs for a data warehouse because extract -> flat-file process -> load was much faster than SQL.
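
To make step 3 concrete, here is a rough Java sketch, assuming a pipe-delimited extract with the id in the first column and colName in the second, and the new values available in a map keyed by id (the file layout, delimiter, and names are assumptions, not details from your schema):

import java.io.BufferedReader;
import java.io.BufferedWriter;
import java.io.FileReader;
import java.io.FileWriter;
import java.io.IOException;
import java.util.Map;

public class FlatFileUpdater {

    // Reads the bulk-extracted file line by line, replaces colName (assumed to be
    // the second field) for any id present in the updates map, and writes a new
    // file that the bulk-load utility can reload.
    public static void applyUpdates(String extractFile, String outputFile,
                                    Map<String, String> updates) throws IOException {
        BufferedReader in = new BufferedReader(new FileReader(extractFile));
        BufferedWriter out = new BufferedWriter(new FileWriter(outputFile));
        try {
            String line;
            while ((line = in.readLine()) != null) {
                String[] fields = line.split("\\|", -1); // assumed pipe-delimited extract
                String id = fields[0];                   // assumed id is the first column
                String newValue = updates.get(id);
                if (newValue != null) {
                    fields[1] = newValue;                // assumed colName is the second column
                }
                out.write(join(fields, '|'));
                out.newLine();
            }
        } finally {
            in.close();
            out.close();
        }
    }

    private static String join(String[] fields, char sep) {
        StringBuilder sb = new StringBuilder();
        for (int i = 0; i < fields.length; i++) {
            if (i > 0) sb.append(sep);
            sb.append(fields[i]);
        }
        return sb.toString();
    }
}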

S.Lott
It may be a rubbish answer but there are nicer ways of saying that.
Dan
@dan: you are right, Dan. I was too harsh. @S.Lott: thanks for your unhelpful answer. I wonder how you manage to get so many points on StackOverflow. Surely the guys at StackOverflow should consider reworking their algorithms.
zerohibernation
Clearly S.Lott thinks you can get closer to an answer if you do a benchmark. Maybe you asked because you didn't know how to do it, or because you didn't have time to learn enough about your DB. If there's a definitive answer, someone will write it. If there's not, you'll have one not-so-helpful answer. And you know what? I'm voting for it because it encourages self-learning and understanding beyond a simple, questionable answer. We are programmers. We make things work.
helios
@zerohibernation: (1) It worked for me. (2) It may not work for you. (3) I'm not sure how much more detail you need. Code? (4) If you don't "benchmark", all you're doing is taking my word for it. (5) You can ignore my answer politely.
S.Lott
@s.lott: I take it back. His answer is not too bad. Sincere apologies. @helios: you say: "I'm voting for it because it encourages self-learning and understanding beyond a simple, questionable answer." So if somebody replies 'go and find it out yourself', you could vote for them as well.
zerohibernation
@zerohibernation: I don't think I said go and find out for yourself. I think I suggested benchmarking the algorithm I described. I'm not sure, but there seems to be a difference. Perhaps my answer was unclear?
S.Lott
+1 for measuring. Whether dropping and rebuilding the indexes increases performance depends on the number of rows already in the table and whether they have to be preserved.
stacker
A: 

Since batching uses buffering on the client side and then sends everything as a single request, it might be wise to execute batches of 5,000 rows. You should watch your memory consumption when adding 100,000 rows.

Sometimes it is faster to push data in several loads instead of one single load (using JDBC, at least based on my previous experience).
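
Applied to the statement from the question, a chunked batch could look roughly like this (the Map input stands in for your loop, and 5,000 is just the size suggested above, not a measured optimum):

import java.sql.Connection;
import java.sql.PreparedStatement;
import java.sql.SQLException;
import java.util.Map;

public class ChunkedBatchUpdate {

    private static final int BATCH_SIZE = 5000; // flush every 5,000 rows, as suggested above

    // newValuesById maps id -> new colName value, standing in for the question's loop body.
    public static void update(Connection connection, Map<String, String> newValuesById)
            throws SQLException {
        connection.setAutoCommit(false);
        PreparedStatement ps = connection.prepareStatement(
                "UPDATE myTable SET colName=? WHERE id=?");
        try {
            int count = 0;
            for (Map.Entry<String, String> entry : newValuesById.entrySet()) {
                ps.setString(1, entry.getValue());
                ps.setString(2, entry.getKey());
                ps.addBatch();
                if (++count % BATCH_SIZE == 0) {
                    ps.executeBatch(); // send this chunk, freeing the client-side buffer
                }
            }
            ps.executeBatch();         // flush the remaining rows
            connection.commit();
        } finally {
            ps.close();
        }
    }
}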

adrian.tarau