I have some database table and need to process records from it 5 at a time as long as app is running. So, it looks like this:
- Get a record that hasn't been processed yet or not processing now by other threads.
- Process it (this is a long process that depends on internet connection so it could timeout/throw errors).
- Move to the next record. When reached end of table start from beginning.
I don't have much experience with threads, so I see two possible strategies:
Approach A.
1.Create new ExecutorService:
ExecutorService taskExecutor = Executors.newFixedThreadPool(5);
2.Add 5 tasks to it:
for (int i = 0; i < 5; i++) {
taskExecutor.execute(new MyTask());
}
3.Each task will be infinite loop, that: reads a record from the table, processes it, and then gets another record.
The problems with this approach is how to inform other threads about which records are currenly processing. To do this I can either use "status" field in the table or just use some CopyOnWriteArraySet that holds currently processing ID's.
Approach B.
1.Create the same ExecutorService:
ExecutorService taskExecutor = Executors.newFixedThreadPool(5);
2. Have an infinite loop that selects records that need to be processed and passes them to the executor:
while (true) {
//get next record here
taskExecutor.execute(new MyTask(record));
//monitor the queue and wait until some thread is done processing,
//so I can add another record
}
3.Each task processes a single record.
The problem with this approach is that I need to add tasks to the executor's queue slower than they are processed to not let them pile up over time. It means I need to monitor not only which tasks are currently running but also when they are done processing, so I can add new records to the queue.
Personally I think first approach is better (easier), but I feel that the second one is more correct. What do you think? Or maybe I should do something completely different?
Also I can use Spring or Quartz libraries for this if needed.
Thanks.