[pybsddb] Batch import is slowing down
Amirouche Boubekki
amirouche at hypermove.net
Wed Jun 24 22:46:20 CEST 2015
On 2015-06-24 22:30, Amirouche Boubekki wrote:
> On 2015-06-24 18:56, Jesus Cea wrote:
>> On 24/06/15 12:09, Amirouche Boubekki wrote:
>>> Indeed the documentation is very good, you did a very good job with
>>> that.
>>
>> I didn't write it :-). In fact, neither did Oracle. It was mostly
>> written in the SleepyCat era :), before Oracle bought them.
>>
>> Anyway I am not related to Oracle in any way at all.
>>
>>> - The first set of 10 000 entries took 3 seconds to load.
>>> - 49th set took 2 minutes
>>
>> It could be a cache size issue. I can't really tell without checking
>> it myself. What does "db_stat -m" show? Initialize the stats with
>> "db_stat -mZ" from time to time and compare results between the
>> beginning and the slow part.
>
> I see nothing suspicious in the log. They are attached. I'll be very
> happy if you can help me.
>
> Here are the timings:
>
> 10000 0:00:03.643761
> 20000 0:00:04.169674
> 30000 0:00:05.834176
> 40000 0:00:05.220522
> 50000 0:00:06.411978
> 60000 0:00:08.391819
> 70000 0:00:12.561121
> 80000 0:00:06.480804
> 90000 0:00:07.154480
> 100000 0:00:07.106232

It seems that checkpointing more often makes the import faster. Here are
new timings with a checkpoint every 1000 entries, trimmed to show only
the multiples of ten thousand:

10000 0:00:00.623182
20000 0:00:00.987063
30000 0:00:01.568719
40000 0:00:01.659039
50000 0:00:02.171126
60000 0:00:01.960296
70000 0:00:02.301340
80000 0:00:02.216895
90000 0:00:02.549782
100000 0:00:03.089665
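
For reference, the import loop with periodic checkpoints can be sketched
roughly as below. This is only a minimal sketch, not the actual import
script: the function name batch_import and its parameters are hypothetical,
and "put" and "checkpoint" are plain callables so the loop does not depend
on any particular database binding. Each report shows the time elapsed
since the previous report, which is an assumption about how the figures
above were measured.

```python
from datetime import datetime


def batch_import(entries, put, checkpoint,
                 checkpoint_every=1000, report_every=10000, report=print):
    """Store (key, value) pairs, checkpointing at a fixed interval.

    `put` and `checkpoint` are caller-supplied callables (hypothetical
    names); nothing here is specific to Berkeley DB.
    Returns the number of entries stored.
    """
    started = datetime.now()
    count = 0
    for count, (key, value) in enumerate(entries, start=1):
        put(key, value)
        # Checkpoint often so that less work piles up between checkpoints.
        if count % checkpoint_every == 0:
            checkpoint()
        # Print the running count and the time taken since the last report.
        if count % report_every == 0:
            now = datetime.now()
            report("%d %s" % (count, now - started))
            started = now
    return count
```

With pybsddb, "put" would presumably wrap DB.put and "checkpoint" would
wrap DBEnv.txn_checkpoint, but that wiring is left out here because it
depends on how the environment and transactions are set up.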