[pybsddb] Batch import is slowing down

Jesus Cea jcea at jcea.es
Sat Jun 27 20:36:23 CEST 2015


On 27/06/15 18:30, Amirouche Boubekki wrote:

>> without sync with gc
>>> first: 175 seconds
>>> second: 368
>>> third: 603
> 
>> without sync without gc
>>> first: 172 seconds
>>> second: 359
>>> third: 581

Ok, your issue is not Garbage Collection.

My next step now would be to time your BSDDB3 calls.

Profile your code, locate the slow routine and post it here.
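
As a rough illustration of the profiling step, here is a minimal sketch using the standard library's cProfile. The names profile_call and fake_import are hypothetical stand-ins, not part of bsddb3 or of your code; replace fake_import with your real batch import routine. cProfile also records calls into C extension methods, so the Berkeley DB calls should show up in the report.

```python
import cProfile
import io
import pstats


def profile_call(func, *args, **kwargs):
    """Run func under cProfile and return (result, report_text)."""
    profiler = cProfile.Profile()
    result = profiler.runcall(func, *args, **kwargs)
    out = io.StringIO()
    stats = pstats.Stats(profiler, stream=out)
    # Sort by cumulative time so the slowest call chains come first.
    stats.sort_stats("cumulative").print_stats(15)
    return result, out.getvalue()


def fake_import(store, n):
    """Hypothetical stand-in for the real batch import routine."""
    for i in range(n):
        store[b"key-%d" % i] = b"value"
    return len(store)


if __name__ == "__main__":
    count, report = profile_call(fake_import, {}, 100000)
    print(report)
```

Running it once per batch (first, second, third) and comparing the reports should show which call grows with the size of the database.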

>> txn less environment without gc
>>> first: 72 seconds
>>> second: 87
>>> third: 101

In my experience, ACI (ACID with durability disabled and asynchronous
writes to the log) performs quite well. Certainly not the 1:5 degradation
you are seeing.
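
One possible reading of "durability disabled and async writes to the log" in Berkeley DB terms is the DB_TXN_WRITE_NOSYNC flag (DB_TXN_NOSYNC is the more aggressive variant). The sketch below is an assumption about how such an environment could be opened, not necessarily the exact configuration discussed earlier in this thread. The helper name open_aci_env is hypothetical, and the db module is passed in as a parameter so the helper itself does not import bsddb3.

```python
def open_aci_env(db, home):
    """Open a transactional environment that skips the log flush on commit.

    db   -- the bsddb3 "db" module (passed in; an assumption of this sketch)
    home -- path to an existing environment directory
    """
    flags = (
        db.DB_CREATE
        | db.DB_INIT_MPOOL   # shared cache
        | db.DB_INIT_LOCK    # locking subsystem
        | db.DB_INIT_LOG     # write-ahead log
        | db.DB_INIT_TXN     # transactions
    )
    env = db.DBEnv()
    # Write the log on commit but do not wait for it to reach the disk:
    # atomicity, consistency and isolation remain, durability is relaxed.
    env.set_flags(db.DB_TXN_WRITE_NOSYNC, 1)
    env.open(home, flags)
    return env


# Usage (requires bsddb3 to be installed):
#   from bsddb3 import db
#   env = open_aci_env(db, "/path/to/existing/env/dir")
```

With this flag a crash of the machine can lose the most recently committed transactions, which is usually acceptable for a batch import that can be restarted.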

> Configuration of the ACID environment:

Have you tried the ACI configuration I proposed a few days ago?

> Primary tables use hash backend. In the worst case an entry will do:

What are the indexes? If your keys have locality, a btree could be better.
This is probably not an issue yet, since everything you are doing so far
fits in the cache.

-- 
Jesús Cea Avión                         _/_/      _/_/_/        _/_/_/
jcea at jcea.es - http://www.jcea.es/     _/_/    _/_/  _/_/    _/_/  _/_/
Twitter: @jcea                        _/_/    _/_/          _/_/_/_/_/
jabber / xmpp:jcea at jabber.org  _/_/  _/_/    _/_/          _/_/  _/_/
"Things are not so easy"      _/_/  _/_/    _/_/  _/_/    _/_/  _/_/
"My name is Dump, Core Dump"   _/_/_/        _/_/_/      _/_/  _/_/
"El amor es poner tu felicidad en la felicidad de otro" - Leibniz
