optimize cleanUpBSNulls function by Paczesiowa · Pull Request #25 · hdbc/hdbc-postgresql

Paczesiowa · 2012-12-12T15:06:05Z

This uses a lot less memory than the concatMap version. Our app has queries around 10k characters long (inserting files into the db), and the concatMap version was creating huge numbers of small bytestrings, which consumed an additional 1 GB of memory and a few extra seconds of GC time.
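For context, a concatMap-style escaper might look roughly like the sketch below. This is an assumed reconstruction for illustration, not code quoted from the repository; the name `escapeNullsConcatMap` is hypothetical. The point is that the mapping function returns a separate `ByteString` for every input byte, so a 10k-character query produces on the order of 10k tiny intermediate values before they are concatenated.

```haskell
import qualified Data.ByteString as B
import qualified Data.ByteString.Char8 as BC

-- Hypothetical reconstruction of a concatMap-based escaper.
-- Every input byte is mapped to its own small ByteString:
-- NUL becomes the four bytes \000, any other byte becomes a singleton.
escapeNullsConcatMap :: B.ByteString -> B.ByteString
escapeNullsConcatMap = B.concatMap conv
  where
    conv 0 = nullEscape
    conv x = B.singleton x
    -- Backslash followed by three '0' characters.
    nullEscape = BC.pack "\\000"
```

Under these assumptions, the input bytes `a`, NUL, `b` come out as `a\000b`, six bytes in total.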

dcoutts · 2014-04-09T17:49:44Z

Here's a simpler version using the new bytestring builder:

import qualified Data.ByteString as B
import qualified Data.ByteString.Lazy as BL
import qualified Data.ByteString.Builder as B
import qualified Data.ByteString.Builder.Prim as BP
import Data.ByteString.Builder.Prim ((>$<), (>*<))
import Data.Word (Word8)

-- Escape every NUL byte as the four characters \000; copy all other bytes unchanged.
escapeBSNulls :: B.ByteString -> BL.ByteString
escapeBSNulls = B.toLazyByteString . BP.primMapByteStringBounded conv
  where
    -- Per-byte encoder: NUL takes the replacement path, anything else is written as-is.
    conv :: BP.BoundedPrim Word8
    conv = BP.condB (==0) (BP.liftFixedToBounded replacement)
                          (BP.liftFixedToBounded BP.word8)
    -- Fixed-size encoder that ignores its input and always emits \000.
    replacement :: BP.FixedPrim a
    replacement = const ('\\', ('0', ('0', '0')))
              >$< BP.char8 >*< BP.char8 >*< BP.char8 >*< BP.char8

In my benchmark on a 13 MB data file, criterion reports that it is about 20x faster than the low-level version from this pull request.

dcoutts · 2014-04-09T17:55:01Z

Note that the large factor is likely an effect of the test data containing a lot of \0 bytes (it is an executable file), not of the input being so large: the factor is actually even bigger for a 10k prefix of the same binary test file.
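To make the dependence on the data concrete, here is a small sketch of how one might measure how NUL-heavy an input is and how much the escaped output grows. It is not a benchmark and says nothing about timings; the helper names `escapedLength` and `nulFraction` are hypothetical, and the sketch assumes each NUL is rewritten to the four bytes `\000` while every other byte is copied unchanged.

```haskell
import qualified Data.ByteString as B

-- Hypothetical helper: size of the escaped output, assuming each NUL byte
-- expands to four bytes (three extra) and all other bytes are copied.
escapedLength :: B.ByteString -> Int
escapedLength bs = B.length bs + 3 * B.count 0 bs

-- Hypothetical helper: fraction of input bytes that are NUL.
-- Returns 0 for empty input to avoid dividing by zero.
nulFraction :: B.ByteString -> Double
nulFraction bs
  | B.null bs = 0
  | otherwise = fromIntegral (B.count 0 bs) / fromIntegral (B.length bs)
```

Comparing `nulFraction` for an executable against a typical text query would show how unrepresentative a binary test file can be: the more NULs there are, the more often the replacement path runs and the larger the output becomes.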

optimize cleanUpBSNulls function

0eaef28

Paczesiowa wants to merge 1 commit into hdbc:master from Paczesiowa:optimize-cleanUpBSNulls
