Provide unconsN/unsnocN? #524

hasufell · 2022-07-03T12:35:23Z

In an attempt to improve the performance of filepath functions that use ShortByteString, I found that unpack slowed down a couple of functions. Replacing it with several calls to uncons seemed to improve performance. In particular:

 readDriveUNC :: FILEPATH -> Maybe (FILEPATH, FILEPATH)
-readDriveUNC bs = case unpack bs of
-  (s1:s2:q:s3:xs)
-    | q == _question && L.all isPathSeparator [s1,s2,s3] ->
-      case L.map toUpper xs of
-          (u:n:c:s4:_)
-            | u == _U && n == _N && c == _C && isPathSeparator s4 ->
-              let (a,b) = readDriveShareName (pack (L.drop 4 xs))
-              in Just (pack (s1:s2:_question:s3:L.take 4 xs) <> a, b)
-          _ -> case readDriveLetter (pack xs) of
-                   -- Extended-length path.
-                   Just (a,b) -> Just (pack [s1,s2,_question,s3] <> a, b)
-                   Nothing -> Nothing
-  _ -> Nothing
+readDriveUNC bs
+  | Just (s1, r1) <- uncons bs
+  , Just (s2, r2) <- uncons r1
+  , Just (q,  r3) <- uncons r2
+  , Just (s3, xs) <- uncons r3
+  , q == _question
+  , L.all isPathSeparator [s1,s2,s3] =
+      if | Just (toUpper -> u, k1) <- uncons xs
+         , Just (toUpper -> n, k2) <- uncons k1
+         , Just (toUpper -> c, k3) <- uncons k2
+         , Just (s4,           rr) <- uncons k3
+         , u == _U
+         , n == _N
+         , c == _C
+         , isPathSeparator s4 ->
+              let (a,b) = readDriveShareName rr
+              in Just (pack [s1,s2,_question,s3,u,n,c,s4] <> a, b)
+         | otherwise -> case readDriveLetter xs of
+                          -- Extended-length path.
+                          Just (a,b) -> Just (pack [s1,s2,_question,s3] <> a, b)
+                          Nothing -> Nothing
+  | otherwise = Nothing

https://gitlab.haskell.org/haskell/filepath/-/merge_requests/116/diffs

The 3 consecutive calls to uncons are not only awkward, they also copy the tail 3 times.

So I'm wondering if a function like this might be useful (at least for ShortByteString):

unconsN :: Int -> ShortByteString -> Maybe ([Word8], ShortByteString)

The obvious disadvantage here is that you'll get partial pattern matching on the [Word8] list, because we don't have dependent types.
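To make the proposal and its drawback concrete, here is a minimal sketch. It is not the implementation from the linked commit or PR; it assumes a bytestring version (0.11.3 or later) in which Data.ByteString.Short exports length, index and drop. The helper startsWithABC is hypothetical and only illustrates the partial list pattern at the call site.

```haskell
import qualified Data.ByteString.Short as SBS
import Data.ByteString.Short (ShortByteString)
import Data.Word (Word8)

-- Hypothetical sketch of the proposed function, not the code from the PR.
-- Returns Nothing when n is negative or fewer than n bytes are available.
unconsN :: Int -> ShortByteString -> Maybe ([Word8], ShortByteString)
unconsN n sbs
  | n < 0 || n > SBS.length sbs = Nothing
  | otherwise =
      -- Read the first n bytes by index, then copy the tail exactly once.
      Just ([SBS.index sbs i | i <- [0 .. n - 1]], SBS.drop n sbs)

-- Call site: the compiler cannot know that the list has exactly 3 elements,
-- so the [a, b, c] pattern is partial and a fallback branch is still needed.
startsWithABC :: ShortByteString -> Bool
startsWithABC sbs = case unconsN 3 sbs of
  Just ([a, b, c], _) -> a == 97 && b == 98 && c == 99
  _                   -> False
```

The fallback branch in startsWithABC also covers the "impossible" case of a list with the wrong length, which is the price of returning a plain list.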

Providing uncons2, uncons3 and using a tuple instead might be an alternative, but less general.
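A sketch of what the tuple-based alternative could look like, under the same assumption that Data.ByteString.Short exports length, index and drop (bytestring 0.11.3 or later). The names uncons2 and uncons3 come from the sentence above; the bodies are illustrative only.

```haskell
import qualified Data.ByteString.Short as SBS
import Data.ByteString.Short (ShortByteString)
import Data.Word (Word8)

-- Hypothetical: split off exactly two bytes, returned as tuple components
-- so that the match at the call site is total.
uncons2 :: ShortByteString -> Maybe (Word8, Word8, ShortByteString)
uncons2 sbs
  | SBS.length sbs >= 2 =
      Just (SBS.index sbs 0, SBS.index sbs 1, SBS.drop 2 sbs)
  | otherwise = Nothing

-- Hypothetical: the same idea for three bytes.
uncons3 :: ShortByteString -> Maybe (Word8, Word8, Word8, ShortByteString)
uncons3 sbs
  | SBS.length sbs >= 3 =
      Just (SBS.index sbs 0, SBS.index sbs 1, SBS.index sbs 2, SBS.drop 3 sbs)
  | otherwise = Nothing
```

The tuple makes the number of bytes part of the type, at the cost of one function per arity.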

The other way would be to figure out why unpack is so slow. As far as I understand, it's only semi-lazy, i.e. it unpacks the first 100 bytes strictly.


hasufell · 2022-07-03T12:39:36Z

I'll see if I can provide a minimal benchmark for this.

The filepath function using those went from:

    splitDrive (windows): 
      8.27 μs ± 299 ns

to:

    splitDrive (windows): 
      867  ns ±  21 ns

hasufell · 2022-07-03T14:02:14Z

hasufell@ddf9180

All
  ShortByteString
    ShortByteString unpack/uncons comparison
      unpack and look at first 5 elements: OK (2.21s)
        15.8 ms ± 665 μs
      uncons consecutively 5 times:        OK (0.93s)
        393  μs ±  38 μs
      unconsN 5:                           OK (0.16s)
        73.2 ns ± 6.6 ns

All 3 tests passed (3.30s)

So:

unpack is the slowest
consecutive uncons is roughly 40 times faster than unpack for n = 5 (393 μs vs. 15.8 ms)
unconsN is the fastest

Implementation of unconsN is in the link.


sjakobi · 2022-07-03T15:40:01Z

I think it would be good to figure out the performance problem with unpack first.

sjakobi · 2022-07-03T15:43:28Z

In particular I wonder whether the actual performance problem might lie with your use of pack. Instead, I think it might be better to explicitly use ByteString.Short.drop on the original ShortByteString.

hasufell · 2022-07-03T15:45:41Z

> In particular I wonder whether the actual performance problem might lie with your use of pack.

See the PR. It's not about pack: https://github.com/haskell/bytestring/pull/525/files#diff-c29f395d853c89b91b13dca506d85e777afb0a0343d817021f642f10798fb1a8R234

hasufell · 2022-07-03T15:49:01Z

> Instead, I think it might be better to explicitly use ByteString.Short.drop on the original ShortByteString.

I don't think so, because drop gives you no guarantees about the length of the input bytestring. So you end up re-inventing unconsN with a combination of length, drop and unpack, which is rather fragile.
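For illustration, the hand-rolled combination described above might look like the following sketch. It assumes Data.ByteString.Short exports length, take, drop and unpack (bytestring 0.11.3 or later); splitPrefix4 is a hypothetical name.

```haskell
import qualified Data.ByteString.Short as SBS
import Data.ByteString.Short (ShortByteString)
import Data.Word (Word8)

-- Hypothetical helper: split off a 4-byte prefix by hand.
-- The constant 4 appears three times and must be kept in sync manually;
-- nothing ties the length check to the take/drop calls.
splitPrefix4 :: ShortByteString -> Maybe ([Word8], ShortByteString)
splitPrefix4 sbs
  | SBS.length sbs >= 4 = Just (SBS.unpack (SBS.take 4 sbs), SBS.drop 4 sbs)
  | otherwise           = Nothing
```

If the length guard is forgotten, drop silently returns an empty ShortByteString for short inputs, which is the kind of mistake a dedicated function would rule out.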

hasufell · 2022-07-03T16:00:21Z

> I think it would be good to figure out the performance problem with unpack first.

The current implementation is:

unpackBytes :: ShortByteString -> [Word8]
unpackBytes sbs = unpackAppendBytesLazy sbs []

unpackAppendBytesLazy :: ShortByteString -> [Word8] -> [Word8]
unpackAppendBytesLazy sbs = go 0 (length sbs)
  where
    sz = 100

    go off len ws
      | len <= sz = unpackAppendBytesStrict sbs off len ws
      | otherwise = unpackAppendBytesStrict sbs off sz  remainder
                      where remainder = go (off+sz) (len-sz) ws

unpackAppendBytesStrict :: ShortByteString -> Int -> Int -> [Word8] -> [Word8]
unpackAppendBytesStrict !sbs off len = go (off-1) (off-1 + len)
  where
    go !sentinal !i !acc
      | i == sentinal = acc
      | otherwise     = let !w = indexWord8Array (asBA sbs) i
                         in go sentinal (i-1) (w:acc)

which, in fact, I don't understand at all.
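One possible reading of the code above, modelled on a plain indexing function instead of a ShortByteString: each chunk of up to sz elements is built strictly from back to front, and only the link to the next chunk is lazy. The names strictChunk and lazyChunks are made up for this sketch, and it says nothing about why the real function is slow.

```haskell
-- Build the elements at indices [off, off + len) strictly, back to front,
-- consing them onto `rest`. `rest` itself is not forced.
strictChunk :: (Int -> a) -> Int -> Int -> [a] -> [a]
strictChunk ix off len rest = go (off + len - 1) rest
  where
    go i acc
      | i < off   = acc
      | otherwise = let x = ix i in x `seq` go (i - 1) (x : acc)

-- Produce `total` elements in chunks of `sz`; the remainder after each
-- chunk stays an unevaluated thunk until a consumer walks past the chunk.
lazyChunks :: Int -> (Int -> a) -> Int -> [a]
lazyChunks sz ix total = go 0 total
  where
    go off len
      | len <= sz = strictChunk ix off len []
      | otherwise = strictChunk ix off sz (go (off + sz) (len - sz))
```

Under this reading, demanding even the first element forces the whole first chunk, while later chunks are only built on demand.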

I changed it to

unpackBytes :: ShortByteString -> [Word8]
unpackBytes sbs = let ix = length sbs - 1
                  in List.map (unsafeIndex sbs) [0..ix]

and that seemed to speed it up considerably:

All
  ShortByteString
    ShortByteString unpack/uncons comparison
      unpack and look at first 5 elements: OK (0.29s)
        62.2 ns ± 3.5 ns
      uncons consecutively 5 times:        OK (0.96s)
        412  μs ±  35 μs
      unconsN 5:                           OK (0.17s)
        77.1 ns ± 6.0 ns

but I'm not sure whether the memory behavior differs, or what this does to inlining and list fusion.


hasufell added a commit to hasufell/bytestring that referenced this issue Jul 3, 2022

Add Data.ByteString.Short.unconsN

d1896da

Fixes haskell#524

hasufell linked a pull request Jul 3, 2022 that will close this issue

Add Data.ByteString.Short.unconsN #525

Draft

hasufell mentioned this issue Jul 3, 2022

Speed up Data.ByteString.Short.unpack #526

Merged

hasufell added a commit to hasufell/bytestring that referenced this issue Sep 30, 2022

Add Data.ByteString.Short.unconsN

a77eedf

Fixes haskell#524
