perf: Improve repr performance #918

Merged
merged 9 commits into main from faster_repr
Aug 30, 2024

Conversation

TrevorBergeron
Contributor

Thank you for opening a Pull Request! Before submitting your PR, there are a few things you can do to make sure it goes smoothly:

  • Make sure to open an issue as a bug/issue before writing your code! That way we can discuss the change, evaluate designs, and agree on the general idea
  • Ensure the tests and linter pass
  • Code coverage does not decrease (if any source code was changed)
  • Appropriate docs were updated (if necessary)

Fixes #<issue_number_goes_here> 🦕

@product-auto-label bot added labels size: l (Pull request size is large) and api: bigquery (Issues related to the googleapis/python-bigquery-dataframes API) on Aug 23, 2024
@TrevorBergeron requested a review from tswast on August 23, 2024 20:07
@TrevorBergeron marked this pull request as ready for review on August 23, 2024 20:07
@TrevorBergeron requested review from a team as code owners on August 23, 2024 20:07
@tswast added the owlbot:run label (triggers the Owlbot post processor) on Aug 28, 2024
@gcf-owl-bot bot removed the owlbot:run label on Aug 28, 2024
        return cls(node)

    @classmethod
    def from_cached(
Collaborator

I assume this was dead code?

Contributor Author

Yes, probably leftover from a recent refactor.



 ## Put ordering in here or just add order_by node above?
 @dataclass(frozen=True)
-class ReadTableNode(BigFrameNode):
+class ReadTableNode(LeafNode):
Collaborator

Do we want a row_count here? If there are no filters applied, we should be able to get this from table metadata (though it could be slightly inaccurate if there was anything in the streaming buffer, which can show up in query results before the count arrives at the table metadata).

Contributor Author

Yes, that is probably right. In the new revision I will stuff some more table metadata into these nodes and use it to provide row_count where possible.
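
For illustration, a minimal sketch of sourcing a row count from table metadata alone; the helper name is hypothetical, not code from this PR:

from google.cloud import bigquery

def metadata_row_count(client: bigquery.Client, table_id: str) -> int:
    # Metadata-only call; no query is run and no bytes are scanned.
    table = client.get_table(table_id)
    # Caveat from the discussion above: rows still in the streaming buffer
    # can show up in query results before they are reflected in num_rows.
    return table.num_rows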

@@ -398,6 +415,11 @@ def relation_ops_created(self) -> int:
         # Assume worst case, where readgbq actually has baked in analytic operation to generate index
         return 3

+    @property
+    def supports_fast_head(self) -> bool:
+        # TODO: Be more lenient for small tables, or those clustered on non-sequential order key
Collaborator

+1, LIMIT clause is quite cheap on clustered tables.

Contributor Author

From what I can tell, this is not actually the case yet. All the files still get dispatched for ORDER BY + LIMIT, so the full cost is paid. In my experiments, only a filter over materialized row numbers yields significant savings. This may change in time.
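
To make the distinction concrete, a rough sketch of the two query shapes being compared (table and column names are placeholders, and the cached table is assumed to be clustered on the row-number column):

# ORDER BY + LIMIT still dispatches all files before the limit applies,
# so the full scan cost is paid.
order_by_limit = """
SELECT *
FROM `project.dataset.source_table`
ORDER BY order_key
LIMIT 5
"""

# A filter over materialized row numbers lets clustering prune most
# blocks, which is where the observed savings come from.
filter_on_row_num = """
SELECT *
FROM `project.dataset.cached_table`
WHERE row_num < 5
"""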

"""Can get head fast if can push head operator down to leafs and operators preserve rows."""
if isinstance(node, nodes.LeafNode):
return node.supports_fast_head
# TODO: In theory we can push head down through concat, but requires some dedicated logic
Collaborator

Are you thinking something like: limit applied to each + an overall limit clause? Does the BigQuery engine not push down limit through a union if we apply it at the top level?

Contributor Author

I guess I didn't think too hard about this. I am not aware of an engine-level ordered-head optimization that would prevent dispatching all the underlying data.
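
A sketch of the per-input rewrite described above, with hypothetical LimitNode and ConcatNode classes standing in for the real BigFrames node types:

import dataclasses

@dataclasses.dataclass(frozen=True)
class LimitNode:  # hypothetical stand-in, not a real BigFrames node
    child: object
    n: int

@dataclasses.dataclass(frozen=True)
class ConcatNode:  # hypothetical stand-in, not a real BigFrames node
    children: tuple

def push_limit_through_concat(node: ConcatNode, n: int) -> LimitNode:
    # Each input contributes at most n rows to the first n rows of the
    # concatenation, so limiting every input individually is safe as long
    # as an outer limit restores the exact requested row count.
    limited = tuple(LimitNode(child, n) for child in node.children)
    return LimitNode(ConcatNode(limited), n)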

        self, array_value: bigframes.core.ArrayValue, n_rows: int
    ) -> tuple[bigquery.table.RowIterator, bigquery.QueryJob]:
        """
        A 'peek' efficiently accesses a small number of rows in the dataframe.
Collaborator

Update this docstring.

Contributor Author

done

        return self._run_execute_query(sql=sql)

    def get_row_count(self, array_value: bigframes.core.ArrayValue) -> int:
        # optimized plan less likely to have count-destroying operators like filter or join
Collaborator

What does this comment mean? Do you mean that if the count is not None, then we have an optimized plan?

Contributor Author

Removed the comment. The essential idea is to use the row-count metadata from cached executions where possible.
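
Roughly, the idea looks like this (attribute and helper names are guesses, not this PR's exact API):

def get_row_count(self, array_value) -> int:
    plan = self._get_optimized_plan(array_value.node)
    # If caching was substituted in, the leaf may carry an exact row count
    # taken from the materialized table's metadata; no query is needed.
    if isinstance(plan, nodes.LeafNode) and plan.row_count is not None:
        return plan.row_count
    # Otherwise fall back to executing a COUNT(*) over the compiled plan.
    sql = self._compile_row_count_sql(plan)  # hypothetical helper
    results, _ = self._run_execute_query(sql=sql)
    return next(iter(results))[0]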

@@ -218,7 +274,7 @@ def _run_execute_query(
         else:
             raise

-    def _with_cached_executions(self, node: nodes.BigFrameNode) -> nodes.BigFrameNode:
+    def _get_optimized_plan(self, node: nodes.BigFrameNode) -> nodes.BigFrameNode:
Collaborator

Could we get a docstring for this? Or maybe rename? I'd love to understand when we'd use this? It seems we'd always want to use an "optimized" plan, as no downsides are documented here.

Contributor Author

Added a docstring. The only optimization right now is to apply caching. There are no real downsides at this stage, other than mutating the structure of the tree, which is fine at the execution stage (we don't want to do it earlier, since it would interfere with implicit joining).
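
A minimal sketch of that caching substitution (the cache mapping and traversal helper are assumptions, not this PR's API):

def _get_optimized_plan(self, node: nodes.BigFrameNode) -> nodes.BigFrameNode:
    """Swap any subtree that has a cached execution for a leaf node that
    reads the materialized result. Applied only at execution time, since
    rewriting earlier would change tree structure and interfere with
    implicit joining."""
    cached = self._cached_executions.get(node)  # assumed dict of node -> cached leaf
    if cached is not None:
        return cached
    # Recurse into children; transform_children is an assumed traversal helper.
    return node.transform_children(self._get_optimized_plan)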

@TrevorBergeron requested a review from tswast on August 29, 2024 19:53
 @dataclass(frozen=True)
-class ReadTableNode(BigFrameNode):
+class GbqTable:
Collaborator

This is so we can get something hashable? So we can make sure we only have the fields we care about? A docstring with the purpose would be helpful here.
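
For context, a rough sketch of what such a struct could look like (field names are illustrative; the commit history below mentions a classmethod factory and BigQuery-client types for physical_schema):

from dataclasses import dataclass

import google.cloud.bigquery as bigquery

@dataclass(frozen=True)
class GbqTable:
    """Immutable snapshot of BigQuery table metadata.

    frozen=True makes instances hashable, so nodes carrying a GbqTable can
    be compared and used as dict keys (e.g. for cached executions).
    """
    project_id: str
    dataset_id: str
    table_id: str
    physical_schema: tuple  # tuple of bigquery.SchemaField, kept hashable
    n_rows: int

    @classmethod
    def from_table(cls, table: bigquery.Table) -> "GbqTable":
        return cls(
            project_id=table.project,
            dataset_id=table.dataset_id,
            table_id=table.table_id,
            physical_schema=tuple(table.schema),
            n_rows=table.num_rows,
        )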

@tswast enabled auto-merge (squash) on August 30, 2024 21:44
@tswast merged commit 46f2dd7 into main on Aug 30, 2024
21 of 23 checks passed
@tswast deleted the faster_repr branch on August 30, 2024 22:28
arwas11 pushed a commit that referenced this pull request on Sep 3, 2024
* perf: Improve repr performance

* extract gbq metadata from nodes to common struct

* clarify fast head

* fix physical_schema to be bq client types

* add classmethod annotation to GbqTable struct factory method

* add classmethod annotation to GbqTable struct factory method

---------

Co-authored-by: Tim Sweña (Swast) <swast@google.com>