gh-131525: Cache the result of tuple_hash #131529
Conversation
Objects/tupleobject.c
Outdated
```diff
@@ -530,6 +547,8 @@ tuple_repeat(PyObject *self, Py_ssize_t n)
     if (np == NULL)
         return NULL;

+    _PyTuple_RESET_HASH_CACHE(np);
```
Is there any reason not to just move this into `tuple_alloc()` itself, instead of repeating it everywhere `tuple_alloc()` is called?
I don't personally have an objection. Most `*_alloc` functions only do allocation, not initialization, so I didn't want to confuse that. But the cached hash is kind of a special case...
Objects/tupleobject.c
Outdated
```c
static Py_hash_t
tuple_hash(PyObject *op)
{
    PyTupleObject *v = _PyTuple_CAST(op);

    // For the empty singleton, we don't need to dereference the pointer
    if (op == (PyObject *)&_Py_SINGLETON(tuple_empty)) {
```
You could set the hash of `()` statically in `pycore_runtime_init.h`; then this test would be unnecessary.
Yes, but interestingly, this is measurably faster -- it doesn't have to chase the pointer.
Seems like the difference here is in the noise, so might as well do this at build time and skip this extra branch.
Just cosmetic nits, and I think it's good.
Note: it's also possible to use a static inline function instead for `_PyTuple_RESET_HASH_CACHE`, but that would mean that `*op` needs to be a `PyObject`, and I think not all calls pass a true `PyObject`.
Co-authored-by: Bénédikt Tran <10796600+picnixz@users.noreply.github.com>
Objects/tupleobject.c
Outdated
```c
static Py_hash_t
tuple_hash(PyObject *op)
{
    PyTupleObject *v = _PyTuple_CAST(op);

    if (v->ob_hash != -1) {
```
The loads of `ob_hash` need relaxed memory ordering, i.e. use `FT_ATOMIC_LOAD_SSIZE_RELAXED`.
Still learning the free-threaded conventions. Can you explain why (or point me at an explanation)? Since the write is atomic and we don't care if there is a race (worst case, the hash gets recomputed), and the thread sanitizer CI isn't complaining, I don't see why this would be necessary -- but also I'm new to this stuff so maybe I'm missing something.
The `FT_...` macros translate to plain loads/stores on the default build, so there is no overhead there.
It is necessary because accessing a field from multiple threads without any kind of synchronization is undefined behavior.
Even on the free-threaded build, the `RELAXED` part means that there is no synchronization at the hardware level on most platforms; it just convinces the compiler and sanitizers that this is OK.
When you're done making the requested changes, leave the comment:
I have made the requested changes; please review again
Thanks for making the requested changes! @picnixz, @kumaraditya303: please review the changes made to this pull request.
We should probably just remove the old comments and keep one in
This PR changes 102 files, which makes it hard to review. Most files are generated by Argument Clinic. Would it be possible to have a first PR which only adds the
GitHub seems to be doing the right thing by hiding the generated changes. Is that not happening for you? I'd be happy to rearrange the commits, but I don't think it makes sense to merge a first PR if the second one may not be approved. Let me know, as splitting would also invalidate the existing review comments on this PR.
Co-authored-by: Chris Eibl <138194463+chris-eibl@users.noreply.github.com>
Back in 2013, it was determined that caching the result of tuple_hash did not yield any significant speedup.
However, a lot has changed since then, and in a recent experiment that added a tuple hash cache back in, the mdp benchmark improved by 86%.
Admittedly, there was no measurable improvement on any other benchmark, but it also seems to have no downside, including for memory usage as measured with max_rss.