Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

gh-130704: Strength reduce LOAD_FAST{_LOAD_FAST} #130708

Merged
merged 81 commits into from
Apr 1, 2025
Merged
Changes from all commits
Commits
Show all changes
81 commits
Select commit Hold shift + click to select a range
d716faa
Experiment with borrowing load_fast
mpage Feb 5, 2025
3736923
Checkpoint poc
mpage Feb 5, 2025
7a14254
Fix pyframe copy
mpage Feb 6, 2025
b1607aa
Strengthen refs when frame is copied
mpage Feb 6, 2025
e765735
Cleanup
mpage Feb 6, 2025
291ace9
Consider all instructions when computing mutations
mpage Feb 6, 2025
17d6dd6
Add a super instruction
mpage Feb 7, 2025
0a74052
Don't optimize during quickening
mpage Feb 12, 2025
afbfd88
Use abstract interpretation
mpage Feb 11, 2025
696c630
Fix test_generators
mpage Feb 12, 2025
483ac7a
Optimize returns
mpage Feb 13, 2025
259d5db
Remove unused arg
mpage Feb 13, 2025
aeafa98
Make sure we convert borrowed refs on frame
mpage Feb 15, 2025
85f9a64
Don't test with malformed bytecode
mpage Feb 15, 2025
b6ab2f7
Make sure we convert borrowed refs to func/code when copying generato…
mpage Feb 15, 2025
fd1ad3d
Add support for disassembling LOAD_FAST_BORROW_LOAD_FAST_BORROW
mpage Feb 15, 2025
eee2195
Make sure exc_obj is always defined
mpage Feb 15, 2025
d75ec9a
Make sure we store new stackrefs for frame executable/funcobj
mpage Feb 19, 2025
66f5351
Remove refcount check
mpage Feb 19, 2025
7ef6a0b
Don't hardcode initial refcount in refcount tests
mpage Feb 19, 2025
2af2bbc
Remove invalid bytecode from `test_peepholer`
mpage Feb 19, 2025
bf19b7d
Fix invalid bytecode in `test_peepholer.DirectCfgOptimizerTests.test_…
mpage Feb 19, 2025
a9bca03
Fix tests that checked for `LOAD_FAST` instructions that are now opti…
mpage Feb 20, 2025
293c317
Update disassembly in test_dis to match new bytecode
mpage Feb 20, 2025
a12ccd9
Fix refleak in _BINARY_OP_INPLACE_ADD_UNICODE
mpage Feb 21, 2025
1ef26c5
Create new references to fast locals overwritten via f_locals
mpage Feb 21, 2025
1eb9226
Implement two missing opcodes in the static analysis
mpage Feb 21, 2025
7291c49
Use g_block_list when resetting stack depth
mpage Feb 21, 2025
90bf8df
Avoid reallocating state for each basic block
mpage Feb 21, 2025
9bfa922
Generators
mpage Feb 21, 2025
bf6222b
Move optimize after all other passes have run
mpage Feb 24, 2025
dd97d0c
Don't promote borrowed references in STORE_FAST
mpage Feb 24, 2025
6680709
Track reasons for not being able to optimize instructions
mpage Feb 24, 2025
6568fd9
Rename PyStackRef_DupDeferred
mpage Feb 24, 2025
6fde7b0
Rename _PyStackRef_StealIfUnborrowed
mpage Feb 25, 2025
de13810
Avoid extra copies in take_ownership
mpage Feb 25, 2025
fdeae7d
Make the default build work
mpage Feb 25, 2025
1aed281
Add docs for new opcodes
mpage Feb 25, 2025
c332912
Fix flag array size computation
mpage Feb 26, 2025
76a75a7
Add a high level comment explaining our approach
mpage Feb 26, 2025
dd6426f
Bump magic number after merge (was bumped on main)
mpage Feb 28, 2025
bf68eb9
Add more tests
mpage Feb 28, 2025
474a587
Update commented out assertion
mpage Feb 28, 2025
4b3aacf
Add NEWS entry
mpage Feb 28, 2025
e692037
Remove debug print
mpage Feb 28, 2025
2ecfe08
Merge branch 'main' into load-fast-borrow-absinterp
mpage Feb 28, 2025
f98d91d
Fix doctest
mpage Feb 28, 2025
725dc8e
Fix JIT tests
mpage Mar 1, 2025
03d35b2
Fix missed doctest
mpage Mar 1, 2025
39ff3f0
Fix narrowing
mpage Mar 1, 2025
f012a9f
Formatting
mpage Mar 1, 2025
b4b7f73
Merge branch 'main' into load-fast-borrow-absinterp
mpage Mar 13, 2025
8ea82b5
Implement PyStackRef_{Is,Make}HeapSafe
mpage Mar 13, 2025
9bec9f5
Add missing error handling
mpage Mar 13, 2025
902ae84
Simplify frees
mpage Mar 13, 2025
b0ea38f
Get rid of `PyStackRef_IsBorrowed`
mpage Mar 13, 2025
1f5cfcd
Use PyStackRef_Borrow as the new API
mpage Mar 14, 2025
d1e8e45
Make the default build work
mpage Mar 14, 2025
00c95cc
Merge branch 'main' into load-fast-borrow-absinterp
mpage Mar 17, 2025
cc01a30
Regen frozenmain
mpage Mar 17, 2025
6c5faab
Add a workaround for failing tests rather than change marshal.c
mpage Mar 17, 2025
32bd0c6
Update dis.rst to reflect support for LOAD_FAST_BORROW in the default…
mpage Mar 17, 2025
fdb8a82
Exclude immortal objects when keeping overwritten locals alive
mpage Mar 17, 2025
a962017
Use a tuple to store overwritten fast locals
mpage Mar 17, 2025
6c2f07d
Fix off-by-one error
mpage Mar 17, 2025
85b0b00
Merge branch 'main' into load-fast-borrow-absinterp
mpage Mar 20, 2025
5ff2dea
Fix post-merge issues
mpage Mar 20, 2025
0c1e67f
English is hard
mpage Mar 20, 2025
ae2ec65
Improve readability of test cases
mpage Mar 20, 2025
03c474e
Elaborate in the blurb
mpage Mar 20, 2025
60665c9
Remove parameter to calculate stackdepth
mpage Mar 20, 2025
ac8940b
Update comment
mpage Mar 20, 2025
f12573f
Test optimize_load_fast as part of OptimizeCfg
mpage Mar 21, 2025
44f7ffc
Remove test with invalid bytecode
mpage Mar 21, 2025
818e94e
Add helper macro for pushing refs
mpage Mar 21, 2025
c30e1e9
Handle opcodes that leave at least one input on the stack
mpage Mar 22, 2025
112cee6
Merge branch 'main' into load-fast-borrow-absinterp
mpage Mar 23, 2025
80fc5aa
Avoid having stackref only visible from the c stack
mpage Mar 23, 2025
492cce1
Merge branch 'main' into load-fast-borrow-absinterp
mpage Mar 24, 2025
2e38f0d
Merge branch 'main' into load-fast-borrow-absinterp
mpage Mar 31, 2025
2c55722
Merge branch 'main' into load-fast-borrow-absinterp
mpage Apr 1, 2025
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
20 changes: 17 additions & 3 deletions Doc/library/dis.rst
Original file line number Diff line number Diff line change
@@ -76,7 +76,7 @@ the following command can be used to display the disassembly of
2 RESUME 0
<BLANKLINE>
3 LOAD_GLOBAL 1 (len + NULL)
LOAD_FAST 0 (alist)
LOAD_FAST_BORROW 0 (alist)
CALL 1
RETURN_VALUE

@@ -215,7 +215,7 @@ Example:
...
RESUME
LOAD_GLOBAL
LOAD_FAST
LOAD_FAST_BORROW
CALL
RETURN_VALUE

@@ -1402,13 +1402,28 @@ iterations of the loop.
This opcode is now only used in situations where the local variable is
guaranteed to be initialized. It cannot raise :exc:`UnboundLocalError`.

.. opcode:: LOAD_FAST_BORROW (var_num)

Pushes a borrowed reference to the local ``co_varnames[var_num]`` onto the
stack.

.. versionadded:: 3.14

.. opcode:: LOAD_FAST_LOAD_FAST (var_nums)

Pushes references to ``co_varnames[var_nums >> 4]`` and
``co_varnames[var_nums & 15]`` onto the stack.

.. versionadded:: 3.13


.. opcode:: LOAD_FAST_BORROW_LOAD_FAST_BORROW (var_nums)

Pushes borrowed references to ``co_varnames[var_nums >> 4]`` and
``co_varnames[var_nums & 15]`` onto the stack.

.. versionadded:: 3.14

.. opcode:: LOAD_FAST_CHECK (var_num)

Pushes a reference to the local ``co_varnames[var_num]`` onto the stack,
@@ -2023,4 +2038,3 @@ instructions:

.. deprecated:: 3.13
All jumps are now relative. This list is empty.

6 changes: 6 additions & 0 deletions Include/internal/pycore_frame.h
Original file line number Diff line number Diff line change
@@ -28,6 +28,12 @@ struct _frame {
PyEval_GetLocals requires a borrowed reference so the actual reference
is stored here */
PyObject *f_locals_cache;
/* A tuple containing strong references to fast locals that were overwritten
* via f_locals. Borrowed references to these locals may exist in frames
* closer to the top of the stack. The references in this tuple act as
* "support" for the borrowed references, ensuring that they remain valid.
*/
PyObject *f_overwritten_fast_locals;
/* The frame data, if this frame object owns the frame */
PyObject *_f_frame_data[1];
};
18 changes: 16 additions & 2 deletions Include/internal/pycore_opcode_metadata.h

Some generated files are not rendered by default. Learn more about how customized files appear on GitHub.

50 changes: 36 additions & 14 deletions Include/internal/pycore_stackref.h
Original file line number Diff line number Diff line change
@@ -172,6 +172,12 @@ PyStackRef_MakeHeapSafe(_PyStackRef ref)
return ref;
}

static inline _PyStackRef
PyStackRef_Borrow(_PyStackRef ref)
{
return PyStackRef_DUP(ref)
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Is there an issue for adding lifetime checking for this?
If not, could you make one and assign it to either yourself or me?

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

}

#define PyStackRef_CLEAR(REF) \
do { \
_PyStackRef *_tmp_op_ptr = &(REF); \
@@ -253,6 +259,25 @@ _PyStackRef_FromPyObjectSteal(PyObject *obj)
}
# define PyStackRef_FromPyObjectSteal(obj) _PyStackRef_FromPyObjectSteal(_PyObject_CAST(obj))

static inline bool
PyStackRef_IsHeapSafe(_PyStackRef stackref)
{
if (PyStackRef_IsDeferred(stackref)) {
PyObject *obj = PyStackRef_AsPyObjectBorrow(stackref);
return obj == NULL || _Py_IsImmortal(obj) || _PyObject_HasDeferredRefcount(obj);
}
return true;
}

static inline _PyStackRef
PyStackRef_MakeHeapSafe(_PyStackRef stackref)
{
if (PyStackRef_IsHeapSafe(stackref)) {
return stackref;
}
PyObject *obj = PyStackRef_AsPyObjectBorrow(stackref);
return (_PyStackRef){ .bits = (uintptr_t)(Py_NewRef(obj)) | Py_TAG_PTR };
}

static inline _PyStackRef
PyStackRef_FromPyObjectStealMortal(PyObject *obj)
@@ -311,25 +336,16 @@ PyStackRef_DUP(_PyStackRef stackref)
{
assert(!PyStackRef_IsNull(stackref));
if (PyStackRef_IsDeferred(stackref)) {
assert(_Py_IsImmortal(PyStackRef_AsPyObjectBorrow(stackref)) ||
_PyObject_HasDeferredRefcount(PyStackRef_AsPyObjectBorrow(stackref))
);
return stackref;
}
Py_INCREF(PyStackRef_AsPyObjectBorrow(stackref));
return stackref;
}

static inline bool
PyStackRef_IsHeapSafe(_PyStackRef ref)
{
return true;
}

static inline _PyStackRef
PyStackRef_MakeHeapSafe(_PyStackRef ref)
PyStackRef_Borrow(_PyStackRef stackref)
{
return ref;
return (_PyStackRef){ .bits = stackref.bits | Py_TAG_DEFERRED };
}

// Convert a possibly deferred reference to a strong reference.
@@ -399,7 +415,6 @@ static inline void PyStackRef_CheckValid(_PyStackRef ref) {
assert(!_Py_IsStaticImmortal(obj));
break;
case Py_TAG_REFCNT:
assert(obj == NULL || _Py_IsImmortal(obj));
break;
default:
assert(0);
@@ -413,21 +428,28 @@ static inline void PyStackRef_CheckValid(_PyStackRef ref) {
#endif

#ifdef _WIN32
#define PyStackRef_RefcountOnObject(REF) (((REF).bits & Py_TAG_BITS) == 0)
#define PyStackRef_RefcountOnObject(REF) (((REF).bits & Py_TAG_REFCNT) == 0)
#define PyStackRef_AsPyObjectBorrow BITS_TO_PTR_MASKED
#define PyStackRef_Borrow(REF) (_PyStackRef){ .bits = ((REF).bits) | Py_TAG_REFCNT};
#else
/* Does this ref not have an embedded refcount and thus not refer to a declared immmortal object? */
static inline int
PyStackRef_RefcountOnObject(_PyStackRef ref)
{
return (ref.bits & Py_TAG_BITS) == 0;
return (ref.bits & Py_TAG_REFCNT) == 0;
}

static inline PyObject *
PyStackRef_AsPyObjectBorrow(_PyStackRef ref)
{
return BITS_TO_PTR_MASKED(ref);
}

static inline _PyStackRef
PyStackRef_Borrow(_PyStackRef ref)
{
return (_PyStackRef){ .bits = ref.bits | Py_TAG_REFCNT };
}
#endif

static inline PyObject *
Loading
Loading