KEMBAR78
bpo-37907: Slightly improve performance of PyLong_AsSsize_t() with large longs by sir-sigurd · Pull Request #15363 · python/cpython · GitHub
Skip to content

Conversation

@sir-sigurd
Copy link
Contributor

@sir-sigurd sir-sigurd commented Aug 21, 2019

Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

This is a completely trivial internal change. I don't think it needs a NEWS entry.

Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

It would be good to add a comment here (and other similar lines below). Something along the lines of

/* The right hand of this comparison if the largest unsigned long value
 * that can be shifted left by PyLong_SHIFT bits without overflow. */

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

IMO it's quite obvious from the code.

Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I would prefer ULONG_MAX rather than (unsigned long)-1. Same comment for (size_t)-1 below.

Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I would prefer ULONG_MAX rather than (unsigned long)-1. Same comment for (size_t)-1 below.

if (i < 0) {
sign = -1;
i = -(i);
}
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Would it be possible to start at "x = v->ob_digit[0];" and "i=1;" to avoid starting the loop with x=0 which means one useless if at the first iteration.

With 64-bit unsigned long and PyLong_SHIFT, we can even combine two digits without having to check for overflow, no?

Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

we can even combine two digits without having to check for overflow, no?

Something like:

/* in the default: case, i >= 2 */
assert(i >= 2);
#if ((ULONG_MAX >> PyLong_SHIFT)) >= ((1UL << PyLong_SHIFT) - 1)
  /* use 2 digits */
  --i;
  x= digit[i];
  x <<=PyLong_SHIFT;
  --i;
  x |= digit[i];
#else
  /* use 1 digit */
  --i
  assert(ULONG_MAX >= ((1UL << PyLong_SHIFT) - 1);
  x= digit[i];
#endif
while (--i >= 0) { ... }

@bedevere-bot
Copy link

A Python core developer has requested some changes be made to your pull request before we can consider merging it. If you could please address their requests along with any other requests in other reviews from core developers that would be appreciated.

Once you have made the requested changes, please leave a comment on this pull request containing the phrase I have made the requested changes; please review again. I will then notify any core developers who have left a review that you're ready for them to take another look at this pull request.

@hongweipeng
Copy link
Contributor

There is a similar overflow check in PyLong_AsLongLongAndOverflow. Can you optimize it together?

@thatbirdguythatuknownot
Copy link
Contributor

Is this pull request supposed to be closed? I'm still getting speed-ups when implemented in the current version of Python.

@mdickinson
Copy link
Member

@thatbirdguythatuknownot

Is this pull request supposed to be closed?

It's awaiting changes from @sir-sigurd.

@eendebakpt
Copy link
Contributor

@sir-sigurd Changes look good to me, but the PR needs a rebase.

@skirpichev
Copy link
Contributor

Continued in #135585

@skirpichev skirpichev closed this Jun 17, 2025
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

awaiting changes performance Performance or resource usage skip news

Projects

None yet

Development

Successfully merging this pull request may close these issues.