llvm-project

mirror of https://github.com/encounter/llvm-project.git synced 2026-03-30 11:27:19 -07:00

Author	SHA1	Message	Date
Nikita Popov	9ebeac6788	[ConstantRange][CVP] Make use of abs poison flag Pass the abs poison flag to the underlying ConstantRange implementation, allowing CVP to simplify based on it. Importantly, this recognizes that abs with poison flag is actually non-negative...	2020-07-30 23:06:10 +02:00
Nikita Popov	94f8120cb9	[ConstantRange] Support abs with poison flag This just adds the ConstantRange support, including exhaustive testing. It's not wired up to the IR intrinsic flag yet.	2020-07-30 22:49:28 +02:00
Nikita Popov	d8a98a9c35	[ConstantRange][CVP] Compute min/max/abs intrinsic ranges Wire up ConstantRange::intrinsic() to the existing primitives for min, max and abs. The poison flag on abs is not yet taken into account.	2020-07-30 22:21:34 +02:00
Nikita Popov	897bdca4b8	[ConstantRange] Add API for intrinsics (NFC) This adds a common API for compute constant ranges of intrinsics. The intention here is that a) we can reuse the same code across different passes that handle constant ranges, i.e. this can be reused in SCCP b) we only have to add knowledge about supported intrinsics to ConstantRange, not any consumers. Differential Revision: https://reviews.llvm.org/D84587	2020-07-29 22:16:27 +02:00
Florian Hahn	d307174e1d	[ConstantRange] Use APInt::or/APInt::and for single elements. Currently ConstantRange::binaryAnd/binaryOr results are too pessimistic for single element constant ranges. If both operands are single element ranges, we can use APInt's AND and OR implementations directly. Note that some other binary operations on constant ranges can cover the single element cases naturally, but for OR and AND this unfortunately is not the case. Reviewers: nikic, spatel, lebedev.ri Reviewed By: spatel Differential Revision: https://reviews.llvm.org/D76446	2020-04-01 09:50:24 +01:00
Florian Hahn	7caba33907	[ConstantRange] Add initial support for binaryXor. The initial implementation just delegates to APInt's implementation of XOR for single element ranges and conservatively returns the full set otherwise. Reviewers: nikic, spatel, lebedev.ri Reviewed By: nikic Differential Revision: https://reviews.llvm.org/D76453	2020-03-24 12:59:50 +00:00
Florian Hahn	b35c585a9a	[ConstantRange] Respect destination bitwidth for cast results. We returning a full set, we should use ResultBitWidth. Otherwise we might it assertions when the resulting constant ranges are used later on. Reviewers: nikic, spatel, reames Reviewed By: nikic Differential Revision: https://reviews.llvm.org/D71937	2019-12-27 17:38:34 +00:00
Roman Lebedev	9a20c79ddc	[NFC][KnownBits] Add getMinValue() / getMaxValue() methods As it can be seen from accompanying cleanup, it is not unheard of to write `~Known.Zero` meaning "what maximal value can this KnownBits produce". But i think `~Known.Zero` isn't that self-explanatory, as compared to a method with a name. Note that not all `~Known.Zero` places were cleaned up, only those where this arguably improves things.	2019-12-03 20:04:51 +03:00
Roman Lebedev	7dddfa2a9c	[NFC] ConstantRange::subWithNoWrap(): fixup comment	2019-11-08 17:52:43 +03:00
Roman Lebedev	5a9fd76d2f	[ConstantRange] Add umul_sat()/smul_sat() methods Summary: To be used in `ConstantRange::mulWithNoOverflow()`, may in future be useful for when saturating shift/mul ops are added. These are precise as far as i can tell. I initially though i will need `APInt::[us]mul_sat()` for these, but it turned out much simpler to do what `ConstantRange::multiply()` does - perform multiplication in twice the bitwidth, and then truncate. Though here we want saturating signed truncation. Reviewers: nikic, reames, spatel Reviewed By: nikic Subscribers: hiraditya, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D69994	2019-11-08 17:52:43 +03:00
Roman Lebedev	72a21ad6c9	[CR] ConstantRange::sshl_sat(): check sigdness of the min/max, not ranges This was pointed out in review, but forgot to stage this change into the commit itself..	2019-11-08 10:32:56 +03:00
Roman Lebedev	e0ea842bae	[ConstantRange] Add `ushl_sat()`/`sshl_sat()` methods. Summary: To be used in `ConstantRange::shlWithNoOverflow()`, may in future be useful for when saturating shift/mul ops are added. Unlike `ConstantRange::shl()`, these are precise. Reviewers: nikic, spatel, reames Reviewed By: nikic Subscribers: hiraditya, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D69960	2019-11-08 10:31:04 +03:00
Roman Lebedev	69ce2ae990	[ConstantRange][LVI] Use overflow flags from `sub` to constrain the range Summary: This notably improves non-negativity deduction: ``` \| statistic \| old \| new \| delta \| % change \| \| correlated-value-propagation.NumAShrs \| 209 \| 227 \| 18 \| 8.6124% \| \| correlated-value-propagation.NumAddNSW \| 4972 \| 4988 \| 16 \| 0.3218% \| \| correlated-value-propagation.NumAddNUW \| 7141 \| 7148 \| 7 \| 0.0980% \| \| correlated-value-propagation.NumAddNW \| 12113 \| 12136 \| 23 \| 0.1899% \| \| correlated-value-propagation.NumAnd \| 442 \| 445 \| 3 \| 0.6787% \| \| correlated-value-propagation.NumNSW \| 7160 \| 7176 \| 16 \| 0.2235% \| \| correlated-value-propagation.NumNUW \| 13306 \| 13316 \| 10 \| 0.0752% \| \| correlated-value-propagation.NumNW \| 20466 \| 20492 \| 26 \| 0.1270% \| \| correlated-value-propagation.NumSDivs \| 207 \| 212 \| 5 \| 2.4155% \| \| correlated-value-propagation.NumSExt \| 6279 \| 6679 \| 400 \| 6.3704% \| \| correlated-value-propagation.NumSRems \| 28 \| 29 \| 1 \| 3.5714% \| \| correlated-value-propagation.NumShlNUW \| 2793 \| 2796 \| 3 \| 0.1074% \| \| correlated-value-propagation.NumShlNW \| 3964 \| 3967 \| 3 \| 0.0757% \| \| correlated-value-propagation.NumUDivs \| 353 \| 358 \| 5 \| 1.4164% \| \| instcount.NumAShrInst \| 13763 \| 13741 \| -22 \| -0.1598% \| \| instcount.NumAddInst \| 277349 \| 277348 \| -1 \| -0.0004% \| \| instcount.NumLShrInst \| 27437 \| 27463 \| 26 \| 0.0948% \| \| instcount.NumOrInst \| 102677 \| 102678 \| 1 \| 0.0010% \| \| instcount.NumSDivInst \| 8732 \| 8727 \| -5 \| -0.0573% \| \| instcount.NumSExtInst \| 80872 \| 80468 \| -404 \| -0.4996% \| \| instcount.NumSRemInst \| 1679 \| 1678 \| -1 \| -0.0596% \| \| instcount.NumTruncInst \| 62154 \| 62153 \| -1 \| -0.0016% \| \| instcount.NumUDivInst \| 2526 \| 2527 \| 1 \| 0.0396% \| \| instcount.NumURemInst \| 1589 \| 1590 \| 1 \| 0.0629% \| \| instcount.NumZExtInst \| 69405 \| 69809 \| 404 \| 0.5821% \| \| instcount.TotalInsts \| 7439575 \| 7439574 \| -1 \| 0.0000% \| ``` Reviewers: nikic, reames, spatel Reviewed By: nikic Subscribers: hiraditya, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D69942	2019-11-07 16:18:03 +03:00
Roman Lebedev	7fbe5d4b2a	[ConstantRange] Add `subWithNoWrap()` method Summary: Much like D67339, adds ConstantRange handling for when we know no-wrap behavior of the `sub`. Unlike addWithNoWrap(), we only get lucky re returning empty set for signed wrap. For unsigned, we must perform overflow check manually. A patch that makes use of this in LVI (CVP) to be posted later. Reviewers: nikic, shchenz, efriedma Reviewed By: nikic Subscribers: hiraditya, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D69918	2019-11-07 01:30:53 +03:00
Roman Lebedev	365d729e10	[ConstantRange] Cleanup addWithNoWrap() by just piggybacking on sadd_sat()/uadd_sat() As discussed in https://reviews.llvm.org/D69918 that happens to work as intended, and returns empty set if there is always an overflow because we get lucky with intersection. Since there's now an explicit test for that, let's prefer cleaner code.	2019-11-07 01:30:53 +03:00
Roman Lebedev	1f665046fb	[LVI][CVP] LazyValueInfoImpl::solveBlockValueBinaryOp(): use no-wrap flags from `add` op Summary: This was suggested in https://reviews.llvm.org/D69277#1717210 In this form (this is what was suggested, right?), the results aren't staggering (especially since given LVI cross-block focus) this does catch some things (as per test-suite), but not too much: \| statistic \| old \| new \| delta \| % change \| \| correlated-value-propagation.NumAddNSW \| 4981 \| 4982 \| 1 \| 0.0201% \| \| correlated-value-propagation.NumAddNW \| 12125 \| 12126 \| 1 \| 0.0082% \| \| correlated-value-propagation.NumCmps \| 1199 \| 1202 \| 3 \| 0.2502% \| \| correlated-value-propagation.NumDeadCases \| 112 \| 111 \| -1 \| -0.8929% \| \| correlated-value-propagation.NumMulNSW \| 275 \| 278 \| 3 \| 1.0909% \| \| correlated-value-propagation.NumMulNUW \| 1323 \| 1326 \| 3 \| 0.2268% \| \| correlated-value-propagation.NumMulNW \| 1598 \| 1604 \| 6 \| 0.3755% \| \| correlated-value-propagation.NumNSW \| 7158 \| 7167 \| 9 \| 0.1257% \| \| correlated-value-propagation.NumNUW \| 13304 \| 13310 \| 6 \| 0.0451% \| \| correlated-value-propagation.NumNW \| 20462 \| 20477 \| 15 \| 0.0733% \| \| correlated-value-propagation.NumOverflows \| 4 \| 7 \| 3 \| 75.0000% \| \| correlated-value-propagation.NumPhis \| 15366 \| 15381 \| 15 \| 0.0976% \| \| correlated-value-propagation.NumSExt \| 6273 \| 6277 \| 4 \| 0.0638% \| \| correlated-value-propagation.NumShlNSW \| 1172 \| 1171 \| -1 \| -0.0853% \| \| correlated-value-propagation.NumShlNUW \| 2793 \| 2794 \| 1 \| 0.0358% \| \| correlated-value-propagation.NumSubNSW \| 730 \| 736 \| 6 \| 0.8219% \| \| correlated-value-propagation.NumSubNUW \| 2044 \| 2046 \| 2 \| 0.0978% \| \| correlated-value-propagation.NumSubNW \| 2774 \| 2782 \| 8 \| 0.2884% \| \| instcount.NumAddInst \| 277586 \| 277569 \| -17 \| -0.0061% \| \| instcount.NumAndInst \| 66056 \| 66054 \| -2 \| -0.0030% \| \| instcount.NumBrInst \| 709147 \| 709146 \| -1 \| -0.0001% \| \| instcount.NumCallInst \| 528579 \| 528576 \| -3 \| -0.0006% \| \| instcount.NumExtractValueInst \| 18307 \| 18301 \| -6 \| -0.0328% \| \| instcount.NumOrInst \| 102660 \| 102665 \| 5 \| 0.0049% \| \| instcount.NumPHIInst \| 318008 \| 318007 \| -1 \| -0.0003% \| \| instcount.NumSelectInst \| 46373 \| 46370 \| -3 \| -0.0065% \| \| instcount.NumSExtInst \| 79496 \| 79488 \| -8 \| -0.0101% \| \| instcount.NumShlInst \| 40654 \| 40657 \| 3 \| 0.0074% \| \| instcount.NumTruncInst \| 62251 \| 62249 \| -2 \| -0.0032% \| \| instcount.NumZExtInst \| 68211 \| 68221 \| 10 \| 0.0147% \| \| instcount.TotalBlocks \| 843910 \| 843909 \| -1 \| -0.0001% \| \| instcount.TotalInsts \| 7387448 \| 7387423 \| -25 \| -0.0003% \| Reviewers: nikic, reames Reviewed By: nikic Subscribers: hiraditya, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D69321	2019-10-23 18:17:32 +03:00
Roman Lebedev	4b6223263a	[ConstantRange] makeGuaranteedNoWrapRegion(): `shl` support Summary: If all the shifts amount are already poison-producing, then we can add more poison-producing flags ontop: https://rise4fun.com/Alive/Ocwi Otherwise, we should only consider the possible range of shift amts that don't result in poison. For unsigned range not not overflow, we must not shift out any set bits, and the actual limit for `x` can be computed by backtransforming the maximal value we could ever get out of the `shl` - `-1` through `lshr`. If the `x` is any larger than that then it will overflow. Likewise for signed range, but just in signed domain.. This is based on the general idea outlined by @nikic in https://reviews.llvm.org/D68672#1714990 Reviewers: nikic, sanjoy Reviewed By: nikic Subscribers: hiraditya, llvm-commits, nikic Tags: #llvm Differential Revision: https://reviews.llvm.org/D69217 llvm-svn: 375370	2019-10-20 19:36:55 +00:00
Chen Zheng	9806a1d5f9	[ConstantRange] [NFC] replace addWithNoSignedWrap with addWithNoWrap. llvm-svn: 374016	2019-10-08 03:00:31 +00:00
Chen Zheng	bfec045626	[ConstantRange] add helper function addWithNoWrap(). Differential Revision: https://reviews.llvm.org/D67339 llvm-svn: 373205	2019-09-30 12:57:53 +00:00
Nikita Popov	c061b99c5b	[ConstantRange] Add sdiv() support The implementation is conceptually simple: We separate the LHS and RHS into positive and negative components and then also compute the positive and negative components of the result, taking into account that e.g. only pos/pos and neg/neg will give a positive result. However, there's one significant complication: SignedMin / -1 is UB for sdiv, and we can't just ignore it, because the APInt result of SignedMin would break the sign segregation. Instead we drop SignedMin or -1 from the corresponding ranges, taking into account some edge cases with wrapped ranges. Because of the sign segregation, the implementation ends up being nearly fully precise even for wrapped ranges (the remaining imprecision is due to ranges that are both signed and unsigned wrapping and are divided by a trivial divisor like 1). This means that the testing cannot just check the signed envelope as we usually do. Instead we collect all possible results in a bitvector and construct a better sign wrapped range (than the full envelope). Differential Revision: https://reviews.llvm.org/D61238 llvm-svn: 362430	2019-06-03 18:19:54 +00:00
Nikita Popov	332c100562	[ValueTracking][ConstantRange] Distinguish low/high always overflow In order to fold an always overflowing signed saturating add/sub, we need to know in which direction the always overflow occurs. This patch splits up AlwaysOverflows into AlwaysOverflowsLow and AlwaysOverflowsHigh to pass through this information (but it is not used yet). Differential Revision: https://reviews.llvm.org/D62463 llvm-svn: 361858	2019-05-28 18:08:31 +00:00
Nikita Popov	8b1fa07639	[CVP] Remove unnecessary checks for empty GNWR; NFC The guaranteed no-wrap region is never empty, it always contains at least zero, so these optimizations don't ever apply. To make this more obviously true, replace the conversative return in makeGNWR with an assertion. llvm-svn: 361698	2019-05-25 14:11:55 +00:00
Nikita Popov	f610110f1a	[ConstantRange] Simplify makeGNWR implementation; NFC Compute results in more direct ways, avoid subset intersect operations. Extract the core code for computing mul nowrap ranges into separate static functions, so they can be reused. llvm-svn: 360189	2019-05-07 20:34:46 +00:00
Nikita Popov	d5a403fb80	[ConstantRange] Add srem() support Add support for srem() to ConstantRange so we can use it in LVI. For srem the sign of the result matches the sign of the LHS. For the RHS only the absolute value is important. Apart from that the logic is like urem. Just like for urem this is only an approximate implementation. The tests check a few specific cases and run an exhaustive test for conservative correctness (but not exactness). Differential Revision: https://reviews.llvm.org/D61207 llvm-svn: 360055	2019-05-06 16:59:37 +00:00
Nikita Popov	7a94795b2b	[ConstantRange] Add makeExactNoWrapRegion() I got confused on the terminology, and the change in D60598 was not correct. I was thinking of "exact" in terms of the result being non-approximate. However, the relevant distinction here is whether the result is * Largest range such that: Forall Y in Other: Forall X in Result: X BinOp Y does not wrap. (makeGuaranteedNoWrapRegion) * Smallest range such that: Forall Y in Other: Forall X not in Result: X BinOp Y wraps. (A hypothetical makeAllowedNoWrapRegion) * Both. (makeExactNoWrapRegion) I'm adding a separate makeExactNoWrapRegion method accepting a single APInt (same as makeExactICmpRegion) and using it in the places where the guarantee is relevant. Differential Revision: https://reviews.llvm.org/D60960 llvm-svn: 359402	2019-04-28 15:40:56 +00:00

1 2 3 4 5

103 Commits