llvm-capstone

mirror of https://github.com/capstone-engine/llvm-capstone.git synced 2024-11-23 22:00:10 +00:00

Author	SHA1	Message	Date
Kazu Hirata	ca4af13e48	[clang] Don't use Optional::getValue (NFC)	2022-06-20 22:59:26 -07:00
Kazu Hirata	0916d96d12	Don't use Optional::hasValue (NFC)	2022-06-20 20:17:57 -07:00
Kazu Hirata	064a08cd95	Don't use Optional::hasValue (NFC)	2022-06-20 20:05:16 -07:00
Kazu Hirata	5413bf1bac	Don't use Optional::hasValue (NFC)	2022-06-20 11:33:56 -07:00
Kazu Hirata	452db157c9	[clang] Don't use Optional::hasValue (NFC)	2022-06-20 10:51:34 -07:00
Balázs Kéri	60f3b07118	[clang][analyzer] Add checker for bad use of 'errno'. Extend checker 'ErrnoModeling' with a state of 'errno' to indicate the importance of the 'errno' value and how it should be used. Add a new checker 'ErrnoChecker' that observes use of 'errno' and finds possible wrong uses, based on the "errno state". The "errno state" should be set (together with value of 'errno') by other checkers (that perform modeling of the given function) in the future. Currently only a test function can set this value. The new checker has no user-observable effect yet. Reviewed By: martong, steakhal Differential Revision: https://reviews.llvm.org/D122150	2022-06-20 10:07:31 +02:00
Kazu Hirata	06decd0b41	[clang] Use value_or instead of getValueOr (NFC)	2022-06-18 23:21:34 -07:00
isuckatcs	e77ac66b8c	[Static Analyzer] Structured binding to data members Introducing structured binding to data members. Differential Revision: https://reviews.llvm.org/D127643	2022-06-17 19:50:10 +02:00
isuckatcs	92bf652d40	[Static Analyzer] Small array binding policy If a lazyCompoundVal to a struct is bound to the store, there is a policy which decides whether a copy gets created instead. This patch introduces a similar policy for arrays, which is required to model structured binding to arrays without false negatives. Differential Revision: https://reviews.llvm.org/D128064	2022-06-17 18:56:13 +02:00
Jennifer Yu	bb83f8e70b	[OpenMP] Initial parsing and sema for 'parallel masked' construct Differential Revision: https://reviews.llvm.org/D127454	2022-06-16 18:01:15 -07:00
Balazs Benics	929e60b6bd	[analyzer] Relax constraints on const qualified regions The arithmetic restriction seems to be artificial. The comment below seems to be stale. Thus, we remove both. Depends on D127306. Reviewed By: martong Differential Revision: https://reviews.llvm.org/D127763	2022-06-15 17:08:27 +02:00
Balazs Benics	f4fc3f6ba3	[analyzer] Treat system globals as mutable if they are not const Previously, system globals were treated as immutable regions, unless it was the `errno` which is known to be frequently modified. D124244 wants to add a check for stores to immutable regions. It would basically turn all stores to system globals into an error even though we have no reason to believe that those mutable sys globals should be treated as if they were immutable. And this leads to false-positives if we apply D124244. In this patch, I'm proposing to treat mutable sys globals actually mutable, hence allocate them into the `GlobalSystemSpaceRegion`, UNLESS they were declared as `const` (and a primitive arithmetic type), in which case, we should use `GlobalImmutableSpaceRegion`. In any other cases, I'm using the `GlobalInternalSpaceRegion`, which is no different than the previous behavior. --- In the tests I added, only the last `expected-warning` was different, compared to the baseline. Which is this: ```lang=C++ void test_my_mutable_system_global_constraint() { assert(my_mutable_system_global > 2); clang_analyzer_eval(my_mutable_system_global > 2); // expected-warning {{TRUE}} invalidate_globals(); clang_analyzer_eval(my_mutable_system_global > 2); // expected-warning {{UNKNOWN}} It was previously TRUE. } void test_my_mutable_system_global_assign(int x) { my_mutable_system_global = x; clang_analyzer_eval(my_mutable_system_global == x); // expected-warning {{TRUE}} invalidate_globals(); clang_analyzer_eval(my_mutable_system_global == x); // expected-warning {{UNKNOWN}} It was previously TRUE. } ``` --- Unfortunately, the taint checker will be also affected. The `stdin` global variable is a pointer, which is assumed to be a taint source, and the rest of the taint propagation rules will propagate from it. However, since mutable variables are no longer treated immutable, they also get invalidated, when an opaque function call happens, such as the first `scanf(stdin, ...)`. This would effectively remove taint from the pointer, consequently disable all the rest of the taint propagations down the line from the `stdin` variable. All that said, I decided to look through `DerivedSymbol`s as well, to acquire the memregion in that case as well. This should preserve the previously existing taint reports. Reviewed By: martong Differential Revision: https://reviews.llvm.org/D127306	2022-06-15 17:08:27 +02:00
Balazs Benics	96ccb690a0	[analyzer][NFC] Prefer using isa<> instead getAs<> in conditions Depends on D125709 Reviewed By: martong Differential Revision: https://reviews.llvm.org/D127742	2022-06-15 16:58:13 +02:00
Balazs Benics	481f860324	[analyzer][NFC] Remove dead field of UnixAPICheckers Initially, I thought there is some fundamental bug here by not using the bool fields, but it turns out D55425 split this checker into two separate ones; making these fields dead. Depends on D127836, which uncovered this issue. Reviewed By: martong Differential Revision: https://reviews.llvm.org/D127838	2022-06-15 16:50:12 +02:00
Balazs Benics	6c4f9998ae	[analyzer] Fix StreamErrorState hash bug The `Profile` function was incorrectly implemented. The `StreamErrorState` has an implicit `bool` conversion operator, which will result in a different hash than faithfully hashing the raw value of the enum. I don't have a test for it, since it seems difficult to find one. Even if we would have one, any change in the hashing algorithm would have a chance of breaking it, so I don't think it would justify the effort. Depends on D127836, which uncovered this issue by marking the related `Profile` function dead. Reviewed By: martong, balazske Differential Revision: https://reviews.llvm.org/D127839	2022-06-15 16:50:12 +02:00
Balazs Benics	f1b18a79b7	[analyzer][NFC] Remove dead code and modernize surroundings Thanks @kazu for helping me clean these parts in D127799. I'm leaving the dump methods, along with the unused visitor handlers and the forwarding methods. The dead parts actually helped to uncover two bugs, to which I'm going to post separate patches. Reviewed By: martong Differential Revision: https://reviews.llvm.org/D127836	2022-06-15 16:50:12 +02:00
Balazs Benics	40940fb2a6	[analyzer][NFC] Substitute the SVal::evalMinus and evalComplement functions Depends on D126127 Reviewed By: martong Differential Revision: https://reviews.llvm.org/D127734	2022-06-14 18:56:43 +02:00
Balazs Benics	cfc915149c	[analyzer][NFC] Relocate unary transfer functions This is an initial step of removing the SimpleSValBuilder abstraction. The SValBuilder alone should be enough. Reviewed By: martong Differential Revision: https://reviews.llvm.org/D126127	2022-06-14 18:56:43 +02:00
Balazs Benics	de6ba9704d	[analyzer][Casting] Support isa, cast, dyn_cast of SVals This change specializes the LLVM RTTI mechanism for SVals. After this change, we can use the well-known `isa`, `cast`, `dyn_cast`. Examples: // SVal V = ...; // Loc MyLoc = ...; bool IsInteresting = isa<loc::MemRegionVal, loc::GotoLabel>(MyLoc); auto MRV = cast<loc::MemRegionVal>(MyLoc); Optional<loc::MemRegionVal> MaybeMRV = dyn_cast<loc::MemRegionVal>(V) The current `SVal::getAs` and `castAs` member functions are redundant at this point, but I believe that they are still handy. The member function version is terse and reads left-to-right, which IMO is a great plus. However, we should probably add a variadic `isa` member function version to have the same casting API in both cases. Thanks for the extensive TMP help @bzcheeseman! Reviewed By: bzcheeseman Differential Revision: https://reviews.llvm.org/D125709	2022-06-14 13:43:04 +02:00
Balazs Benics	ffe7950ebc	Reland "[analyzer] Deprecate `-analyzer-store region` flag" I'm trying to remove unused options from the `Analyses.def` file, then merge the rest of the useful options into the `AnalyzerOptions.def`. Then make sure one can set these by an `-analyzer-config XXX=YYY` style flag. Then surface the `-analyzer-config` to the `clang` frontend; After all of this, we can pursue the tablegen approach described https://discourse.llvm.org/t/rfc-tablegen-clang-static-analyzer-engine-options-for-better-documentation/61488 In this patch, I'm proposing flag deprecations. We should support deprecated analyzer flags for exactly one release. In this case I'm planning to drop this flag in `clang-16`. In the clang frontend, now we won't pass this option to the cc1 frontend, rather emit a warning diagnostic reminding the users about this deprecated flag, which will be turned into error in clang-16. Unfortunately, I had to remove all the tests referring to this flag, causing a mass change. I've also added a test for checking this warning. I've seen that `scan-build` also uses this flag, but I think we should remove that part only after we turn this into a hard error. Reviewed By: martong Differential Revision: https://reviews.llvm.org/D126215	2022-06-14 09:20:41 +02:00
Kazu Hirata	f13019f836	[clang] Use any_of and none_of (NFC)	2022-06-12 10:17:12 -07:00
Kazu Hirata	f5ef2c5838	[clang] Convert for_each to range-based for loops (NFC)	2022-06-10 22:39:45 -07:00
Nico Weber	8406839d19	Revert "[analyzer] Deprecate `-analyzer-store region` flag" This reverts commit `d50d9946d1`. Broke check-clang, see comments on https://reviews.llvm.org/D126067 Also revert dependent change "[analyzer] Deprecate the unused 'analyzer-opt-analyze-nested-blocks' cc1 flag" This reverts commit `07b4a6d046`. Also revert "[analyzer] Fix buildbots after introducing a new frontend warning" This reverts commit `90374df15d`. (See https://reviews.llvm.org/rG90374df15ddc58d823ca42326a76f58e748f20eb)	2022-06-10 08:50:13 -04:00
Balazs Benics	b73c2280f5	[analyzer][NFC] Remove unused RegionStoreFeatures Reviewed By: martong Differential Revision: https://reviews.llvm.org/D126216	2022-06-10 13:02:26 +02:00
Balazs Benics	d50d9946d1	[analyzer] Deprecate `-analyzer-store region` flag I'm trying to remove unused options from the `Analyses.def` file, then merge the rest of the useful options into the `AnalyzerOptions.def`. Then make sure one can set these by an `-analyzer-config XXX=YYY` style flag. Then surface the `-analyzer-config` to the `clang` frontend; After all of this, we can pursue the tablegen approach described https://discourse.llvm.org/t/rfc-tablegen-clang-static-analyzer-engine-options-for-better-documentation/61488 In this patch, I'm proposing flag deprecations. We should support deprecated analyzer flags for exactly one release. In this case I'm planning to drop this flag in `clang-16`. In the clang frontend, now we won't pass this option to the cc1 frontend, rather emit a warning diagnostic reminding the users about this deprecated flag, which will be turned into error in clang-16. Unfortunately, I had to remove all the tests referring to this flag, causing a mass change. I've also added a test for checking this warning. I've seen that `scan-build` also uses this flag, but I think we should remove that part only after we turn this into a hard error. Reviewed By: martong Differential Revision: https://reviews.llvm.org/D126215	2022-06-10 12:57:15 +02:00
Balazs Benics	07a7fd314a	[analyzer] Print the offending function at EndAnalysis crash I've faced crashes in the past multiple times when some `check::EndAnalysis` callback caused some crash. It's really anoying that it doesn't tell which function triggered this callback. This patch adds the well-known trace for that situation as well. Example: 1. <eof> parser at end of file 2. While analyzing stack: #0 Calling test11 Note that this does not have tests. I've considered `unittests` for this purpose, by using the `ASSERT_DEATH()` similarly how we check double eval called functions in `ConflictingEvalCallsTest.cpp`, however, that the testsuite won't invoke the custom handlers. Only the message of the `llvm_unreachable()` will be printed. Consequently, it's not applicable for us testing this feature. I've also considered using an end-to-end LIT test for this. For that, we would need to somehow overload the `clang_analyzer_crash()` `ExprInspection` handler, to get triggered by other events than the `EvalCall`. I'm not saying that we could not come up with a generic way of causing crash in a specific checker callback, but I'm not sure if that would worth the effort. Reviewed By: martong Differential Revision: https://reviews.llvm.org/D127389	2022-06-10 12:21:17 +02:00
Gabor Marton	bc2c759aee	[analyzer] Fix assertion failure after getKnownValue call Depends on D126560. `getKnownValue` has been changed by the parent patch in a way that simplification was removed. This is not correct when the function is called by the Checkers. Thus, a new internal function is introduced, `getConstValue`, which simply queries the constraint manager. This `getConstValue` is used internally in the `SimpleSValBuilder` when a binop is evaluated, this way we avoid the recursion into the `Simplifier`. Differential Revision: https://reviews.llvm.org/D127285	2022-06-09 16:13:57 +02:00
Vince Bridgers	c7fa4e8a8b	[analyzer] Fix null pointer deref in CastValueChecker A crash was seen in CastValueChecker due to a null pointer dereference. The fix uses QualType::getAsString to avoid the null dereference when a CXXRecordDecl cannot be obtained. A small reproducer is added, and cast value notes LITs are updated for the new debug messages. Reviewed By: steakhal Differential Revision: https://reviews.llvm.org/D127105	2022-06-07 13:34:06 -04:00
Gabor Marton	8131ee4c43	[analyzer] Remove NotifyAssumeClients Depends on D126560. Differential Revision: https://reviews.llvm.org/D126878	2022-06-07 13:02:03 +02:00
Gabor Marton	17e9ea6138	[analyzer][NFC] Add LLVM_UNLIKELY to assumeDualImpl Aligned with the measures we had in D124674, this condition seems to be unlikely. Nevertheless, I've made some new measurments with stats just for this, and data confirms this is indeed unlikely. Differential Revision: https://reviews.llvm.org/D127190	2022-06-07 12:48:48 +02:00
Gabor Marton	f66f4d3b07	[analyzer] Track assume call stack to detect fixpoint Assume functions might recurse (see `reAssume` or `tryRearrange`). During the recursion, the State might not change anymore, that means we reached a fixpoint. In this patch, we avoid infinite recursion of assume calls by checking already visited States on the stack of assume function calls. This patch renders the previous "workaround" solution (D47155) unnecessary. Note that this is not an NFC patch. If we were to limit the maximum stack depth of the assume calls to 1 then would it be equivalent with the previous solution in D47155. Additionally, in D113753, we simplify the symbols right at the beginning of evalBinOpNN. So, a call to `simplifySVal` in `getKnownValue` (added in D51252) is no longer needed. Fixes https://github.com/llvm/llvm-project/issues/55851 Differential Revision: https://reviews.llvm.org/D126560	2022-06-07 08:36:11 +02:00
Kazu Hirata	e0039b8d6a	Use llvm::less_second (NFC)	2022-06-04 22:48:32 -07:00
Kazu Hirata	4969a6924d	Use llvm::less_first (NFC)	2022-06-04 21:23:18 -07:00
Balazs Benics	7d24641f89	[llvm][analyzer][NFC] Introduce SFINAE for specializing FoldingSetTraits Reviewed By: martong Differential Revision: https://reviews.llvm.org/D126803	2022-06-02 19:46:38 +02:00
Balazs Benics	cf1f1b7240	[analyzer][NFC] Uplift checkers after D126801 Reviewed By: martong Differential Revision: https://reviews.llvm.org/D126802	2022-06-02 19:46:38 +02:00
Balazs Benics	33ca5a447e	[analyzer][NFC] Add partial specializations for ProgramStateTraits I'm also hoisting common code from the existing specializations into a common trait impl to reduce code duplication. Reviewed By: martong Differential Revision: https://reviews.llvm.org/D126801	2022-06-02 19:46:38 +02:00
Gabor Marton	81e44414aa	[analyzer][NFC] Move overconstrained check from reAssume to assumeDualImpl Depends on D126406. Checking of the overconstrained property is much better suited here. Differential Revision: https://reviews.llvm.org/D126707	2022-06-02 11:41:19 +02:00
Gabor Marton	160798ab9b	[analyzer] Handle SymbolCast in SValBuilder Make the SimpleSValBuilder to be able to look up and use a constraint for an operand of a SymbolCast, when the operand is constrained to a const value. This part of the SValBuilder is responsible for constant folding. We need this constant folding, so the engine can work with less symbols, this way it can be more efficient. Whenever a symbol is constrained with a constant then we substitute the symbol with the corresponding integer. If a symbol is constrained with a range, then the symbol is kept and we fall-back to use the range based constraint manager, which is not that efficient. This patch is the natural extension of the existing constant folding machinery with the support of SymbolCast symbols. Differential Revision: https://reviews.llvm.org/D126481	2022-06-01 08:42:04 +02:00
Balazs Benics	a73b50ad06	Revert "[llvm][clang][bolt][NFC] Use llvm::less_first() when applicable" This reverts commit `3988bd1398`. Did not build on this bot: https://lab.llvm.org/buildbot#builders/215/builds/6372 /usr/include/c++/9/bits/predefined_ops.h:177:11: error: no match for call to ‘(llvm::less_first) (std::pair<long unsigned int, llvm::bolt::BinaryBasicBlock>&, const std::pair<long unsigned int, std::nullptr_t>&)’ 177 \| { return bool(_M_comp(__it, __val)); }	2022-05-27 11:19:18 +02:00
Balazs Benics	3988bd1398	[llvm][clang][bolt][NFC] Use llvm::less_first() when applicable One could reuse this functor instead of rolling out your own version. There were a couple other cases where the code was similar, but not quite the same, such as it might have an assertion in the lambda or other constructs. Thus, I've not touched any of those, as it might change the behavior in some way. As per https://discourse.llvm.org/t/submitting-simple-nfc-patches/62640/3?u=steakhal Chris Lattner > LLVM intentionally has a “yes, you can apply common sense judgement to > things” policy when it comes to code review. If you are doing mechanical > patches (e.g. adopting less_first) that apply to the entire monorepo, > then you don’t need everyone in the monorepo to sign off on it. Having > some +1 validation from someone is useful, but you don’t need everyone > whose code you touch to weigh in. Differential Revision: https://reviews.llvm.org/D126068	2022-05-27 11:15:23 +02:00
Balazs Benics	f13050eca3	[analyzer][NFCi] Annotate major nonnull returning functions This patch annotates the most important analyzer function APIs. Also adds a couple of assertions for uncovering any potential issues earlier in the constructor; in those cases, the member functions were already dereferencing the members unconditionally anyway. Measurements showed no performance impact, nor crashes. Reviewed By: martong Differential Revision: https://reviews.llvm.org/D126198	2022-05-27 11:05:50 +02:00
Gabor Marton	6ab69efe61	[analyzer][NFC] Rename GREngine->CoreEngine, GRExprEngine->ExprEngine in comments and txt files fixes #115	2022-05-27 11:04:35 +02:00
Balazs Benics	3a666dd37a	[analyzer][NFC] Use MemRegion::getRegion()'s return value unconditionally Reviewed By: martong Differential Revision: https://reviews.llvm.org/D126123	2022-05-27 10:07:06 +02:00
Balazs Benics	813acb1297	[analyzer][NFC] Remove unused SVal::hasConjuredSymbol Reviewed By: martong Differential Revision: https://reviews.llvm.org/D126130	2022-05-27 10:07:06 +02:00
Balazs Benics	81066603a8	[analyzer][NFC] Remove unused nonloc::ConcreteInt::evalBinOp Reviewed By: martong Differential Revision: https://reviews.llvm.org/D126129	2022-05-27 10:07:06 +02:00
Balazs Benics	f6eab43764	[analyzer][NFC] Inline loc::ConcreteInt::evalBinOp This patch also refactored some of the enclosing parts. Reviewed By: martong Differential Revision: https://reviews.llvm.org/D126128	2022-05-27 10:07:06 +02:00
Balazs Benics	ee8987d585	[analyzer][NFC] Inline ExprEngine::evalMinus Reviewed By: martong Differential Revision: https://reviews.llvm.org/D126125	2022-05-27 10:07:06 +02:00
Balazs Benics	7a2d6dea73	[analyzer][NFC] Inline ExprEngine::evalComplement Reviewed By: martong Differential Revision: https://reviews.llvm.org/D126124	2022-05-27 10:07:06 +02:00
Gabor Marton	88abc50398	[analyzer][solver] Handle UnarySymExpr in RangeConstraintSolver Fixes https://github.com/llvm/llvm-project/issues/55241 Differential Revision: https://reviews.llvm.org/D125395	2022-05-26 14:09:46 +02:00
Gabor Marton	b5b2aec1ff	[analyzer] Add UnarySymExpr This patch adds a new descendant to the SymExpr hierarchy. This way, now we can assign constraints to symbolic unary expressions. Only the unary minus and bitwise negation are handled. Differential Revision: https://reviews.llvm.org/D125318	2022-05-26 14:00:27 +02:00
Gabor Marton	ca3d962548	[analyzer] Return from reAssume if State is posteriorly overconstrained Depends on D124758. That patch introduced serious regression in the run-time in some special cases. This fixes that. Differential Revision: https://reviews.llvm.org/D126406	2022-05-26 13:50:40 +02:00
Gabor Marton	f75bc5bfc8	[analyzer] Fix symbol simplification assertion failure Fixes https://github.com/llvm/llvm-project/issues/55546 The assertion mentioned in the issue is triggered because an inconsistency is formed in the Sym->Class and Class->Sym relations. A simpler but similar inconsistency is demonstrated here: https://reviews.llvm.org/D114887 . Previously in `removeMember`, we didn't remove the old symbol's Sym->Class relation. Back then, we explained it with the following two bullet points: > 1) This way constraints for the old symbol can still be found via it's > equivalence class that it used to be the member of. > 2) Performance and resource reasons. We can spare one removal and thus one > additional tree in the forest of `ClassMap`. This patch do remove the old symbol's Sym->Class relation in order to keep the Sym->Class relation consistent with the Class->Sym relations. Point 2) above has negligible performance impact, empirical measurements do not show any noticeable difference in the run-time. Point 1) above seems to be a not well justified statement. This is because we cannot create a new symbol that would be equal to the old symbol after the simplification had happened. The reason for this is that the SValBuilder uses the available constant constraints for each sub-symbol. Differential Revision: https://reviews.llvm.org/D126281	2022-05-25 10:55:50 +02:00
Gabor Marton	96fba640cf	[analyzer][NFC] Factor out the copy-paste code repetition of assumeDual and assumeInclusiveRangeDual Depends on D125892. There might be efficiency and performance implications by using a lambda. Thus, I am going to conduct measurements to see if there is any noticeable impact. I've been thinking about two more alternatives: 1) Make `assumeDualImpl` a variadic template and (perfect) forward the arguments for the used `assume` function. 2) Use a macros. I have concerns though, whether these alternatives would deteriorate the readability of the code. Differential Revision: https://reviews.llvm.org/D125954	2022-05-23 09:32:44 +02:00
Gabor Marton	32f189b0d9	[analyzer] Implement assumeInclusiveRange in terms of assumeInclusiveRangeDual Depends on D124758. This is the very same thing we have done for assumeDual, but this time we do it for assumeInclusiveRange. This patch is basically a no-brainer copy of that previous patch. Differential Revision: https://reviews.llvm.org/D125892	2022-05-23 09:32:44 +02:00
Jay Foad	6bec3e9303	[APInt] Remove all uses of zextOrSelf, sextOrSelf and truncOrSelf Most clients only used these methods because they wanted to be able to extend or truncate to the same bit width (which is a no-op). Now that the standard zext, sext and trunc allow this, there is no reason to use the OrSelf versions. The OrSelf versions additionally have the strange behaviour of allowing extending to a smaller width, or truncating to a larger width, which are also treated as no-ops. A small amount of client code relied on this (ConstantRange::castOp and MicrosoftCXXNameMangler::mangleNumber) and needed rewriting. Differential Revision: https://reviews.llvm.org/D125557	2022-05-19 11:23:13 +01:00
Usama Hameed	dd7233bc67	[Analyzer] Remove extra space from NSErrorChecker message. Differential Revision: https://reviews.llvm.org/D125840	2022-05-18 14:35:12 -07:00
Gabor Marton	56b9b97c1e	[clang][analyzer][ctu] Make CTU a two phase analysis This new CTU implementation is the natural extension of the normal single TU analysis. The approach consists of two analysis phases. During the first phase, we do a normal single TU analysis. During this phase, if we find a foreign function (that could be inlined from another TU) then we don’t inline that immediately, we rather mark that to be analysed later. When the first phase is finished then we start the second phase, the CTU phase. In this phase, we continue the analysis from that point (exploded node) which had been enqueued during the first phase. We gradually extend the exploded graph of the single TU analysis with the new node that was created by the inlining of the foreign function. We count the number of analysis steps of the first phase and we limit the second (ctu) phase with this number. This new implementation makes it convenient for the users to run the single-TU and the CTU analysis in one go, they don't need to run the two analysis separately. Thus, we name this new implementation as "onego" CTU. Discussion: https://discourse.llvm.org/t/rfc-much-faster-cross-translation-unit-ctu-analysis-implementation/61728 Differential Revision: https://reviews.llvm.org/D123773	2022-05-18 10:35:52 +02:00
Balazs Benics	a1025e6ffe	[analyzer] Introduce clang_analyzer_dumpSvalType introspection function In some rare cases the type of an SVal might be interesting. This introspection function exposes this information in tests. Reviewed By: martong Differential Revision: https://reviews.llvm.org/D125532	2022-05-13 17:07:58 +02:00
Balazs Benics	d5ffc1ed8b	[analyzer][NFC] Tighten some of the SValBuilder return types This is purely a cosmetic change. Reviewed By: martong Differential Revision: https://reviews.llvm.org/D125463	2022-05-13 17:04:34 +02:00
Endre Fülöp	094fb13b88	[analyzer] Add taint to the BoolAssignmentChecker BoolAssignment checker is now taint-aware and warns if a tainted value is assigned. Original author: steakhal Reviewed By: martong Differential Revision: https://reviews.llvm.org/D125360	2022-05-13 09:27:28 +02:00
Tomasz Kamiński	14742443a2	Reland "[analyzer] Canonicalize SymIntExpr so the RHS is positive when possible" This PR changes the `SymIntExpr` so the expression that uses a negative value as `RHS`, for example: `x +/- (-N)`, is modeled as `x -/+ N` instead. This avoids producing a very large `RHS` when the symbol is cased to an unsigned number, and as consequence makes the value more robust in presence of casts. Note that this change is not applied if `N` is the lowest negative value for which negation would not be representable. Reviewed By: steakhal Patch By: tomasz-kaminski-sonarsource! Differential Revision: https://reviews.llvm.org/D124658	2022-05-12 15:40:11 +02:00
Gabor Marton	34ac048aef	[analyzer] Replace adjacent assumeInBound calls to assumeInBoundDual This is to minimize superfluous assume calls. Depends on D124758 Differential Revision: https://reviews.llvm.org/D124761	2022-05-10 10:16:55 +02:00
Gabor Marton	1c1c1e25f9	[analyzer] Implement assume in terms of assumeDual Summary: By evaluating both children states, now we are capable of discovering infeasible parent states. In this patch, `assume` is implemented in the terms of `assumeDuali`. This might be suboptimal (e.g. where there are adjacent assume(true) and assume(false) calls, next patches addresses that). This patch fixes a real CRASH. Fixes https://github.com/llvm/llvm-project/issues/54272 Differential Revision: https://reviews.llvm.org/D124758	2022-05-10 10:16:55 +02:00
Gabor Marton	c4fa05f5f7	[analyzer] Indicate if a parent state is infeasible In some cases a parent State is already infeasible, but we recognize this only if an additonal constraint is added. This patch is the first of a series to address this issue. In this patch `assumeDual` is changed to clone the parent State but with an `Infeasible` flag set, and this infeasible-parent is returned both for the true and false case. Then when we add a new transition in the exploded graph and the destination is marked as infeasible, the node will be a sink node. Related bug: https://github.com/llvm/llvm-project/issues/50883 Actually, this patch does not solve that bug in the solver, rather with this patch we can handle the general parent-infeasible cases. Next step would be to change the State API and require all checkers to use the `assume*Dual` API and deprecate the simple `assume` calls. Hopefully, the next patch will introduce `assumeInBoundDual` and will solve the CRASH we have here: https://github.com/llvm/llvm-project/issues/54272 Differential Revision: https://reviews.llvm.org/D124674	2022-05-10 10:16:55 +02:00
Fred Tingaud	1ec1cdcfb4	[analyzer] Inline operator delete when MayInlineCXXAllocator is set. This patch restores the symmetry between how operator new and operator delete are handled by also inlining the content of operator delete when possible. Patch by Fred Tingaud. Reviewed By: martong Differential Revision: https://reviews.llvm.org/D124845	2022-05-09 15:44:33 +02:00
Balazs Benics	da5b5ae852	Revert "[analyzer] Canonicalize SymIntExpr so the RHS is positive when possible" It seems like multiple users are affected by a crash introduced by this commit, thus I'm reverting it for the time being. Read more about the found reproducers at Phabricator. Differential Revision: https://reviews.llvm.org/D124658 This reverts commit `f0d6cb4a5c`.	2022-05-06 12:13:51 +02:00
Brian Tracy	87a55137e2	Fix "the the" typo in documentation and user facing strings There are many more instances of this pattern, but I chose to limit this change to .rst files (docs), anything in libcxx/include, and string literals. These have the highest chance of being seen by end users. Reviewed By: #libc, Mordante, martong, ldionne Differential Revision: https://reviews.llvm.org/D124708	2022-05-05 17:52:08 +02:00
Tomasz Kamiński	f0d6cb4a5c	[analyzer] Canonicalize SymIntExpr so the RHS is positive when possible This PR changes the `SymIntExpr` so the expression that uses a negative value as `RHS`, for example: `x +/- (-N)`, is modeled as `x -/+ N` instead. This avoids producing a very large `RHS` when the symbol is cased to an unsigned number, and as consequence makes the value more robust in presence of casts. Note that this change is not applied if `N` is the lowest negative value for which negation would not be representable. Reviewed By: steakhal Patch By: tomasz-kaminski-sonarsource! Differential Revision: https://reviews.llvm.org/D124658	2022-05-05 17:48:49 +02:00
einvbri	df5801806d	[analyzer] Get direct binding for specific punned case Region store was not able to see through this case to the actual initialized value of STRUCT ff. This change addresses this case by getting the direct binding. This was found and debugged in a downstream compiler, with debug guidance from @steakhal. A positive and negative test case is added. The specific case where this issue was exposed. typedef struct { int a:1; int b[2]; } STRUCT; int main() { STRUCT ff = {0}; STRUCT* pff = &ff; int a = ((int)pff + 1); return a; } Reviewed By: steakhal, martong Differential Revision: https://reviews.llvm.org/D124349	2022-05-05 04:53:45 -05:00
Ali Shuja Siddiqui	cf7cd664f3	[analyzer] Check for std::__addressof for inner pointer checker This is an extension to diff D99260. This adds an additional exception for `std::__addressof` in `InnerPointerChecker`. Patch By alishuja (Ali Shuja Siddiqui)! Reviewed By: martong, alishuja Differential Revision: https://reviews.llvm.org/D109467	2022-05-03 14:05:19 +02:00
Marco Antognini	68ee5ec07d	[Analyzer] Fix assumptions about const field with member-initializer Essentially, having a default member initializer for a constant member does not necessarily imply the member will have the given default value. Remove part of `a2e053638b` ([analyzer] Treat more const variables and fields as known contants., 2018-05-04). Fix #47878 Reviewed By: r.stahl, steakhal Differential Revision: https://reviews.llvm.org/D124621	2022-05-03 11:27:45 +02:00
Marco Antognini	f34639828f	[Analyzer] Minor cleanups in StreamChecker Remove unnecessary conversion to Optional<> and incorrect assumption that BindExpr can return a null state. Reviewed By: steakhal Differential Revision: https://reviews.llvm.org/D124681	2022-05-02 17:50:10 +02:00
Marco Antognini	5a47accda8	[Analyzer] Fix clang::ento::taint::dumpTaint definition Ensure the definition is in the "taint" namespace, like its declaration. Reviewed By: steakhal Differential Revision: https://reviews.llvm.org/D124462	2022-05-02 17:44:06 +02:00
Balazs Benics	5ce7050f70	[analyzer] Allow exploded graph dumps in release builds Historically, exploded graph dumps were disabled in non-debug builds. It was done so probably because a regular user should not dump the internal representation of the analyzer anyway and the dump methods might introduce unnecessary binary size overhead. It turns out some of the users actually want to dump this. Note that e.g. `LiveExpressionsDumper`, `LiveVariablesDumper`, `ControlDependencyTreeDumper` etc. worked previously, and they are unaffected by this change. However, `CFGViewer` and `CFGDumper` still won't work for a similar reason. AFAIK only these two won't work after this change. Addresses #53873 --- baseline \| binary \| size \| size after strip \| \| clang \| 103M \| 83M \| \| clang-tidy \| 67M \| 54M \| after this change \| binary \| size \| size after strip \| \| clang \| 103M \| 84M \| \| clang-tidy \| 67M \| 54M \| CMake configuration: ``` cmake -S llvm -GNinja -DBUILD_SHARED_LIBS=OFF -DCMAKE_BUILD_TYPE=Release -DCMAKE_CXX_COMPILER=clang++ -DCMAKE_C_COMPILER=clang -DLLVM_ENABLE_ASSERTIONS=OFF -DLLVM_USE_LINKER=lld -DLLVM_ENABLE_DUMP=OFF -DLLVM_ENABLE_PROJECTS="clang;clang-tools-extra" -DLLVM_ENABLE_Z3_SOLVER=ON -DLLVM_TARGETS_TO_BUILD="X86" ``` Built by `clang-14.0.0`. Reviewed By: martong Differential Revision: https://reviews.llvm.org/D124442	2022-05-02 11:42:08 +02:00
Balazs Benics	fd7efe33f1	[analyzer] Fix cast evaluation on scoped enums in ExprEngine We ignored the cast if the enum was scoped. This is bad since there is no implicit conversion from the scoped enum to the corresponding underlying type. The fix is basically: isIntegralOrEnumerationType() -> isIntegralOrUnscopedEnumerationType() This materialized in crashes on analyzing the LLVM itself using the Z3 refutation. Refutation synthesized the given Z3 Binary expression (`BO_And` of `unsigned char` aka. 8 bits and an `int` 32 bits) with the wrong bitwidth in the end, which triggered an assert. Now, we evaluate the cast according to the standard. This bug could have been triggered using the Z3 CM according to https://bugs.llvm.org/show_bug.cgi?id=44030 Fixes #47570 #43375 Reviewed By: martong Differential Revision: https://reviews.llvm.org/D85528	2022-05-02 10:54:26 +02:00
Balazs Benics	5a2e595eb8	[analyzer] Fix Static Analyzer g_memdup false-positive `g_memdup()` allocates and copies memory, thus we should not assume that the returned memory region is uninitialized because it might not be the case. PS: It would be even better to copy the bindings to mimic the actual content of the buffer, but this works too. Fixes #53617 Reviewed By: martong Differential Revision: https://reviews.llvm.org/D124436	2022-05-02 10:35:51 +02:00
Andrew Ng	57c55165eb	[analyzer] Fix return of llvm::StringRef to destroyed std::string This issue was discovered whilst testing with ASAN. Differential Revision: https://reviews.llvm.org/D124683	2022-05-01 12:24:32 +01:00
Artem Dergachev	f68c0a2f58	[analyzer] Add path note tags to standard library function summaries. The patch is straightforward except the tiny fix in BugReporterVisitors.cpp that suppresses a default note for "Assuming pointer value is null" when a note tag from the checker is present. This is probably the right thing to do but also definitely not a complete solution to the problem of different sources of path notes being unaware of each other, which is a large and annoying issue that we have to deal with. Note tags really help there because they're nicely introspectable. The problem is demonstrated by the newly added getenv() test. Differential Revision: https://reviews.llvm.org/D122285	2022-04-28 17:17:05 -07:00
Balazs Benics	be744da01f	[analyzer] Fix ValistChecker false-positive involving symbolic pointers In the following example: int va_list_get_int(va_list va) { return va_arg(va, int); // FP } The `*va` expression will be something like `Element{SymRegion{va}, 0, va_list}`. We use `ElementRegions` for representing the result of the dereference. In this case, the `IsSymbolic` was set to `false` in the `getVAListAsRegion()`. Hence, before checking if the memregion is a SymRegion, we should take the base of that region. Analogously to the previous example, one can craft other cases: struct MyVaList { va_list l; }; int va_list_get_int(struct MyVaList va) { return va_arg(va.l, int); // FP } But it would also work if the `va_list` would be in the base or derived part of a class. `ObjCIvarRegions` are likely also susceptible. I'm not explicitly demonstrating these cases. PS: Check the `MemRegion::getBaseRegion()` definition. Fixes #55009 Reviewed By: xazax.hun Differential Revision: https://reviews.llvm.org/D124239	2022-04-26 08:49:05 +02:00
Vince Bridgers	3566bbe62f	[analyzer] Add option for AddrSpace in core.NullDereference check This change adds an option to detect all null dereferences for non-default address spaces, except for address spaces 256, 257 and 258. Those address spaces are special since null dereferences are not errors. All address spaces can be considered (except for 256, 257, and 258) by using -analyzer-config core.NullDereference:DetectAllNullDereferences=true. This option is false by default, retaining the original behavior. A LIT test was enhanced to cover this case, and the rst documentation was updated to describe this behavior. Reviewed By: steakhal Differential Revision: https://reviews.llvm.org/D122841	2022-04-24 03:51:49 -05:00
Vince Bridgers	5114db933d	[analyzer] Clean checker options from bool to DefaultBool (NFC) A recent review emphasized the preference to use DefaultBool instead of bool for checker options. This change is a NFC and cleans up some of the instances where bool was used, and could be changed to DefaultBool. Reviewed By: steakhal Differential Revision: https://reviews.llvm.org/D123464	2022-04-23 14:47:29 -05:00
Nathan James	cfb8169059	[clang] Add a raw_ostream operator<< overload for QualType Under the hood this prints the same as `QualType::getAsString()` but cuts out the middle-man when that string is sent to another raw_ostream. Also cleaned up all the call sites where this occurs. Reviewed By: aaron.ballman Differential Revision: https://reviews.llvm.org/D123926	2022-04-20 22:09:05 +01:00
Aaron Ballman	9955f14aaf	[C2x] Disallow functions without prototypes/functions with identifier lists WG14 has elected to remove support for K&R C functions in C2x. The feature was introduced into C89 already deprecated, so after this long of a deprecation period, the committee has made an empty parameter list mean the same thing in C as it means in C++: the function accepts no arguments exactly as if the function were written with (void) as the parameter list. This patch implements WG14 N2841 No function declarators without prototypes (http://www.open-std.org/jtc1/sc22/wg14/www/docs/n2841.htm) and WG14 N2432 Remove support for function definitions with identifier lists (http://www.open-std.org/jtc1/sc22/wg14/www/docs/n2432.pdf). It also adds The -fno-knr-functions command line option to opt into this behavior in other language modes. Differential Revision: https://reviews.llvm.org/D123955	2022-04-20 13:28:15 -04:00
Denys Petrov	e37726beb2	[analyzer] Implemented RangeSet::Factory::castTo function to perform promotions, truncations and conversions. Summary: Handle casts for ranges working similarly to APSIntType::apply function but for the whole range set. Support promotions, truncations and conversions. Example: promotion: char [0, 42] -> short [0, 42] -> int [0, 42] -> llong [0, 42] truncation: llong [4295033088, 4295033130] -> int [65792, 65834] -> short [256, 298] -> char [0, 42] conversion: char [-42, 42] -> uint [0, 42]U[4294967254, 4294967295] -> short[-42, 42] Differential Revision: https://reviews.llvm.org/D103094	2022-04-19 22:34:03 +03:00
Tom Ritter	82f3ed9904	[analyzer] Expose Taint.h to plugins Reviewed By: NoQ, xazax.hun, steakhal Differential Revision: https://reviews.llvm.org/D123155	2022-04-19 16:55:01 +02:00
Kristóf Umann	fd8e5762f8	[analyzer] Don't track function calls as control dependencies I recently evaluated ~150 of bug reports on open source projects relating to my GSoC'19 project, which was about tracking control dependencies that were relevant to a bug report. Here is what I found: when the condition is a function call, the extra notes were almost always unimportant, and often times intrusive: void f(int x) { x = nullptr; if (alwaysTrue()) // We don't need a whole lot of explanation // here, the function name is good enough. x = 5; } It almost always boiled down to a few "Returning null pointer, which participates in a condition later", or similar notes. I struggled to find a single case where the notes revealed anything interesting or some previously hidden correlation, which is kind of the point of condition tracking. This patch checks whether the condition is a function call, and if so, bails out. The argument against the patch is the popular feedback we hear from some of our users, namely that they can never have too much information. I was specifically fishing for examples that display best that my contribution did more good than harm, so admittedly I set the bar high, and one can argue that there can be non-trivial trickery inside functions, and function names may not be that descriptive. My argument for the patch is all those reports that got longer without any notable improvement in the report intelligibility. I think the few exceptional cases where this patch would remove notable information are an acceptable sacrifice in favor of more reports being leaner. Differential Revision: https://reviews.llvm.org/D116597	2022-04-08 10:16:58 +02:00
Gabor Marton	e63b81d10e	[analyzer][ctu] Only import const and trivial VarDecls Do import the definition of objects from a foreign translation unit if that's type is const and trivial. Differential Revision: https://reviews.llvm.org/D122805	2022-04-01 13:49:39 +02:00
Vince Bridgers	4d5b824e3d	[analyzer] Avoid checking addrspace pointers in cstring checker This change fixes an assert that occurs in the SMT layer when refuting a finding that uses pointers of two different sizes. This was found in a downstream build that supports two different pointer sizes, The CString Checker was attempting to compute an overlap for the 'to' and 'from' pointers, where the pointers were of different sizes. In the downstream case where this was found, a specialized memcpy routine patterned after memcpy_special is used. The analyzer core hits on this builtin because it matches the 'memcpy' portion of that builtin. This cannot be duplicated in the upstream test since there are no specialized builtins that match that pattern, but the case does reproduce in the accompanying LIT test case. The amdgcn target was used for this reproducer. See the documentation for AMDGPU address spaces here https://llvm.org/docs/AMDGPUUsage.html#address-spaces. The assert seen is: `Solver->getSort(LHS) == Solver->getSort(RHS) && "AST's must have the same sort!"' Ack to steakhal for reviewing the fix, and creating the test case. Reviewed By: steakhal Differential Revision: https://reviews.llvm.org/D118050	2022-03-31 17:34:56 +02:00
Vince Bridgers	fe8b2236ef	[analyzer] Fix "RhsLoc and LhsLoc bitwidth must be same" clang: <root>/clang/lib/StaticAnalyzer/Core/SimpleSValBuilder.cpp:727: void assertEqualBitWidths(clang::ento::ProgramStateRef, clang::ento::Loc, clang::ento::Loc): Assertion `RhsBitwidth == LhsBitwidth && "RhsLoc and LhsLoc bitwidth must be same!"' This change adjusts the bitwidth of the smaller operand for an evalBinOp as a result of a comparison operation. This can occur in the specific case represented by the test cases for a target with different pointer sizes. Reviewed By: NoQ Differential Revision: https://reviews.llvm.org/D122513	2022-03-29 17:08:19 -05:00
Mike Rice	f82ec5532b	[OpenMP] Initial parsing/sema for the 'omp target parallel loop' construct Adds basic parsing/sema/serialization support for the #pragma omp target parallel loop directive. Differential Revision: https://reviews.llvm.org/D122359	2022-03-24 09:19:00 -07:00
Vince Bridgers	9ef7ac51af	[analyzer] Fix crash in RangedConstraintManager.cpp This change fixes a crash in RangedConstraintManager.cpp:assumeSym due to an unhandled BO_Div case. clang: <root>clang/lib/StaticAnalyzer/Core/RangedConstraintManager.cpp:51: virtual clang::ento::ProgramStateRef clang::ento::RangedConstraintManager::assumeSym(clang::ento::ProgramStateRef, clang::ento::SymbolRef, bool): Assertion `BinaryOperator::isComparisonOp(Op)' failed. Reviewed By: NoQ Differential Revision: https://reviews.llvm.org/D122277	2022-03-23 08:26:40 -05:00
Vince Bridgers	5fdc4dd777	[analyzer] refactor makeIntValWithPtrWidth, remove getZeroWithPtrWidth (NFC) This is a NFC refactoring to change makeIntValWithPtrWidth and remove getZeroWithPtrWidth to use types when forming values to match pointer widths. Some targets may have different pointer widths depending upon address space, so this needs to be comprehended. Reviewed By: steakhal Differential Revision: https://reviews.llvm.org/D120134	2022-03-23 08:26:37 -05:00
Mike Rice	2cedaee6f7	[OpenMP] Initial parsing/sema for the 'omp parallel loop' construct Adds basic parsing/sema/serialization support for the #pragma omp parallel loop directive. Differential Revision: https://reviews.llvm.org/D122247	2022-03-22 13:55:47 -07:00
Vince Bridgers	985888411d	[analyzer] Refactor makeNull to makeNullWithWidth (NFC) Usages of makeNull need to be deprecated in favor of makeNullWithWidth for architectures where the pointer size should not be assumed. This can occur when pointer sizes can be of different sizes, depending on address space for example. See https://reviews.llvm.org/D118050 as an example. This was uncovered initially in a downstream compiler project, and tested through those systems tests. steakhal performed systems testing across a large set of open source projects. Co-authored-by: steakhal Resolves: https://github.com/llvm/llvm-project/issues/53664 Reviewed By: NoQ, steakhal Differential Revision: https://reviews.llvm.org/D119601	2022-03-22 07:35:13 -05:00
Mike Rice	6bd8dc91b8	[OpenMP] Initial parsing/sema for the 'omp target teams loop' construct Adds basic parsing/sema/serialization support for the #pragma omp target teams loop directive. Differential Revision: https://reviews.llvm.org/D122028	2022-03-18 13:48:32 -07:00
Mike Rice	79f661edc1	[OpenMP] Initial parsing/sema for the 'omp teams loop' construct Adds basic parsing/sema/serialization support for the #pragma omp teams loop directive. Differential Revision: https://reviews.llvm.org/D121713	2022-03-16 14:39:18 -07:00
phyBrackets	90a6e35478	[analyzer][NFC] Merge similar conditional paths Reviewed By: aaron.ballman, steakhal Differential Revision: https://reviews.llvm.org/D121045	2022-03-07 22:05:27 +05:30
Endre Fülöp	4fd6c6e65a	[analyzer] Add more propagations to Taint analysis Add more functions as taint propators to GenericTaintChecker. Reviewed By: steakhal Differential Revision: https://reviews.llvm.org/D120369	2022-03-07 13:18:54 +01:00
Shivam	56eaf869be	[analyzer] Done some changes to detect Uninitialized read by the char array manipulation functions Few weeks back I was experimenting with reading the uninitialized values from src , which is actually a bug but the CSA seems to give up at that point . I was curious about that and I pinged @steakhal on the discord and according to him this seems to be a genuine issue and needs to be fix. So I goes with fixing this bug and thanks to @steakhal who help me creating this patch. This feature seems to break some tests but this was the genuine problem and the broken tests also needs to fix in certain manner. I add a test but yeah we need more tests,I'll try to add more tests.Thanks Reviewed By: steakhal, NoQ Differential Revision: https://reviews.llvm.org/D120489	2022-03-04 00:21:06 +05:30
Shivam	bd1917c88a	[analyzer] Done some changes to detect Uninitialized read by the char array manipulation functions Few weeks back I was experimenting with reading the uninitialized values from src , which is actually a bug but the CSA seems to give up at that point . I was curious about that and I pinged @steakhal on the discord and according to him this seems to be a genuine issue and needs to be fix. So I goes with fixing this bug and thanks to @steakhal who help me creating this patch. This feature seems to break some tests but this was the genuine problem and the broken tests also needs to fix in certain manner. I add a test but yeah we need more tests,I'll try to add more tests.Thanks Reviewed By: steakhal, NoQ Differential Revision: https://reviews.llvm.org/D120489	2022-03-03 23:21:26 +05:30
Kristóf Umann	d832078904	[analyzer] Improve NoOwnershipChangeVisitor's understanding of deallocators The problem with leak bug reports is that the most interesting event in the code is likely the one that did not happen -- lack of ownership change and lack of deallocation, which is often present within the same function that the analyzer inlined anyway, but not on the path of execution on which the bug occured. We struggle to understand that a function was responsible for freeing the memory, but failed. D105819 added a new visitor to improve memory leak bug reports. In addition to inspecting the ExplodedNodes of the bug pat, the visitor tries to guess whether the function was supposed to free memory, but failed to. Initially (in D108753), this was done by checking whether a CXXDeleteExpr is present in the function. If so, we assume that the function was at least party responsible, and prevent the analyzer from pruning bug report notes in it. This patch improves this heuristic by recognizing all deallocator functions that MallocChecker itself recognizes, by reusing MallocChecker::isFreeingCall. Differential Revision: https://reviews.llvm.org/D118880	2022-03-03 11:27:56 +01:00
Simon Pilgrim	ca94f28d15	[clang] ExprEngine::VisitCXXNewExpr - remove superfluous nullptr tests FD has already been dereferenced	2022-03-02 15:59:10 +00:00
Kristóf Umann	32ac21d049	[NFC][analyzer] Allow CallDescriptions to be matched with CallExprs Since CallDescriptions can only be matched against CallEvents that are created during symbolic execution, it was not possible to use it in syntactic-only contexts. For example, even though InnerPointerChecker can check with its set of CallDescriptions whether a function call is interested during analysis, its unable to check without hassle whether a non-analyzer piece of code also calls such a function. The patch adds the ability to use CallDescriptions in syntactic contexts as well. While we already have that in Signature, we still want to leverage the ability to use dynamic information when we have it (function pointers, for example). This could be done with Signature as well (StdLibraryFunctionsChecker does it), but it makes it even less of a drop-in replacement. Differential Revision: https://reviews.llvm.org/D119004	2022-03-01 17:13:04 +01:00
Balázs Kéri	d8a2afb244	[clang][analyzer] Add modeling of 'errno'. Add a checker to maintain the system-defined value 'errno'. The value is supposed to be set in the future by existing or new checkers that evaluate errno-modifying function calls. Reviewed By: NoQ, steakhal Differential Revision: https://reviews.llvm.org/D120310	2022-03-01 08:20:33 +01:00
Dawid Jurczak	b3e2dac27c	[NFC] Don't pass temporary LangOptions to Lexer Since https://reviews.llvm.org/D120334 we shouldn't pass temporary LangOptions to Lexer. This change fixes stack-use-after-scope UB in LocalizationChecker found by sanitizer-x86_64-linux-fast buildbot and resolve similar issue in HeaderIncludes.	2022-02-28 20:43:28 +01:00
Endre Fülöp	34a7387986	[analyzer] Add more sources to Taint analysis Add more functions as taint sources to GenericTaintChecker. Reviewed By: steakhal Differential Revision: https://reviews.llvm.org/D120236	2022-02-28 11:33:02 +01:00
Aaron Ballman	f9e8e92cf5	Revert "[clang][analyzer] Add modeling of 'errno'." This reverts commit `29b512ba32`. This broke several build bots: https://lab.llvm.org/buildbot/#/builders/86/builds/30183 https://lab.llvm.org/buildbot/#/builders/216/builds/488	2022-02-25 07:21:01 -05:00
Balázs Kéri	29b512ba32	[clang][analyzer] Add modeling of 'errno'. Add a checker to maintain the system-defined value 'errno'. The value is supposed to be set in the future by existing or new checkers that evaluate errno-modifying function calls. Reviewed By: NoQ, steakhal Differential Revision: https://reviews.llvm.org/D120310	2022-02-25 12:42:55 +01:00
Fangrui Song	ecff9b65b5	[analyzer] Just use default capture after `7fd60ee6e0`	2022-02-24 10:06:11 -08:00
Fangrui Song	7fd60ee6e0	[analyzer] Fix -Wunused-lambda-capture in -DLLVM_ENABLE_ASSERTIONS=off builds	2022-02-24 00:13:13 -08:00
Balazs Benics	7036413dc2	Revert "Revert "[analyzer] Fix taint rule of fgets and setproctitle_init"" This reverts commit `2acead35c1`. Let's try `REQUIRES: asserts`.	2022-02-23 12:55:31 +01:00
Balazs Benics	a848a5cf2f	Revert "Revert "[analyzer] Fix taint propagation by remembering to the location context"" This reverts commit `d16c5f4192`. Let's try `REQUIRES: asserts`.	2022-02-23 12:53:07 +01:00
Balazs Benics	fa0a80e017	Revert "Revert "[analyzer] Add failing test case demonstrating buggy taint propagation"" This reverts commit `b8ae323cca`. Let's try `REQUIRES: asserts`.	2022-02-23 10:48:06 +01:00
Artem Dergachev	e0e174845b	[analyzer] Fix a crash in NoStateChangeVisitor with body-farmed stack frames. LocationContext::getDecl() isn't useful for obtaining the "farmed" body because the (synthetic) body statement isn't actually attached to the (natural-grown) declaration in the AST. Differential Revision: https://reviews.llvm.org/D119509	2022-02-17 10:13:34 -08:00
Balazs Benics	b3c0014e5a	Revert "Revert "[analyzer] Prevent misuses of -analyze-function"" This reverts commit `620d99b7ed`. Let's see if removing the two offending RUN lines makes this patch pass. Not ideal to drop tests but, it's just a debugging feature, probably not that important.	2022-02-16 10:33:21 +01:00
Balazs Benics	b8ae323cca	Revert "[analyzer] Add failing test case demonstrating buggy taint propagation" This reverts commit `744745ae19`. I'm reverting this since this patch caused a build breakage. https://lab.llvm.org/buildbot/#/builders/91/builds/3818	2022-02-14 18:45:46 +01:00
Balazs Benics	d16c5f4192	Revert "[analyzer] Fix taint propagation by remembering to the location context" This reverts commit `b099e1e562`. I'm reverting this since the head of the patch stack caused a build breakage. https://lab.llvm.org/buildbot/#/builders/91/builds/3818	2022-02-14 18:45:46 +01:00
Balazs Benics	2acead35c1	Revert "[analyzer] Fix taint rule of fgets and setproctitle_init" This reverts commit `bf5963bf19`. I'm reverting this since the head of the patch stack caused a build breakage. https://lab.llvm.org/buildbot/#/builders/91/builds/3818	2022-02-14 18:45:46 +01:00
Balazs Benics	bf5963bf19	[analyzer] Fix taint rule of fgets and setproctitle_init There was a typo in the rule. `{{0}, ReturnValueIndex}` meant that the discrete index is `0` and the variadic index is `-1`. What we wanted instead is that both `0` and `-1` are in the discrete index list. Instead of this, we wanted to express that both `0` and the `ReturnValueIndex` is in the discrete arg list. The manual inspection revealed that `setproctitle_init` also suffered a probably incomplete propagation rule. Reviewed By: Szelethus, gamesh411 Differential Revision: https://reviews.llvm.org/D119129	2022-02-14 16:55:55 +01:00
Balazs Benics	b099e1e562	[analyzer] Fix taint propagation by remembering to the location context Fixes the issue D118987 by mapping the propagation to the callsite's LocationContext. This way we can keep track of the in-flight propagations. Note that empty propagation sets won't be inserted. Reviewed By: NoQ, Szelethus Differential Revision: https://reviews.llvm.org/D119128	2022-02-14 16:55:55 +01:00
Balazs Benics	744745ae19	[analyzer] Add failing test case demonstrating buggy taint propagation Recently we uncovered a serious bug in the `GenericTaintChecker`. It was already flawed before D116025, but that was the patch that turned this silent bug into a crash. It happens if the `GenericTaintChecker` has a rule for a function, which also has a definition. char fgets(char s, int n, FILE fp) { nested_call(); // no parameters! return (char )0; } // Within some function: fgets(..., tainted_fd); When the engine inlines the definition and finds a function call within that, the `PostCall` event for the call will get triggered sooner than the `PostCall` for the original function. This mismatch violates the assumption of the `GenericTaintChecker` which wants to propagate taint information from the `PreCall` event to the `PostCall` event, where it can actually bind taint to the return value of the same call. Let's get back to the example and go through step-by-step. The `GenericTaintChecker` will see the `PreCall<fgets(..., tainted_fd)>` event, so it would 'remember' that it needs to taint the return value and the buffer, from the `PostCall` handler, where it has access to the return value symbol. However, the engine will inline fgets and the `nested_call()` gets evaluated subsequently, which produces an unimportant `PreCall<nested_call()>`, then a `PostCall<nested_call()>` event, which is observed by the `GenericTaintChecker`, which will unconditionally mark tainted the 'remembered' arg indexes, trying to access a non-existing argument, resulting in a crash. If it doesn't crash, it will behave completely unintuitively, by marking completely unrelated memory regions tainted, which is even worse. The resulting assertion is something like this: Expr.h: const Expr *CallExpr::getArg(unsigned int) const: Assertion `Arg < getNumArgs() && "Arg access out of range!"' failed. The gist of the backtrace: CallExpr::getArg(unsigned int) const SimpleFunctionCall::getArgExpr(unsigned int) CallEvent::getArgSVal(unsigned int) const GenericTaintChecker::checkPostCall(const CallEvent &, CheckerContext&) const Prior to D116025, there was a check for the argument count before it applied taint, however, it still suffered from the same underlying issue/bug regarding propagation. This path does not intend to fix the bug, rather start a discussion on how to fix this. --- Let me elaborate on how I see this problem. This pre-call, post-call juggling is just a workaround. The engine should by itself propagate taint where necessary right where it invalidates regions. For the tracked values, which potentially escape, we need to erase the information we know about them; and this is exactly what is done by invalidation. However, in the case of taint, we basically want to approximate from the opposite side of the spectrum. We want to preserve taint in most cases, rather than cleansing them. Now, we basically sanitize all escaping tainted regions implicitly, since invalidation binds a fresh conjured symbol for the given region, and that has not been associated with taint. IMO this is a bad default behavior, we should be more aggressive about preserving taint if not further spreading taint to the reachable regions. We have a couple of options for dealing with it (let's call it //tainting policy//): 1) Taint only the parameters which were tainted prior to the call. 2) Taint the return value of the call, since it likely depends on the tainted input - if any arguments were tainted. 3) Taint all escaped regions - (maybe transitively using the cluster algorithm) - if any arguments were tainted. 4) Not taint anything - this is what we do right now :D The `ExprEngine` should not deal with taint on its own. It should be done by a checker, such as the `GenericTaintChecker`. However, the `Pre`-`PostCall` checker callbacks are not designed for this. `RegionChanges` would be a much better fit for modeling taint propagation. What we would need in the `RegionChanges` callback is the `State` prior invalidation, the `State` after the invalidation, and a `CheckerContext` in which the checker can create transitions, where it would place `NoteTags` for the modeled taint propagations and report errors if a taint sink rule gets violated. In this callback, we could query from the prior State, if the given value was tainted; then act and taint if necessary according to the checker's tainting policy. By using RegionChanges for this, we would 'fix' the mentioned propagation bug 'by-design'. Reviewed By: Szelethus Differential Revision: https://reviews.llvm.org/D118987	2022-02-14 16:55:55 +01:00
phyBrackets	6745b6a0f1	[analyzer][NFCi] Use the correct BugType in CStringChecker. There is different bug types for different types of bugs but the emitAdditionOverflowbug seems to use bugtype BT_NotCSting but actually it have to use BT_AdditionOverflow . Reviewed By: steakhal Differential Revision: https://reviews.llvm.org/D119462	2022-02-14 20:54:59 +05:30
Balazs Benics	abc873694f	[analyzer] Restrict CallDescription fuzzy builtin matching `CallDescriptions` for builtin functions relaxes the match rules somewhat, so that the `CallDescription` will match for calls that have some prefix or suffix. This was achieved by doing a `StringRef::contains()`. However, this is somewhat problematic for builtins that are substrings of each other. Consider the following: `CallDescription{ builtin, "memcpy"}` will match for `__builtin_wmemcpy()` calls, which is unfortunate. This patch addresses/works around the issue by checking if the characters around the function's name are not part of the 'name' semantically. In other words, to accept a match for `"memcpy"` the call should not have alphanumeric (`[a-zA-Z]`) characters around the 'match'. So, `CallDescription{ builtin, "memcpy"}` will not match on: - `__builtin_wmemcpy: there is a `w` alphanumeric character before the match. - `__builtin_memcpyFOoBar_inline`: there is a `F` character after the match. - `__builtin_memcpyX_inline`: there is an `X` character after the match. But it will still match for: - `memcpy`: exact match - `__builtin_memcpy`: there is an _ before the match - `__builtin_memcpy_inline`: there is an _ after the match - `memcpy_inline_builtinFooBar`: there is an _ after the match Reviewed By: NoQ Differential Revision: https://reviews.llvm.org/D118388	2022-02-11 10:45:18 +01:00
Sylvestre Ledru	f2c2e924e7	Fix a typo (occured => occurred) Reported: https://bugs.debian.org/cgi-bin/bugreport.cgi?bug=1005195	2022-02-08 21:35:26 +01:00
Balazs Benics	620d99b7ed	Revert "[analyzer] Prevent misuses of -analyze-function" This reverts commit `841817b1ed`. Ah, it still fails on build bots for some reason. Pinning the target triple was not enough.	2022-02-08 17:42:46 +01:00
Balazs Benics	841817b1ed	[analyzer] Prevent misuses of -analyze-function Sometimes when I pass the mentioned option I forget about passing the parameter list for c++ sources. It would be also useful newcomers to learn about this. This patch introduces some logic checking common misuses involving `-analyze-function`. Reviewed-By: martong Differential Revision: https://reviews.llvm.org/D118690	2022-02-08 17:27:57 +01:00
Jun Zhang	65adf7c211	[NFC][Analyzer] Use range based for loop. Use range base loop loop to improve code readability. Differential Revision: https://reviews.llvm.org/D119103	2022-02-07 15:45:58 +08:00
Rashmi Mudduluru	faabdfcf7f	[analyzer] Add support for __attribute__((returns_nonnull)). Differential Revision: https://reviews.llvm.org/D118657	2022-02-02 11:46:52 -08:00
Balazs Benics	e99abc5d8a	Revert "[analyzer] Prevent misuses of -analyze-function" This reverts commit `9d6a615973`. Exit Code: 1 Command Output (stderr): -- /scratch/buildbot/bothome/clang-ve-ninja/llvm-project/clang/test/Analysis/analyze-function-guide.cpp:53:21: error: CHECK-EMPTY-NOT: excluded string found in input // CHECK-EMPTY-NOT: Every top-level function was skipped. ^ <stdin>:1:1: note: found here Every top-level function was skipped. ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ Input file: <stdin> Check file: /scratch/buildbot/bothome/clang-ve-ninja/llvm-project/clang/test/Analysis/analyze-function-guide.cpp -dump-input=help explains the following input dump. Input was: <<<<<< 1: Every top-level function was skipped. not:53 !~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ error: no match expected 2: Pass the -analyzer-display-progress for tracking which functions are analyzed. >>>>>>	2022-02-02 11:44:27 +01:00
Balazs Benics	9d6a615973	[analyzer] Prevent misuses of -analyze-function Sometimes when I pass the mentioned option I forget about passing the parameter list for c++ sources. It would be also useful newcomers to learn about this. This patch introduces some logic checking common misuses involving `-analyze-function`. Reviewed-By: martong Differential Revision: https://reviews.llvm.org/D118690	2022-02-02 11:31:22 +01:00
Tres Popp	262cc74e0b	Fix pair construction with an implicit constructor inside.	2022-01-18 18:01:52 +01:00
Endre Fülöp	17f74240e6	[analyzer][NFC] Refactor GenericTaintChecker to use CallDescriptionMap GenericTaintChecker now uses CallDescriptionMap to describe the possible operation in code which trigger the introduction (sources), the removal (filters), the passing along (propagations) and detection (sinks) of tainted values. Reviewed By: steakhal, NoQ Differential Revision: https://reviews.llvm.org/D116025	2022-01-18 16:04:04 +01:00
Denys Petrov	d835dd4cf5	[analyzer] Produce SymbolCast symbols for integral types in SValBuilder::evalCast Summary: Produce SymbolCast for integral types in `evalCast` function. Apply several simplification techniques while producing the symbols. Added a boolean option `handle-integral-cast-for-ranges` under `-analyzer-config` flag. Disabled the feature by default. Differential Revision: https://reviews.llvm.org/D105340	2022-01-18 16:08:04 +02:00
Kazu Hirata	17d4bd3d78	[clang] Fix bugprone argument comments (NFC) Identified with bugprone-argument-comment.	2022-01-09 00:19:49 -08:00
Kazu Hirata	40446663c7	[clang] Use true/false instead of 1/0 (NFC) Identified with modernize-use-bool-literals.	2022-01-09 00:19:47 -08:00
Kazu Hirata	d1b127b5b7	[clang] Remove unused forward declarations (NFC)	2022-01-08 11:56:40 -08:00
Qiu Chaofan	c2cc70e4f5	[NFC] Fix endif comments to match with include guard	2022-01-07 15:52:59 +08:00
Kazu Hirata	d677a7cb05	[clang] Remove redundant member initialization (NFC) Identified with readability-redundant-member-init.	2022-01-02 10:20:23 -08:00
Kazu Hirata	298367ee6e	[clang] Use nullptr instead of 0 or NULL (NFC) Identified with modernize-use-nullptr.	2021-12-29 08:34:20 -08:00
Kazu Hirata	6c335b1a45	[clang] Remove unused "using" (NFC) Identified by misc-unused-using-decls.	2021-12-27 20:48:21 -08:00
Kazu Hirata	0542d15211	Remove redundant string initialization (NFC) Identified with readability-redundant-string-init.	2021-12-26 09:39:26 -08:00
Kazu Hirata	34558b039b	[StaticAnalyzer] Remove redundant declaration isStdSmartPtr (NFC) An identical declaration is present just a couple of lines above the line being removed in this patch. Identified with readability-redundant-declaration.	2021-12-25 00:35:41 -08:00
Sami Tolvanen	ec2e26eaf6	[Clang] Add __builtin_function_start Control-Flow Integrity (CFI) replaces references to address-taken functions with pointers to the CFI jump table. This is a problem for low-level code, such as operating system kernels, which may need the address of an actual function body without the jump table indirection. This change adds the __builtin_function_start() builtin, which accepts an argument that can be constant-evaluated to a function, and returns the address of the function body. Link: https://github.com/ClangBuiltLinux/linux/issues/1353 Depends on D108478 Reviewed By: pcc, rjmccall Differential Revision: https://reviews.llvm.org/D108479	2021-12-20 12:55:33 -08:00
Kazu Hirata	713ee230f8	[clang] Use llvm::reverse (NFC)	2021-12-17 16:51:42 -08:00
Denys Petrov	da8bd972a3	[analyzer][NFC] Change return value of StoreManager::attemptDownCast function from SVal to Optional<SVal> Summary: Refactor return value of `StoreManager::attemptDownCast` function by removing the last parameter `bool &Failed` and replace the return value `SVal` with `Optional<SVal>`. Make the function consistent with the family of `evalDerivedToBase` by renaming it to `evalBaseToDerived`. Aligned the code on the call side with these changes. Differential Revision: https://reviews.llvm.org/	2021-12-17 13:03:47 +02:00
Gabor Marton	bd9e23943a	[analyzer] Expand conversion check to check more expressions for overflow and underflow This expands checking for more expressions. This will check underflow and loss of precision when using call expressions like: void foo(unsigned); int i = -1; foo(i); This also includes other expressions as well, so it can catch negative indices to std::vector since it uses unsigned integers for [] and .at() function. Patch by: @pfultz2 Differential Revision: https://reviews.llvm.org/D46081	2021-12-15 11:41:34 +01:00
Denys Petrov	6a399bf4b3	[analyzer] Implemented RangeSet::Factory::unite function to handle intersections and adjacency Summary: Handle intersected and adjacent ranges uniting them into a single one. Example: intersection [0, 10] U [5, 20] = [0, 20] adjacency [0, 10] U [11, 20] = [0, 20] Differential Revision: https://reviews.llvm.org/D99797	2021-12-10 18:48:02 +02:00
Logan Smith	715c72b4fb	[NFC][analyzer] Return underlying strings directly instead of OS.str() This avoids an unnecessary copy required by 'return OS.str()', allowing instead for NRVO or implicit move. The .str() call (which flushes the stream) is no longer required since `65b13610a5`, which made raw_string_ostream unbuffered by default. Differential Revision: https://reviews.llvm.org/D115374	2021-12-09 16:05:46 -08:00
Gabor Marton	978431e80b	[Analyzer] SValBuilder: Simlify a SymExpr to the absolute simplest form Move the SymExpr simplification fixpoint logic into SValBuilder. Differential Revision: https://reviews.llvm.org/D114938	2021-12-07 10:02:32 +01:00
Balazs Benics	a6816b957d	[analyzer][solver] Fix assertion on (NonLoc, Op, Loc) expressions Previously, the `SValBuilder` could not encounter expressions of the following kind: NonLoc OP Loc Loc OP NonLoc Where the `Op` is other than `BO_Add`. As of now, due to the smarter simplification and the fixedpoint iteration, it turns out we can. It can happen if the `Loc` was perfectly constrained to a concrete value (`nonloc::ConcreteInt`), thus the simplifier can do constant-folding in these cases as well. Unfortunately, this could cause assertion failures, since we assumed that the operator must be `BO_Add`, causing a crash. --- In the patch, I decided to preserve the original behavior (aka. swap the operands (if the operator is commutative), but if the `RHS` was a `loc::ConcreteInt` call `evalBinOpNN()`. I think this interpretation of the arithmetic expression is closer to reality. I also tried naively introducing a separate handler for `loc::ConcreteInt` RHS, before doing handling the more generic `Loc` RHS case. However, it broke the `zoo1backwards()` test in the `nullptr.cpp` file. This highlighted for me the importance to preserve the original behavior for the `BO_Add` at least. PS: Sorry for introducing yet another branch into this `evalBinOpXX` madness. I've got a couple of ideas about refactoring these. We'll see if I can get to it. The test file demonstrates the issue and makes sure nothing similar happens. The `no-crash` annotated lines show, where we crashed before applying this patch. Reviewed By: martong Differential Revision: https://reviews.llvm.org/D115149	2021-12-06 18:38:58 +01:00
Balazs Benics	9873ef409c	[analyzer] Ignore flex generated files Some projects [1,2,3] have flex-generated files besides bison-generated ones. Unfortunately, the comment `"/* A lexical scanner generated by flex */"` generated by the tools is not necessarily at the beginning of the file, thus we need to quickly skim through the file for this needle string. Luckily, StringRef can do this operation in an efficient way. That being said, now the bison comment is not required to be at the very beginning of the file. This allows us to detect a couple more cases [4,5,6]. Alternatively, we could say that we only allow whitespace characters before matching the bison/flex header comment. That would prevent the (probably) unnecessary string search in the buffer. However, I could not verify that these tools would actually respect this assumption. Additionally to this, e.g. the Twin project [1] has other non-whitespace characters (some preprocessor directives) before the flex-generated header comment. So the heuristic in the previous paragraph won't work with that. Thus, I would advocate the current implementation. According to my measurement, this patch won't introduce measurable performance degradation, even though we will do 2 linear scans. I introduce the ignore-bison-generated-files and ignore-flex-generated-files to disable skipping these files. Both of these options are true by default. [1]: https://github.com/cosmos72/twin/blob/master/server/rcparse_lex.cpp#L7 [2]: `22362cdcf9/sandbox/count-words/lexer.c (L6)` [3]: `11abdf6462/lab1/lex.yy.c (L6)` [4]: `47f5b2cfe2/B_yacc/1/y1.tab.h (L2)` [5]: `71d1bf9b1e/src/VBox/Additions/x11/x11include/xorg-server-1.8.0/parser.h (L2)` [6]: `3f773ceb13/Framework/OpenEars.framework/Versions/A/Headers/jsgf_parser.h (L2)` Reviewed By: xazax.hun Differential Revision: https://reviews.llvm.org/D114510	2021-12-06 10:20:17 +01:00
Gabor Marton	20f8733d4b	[Analyzer][solver] Simplification: Do a fixpoint iteration before the eq class merge This reverts commit `f02c5f3478` and addresses the issue mentioned in D114619 differently. Repeating the issue here: Currently, during symbol simplification we remove the original member symbol from the equivalence class (`ClassMembers` trait). However, we keep the reverse link (`ClassMap` trait), in order to be able the query the related constraints even for the old member. This asymmetry can lead to a problem when we merge equivalence classes: ``` ClassA: [a, b] // ClassMembers trait, a->a, b->a // ClassMap trait, a is the representative symbol ``` Now let,s delete `a`: ``` ClassA: [b] a->a, b->a ``` Let's merge ClassA into the trivial class `c`: ``` ClassA: [c, b] c->c, b->c, a->a ``` Now, after the merge operation, `c` and `a` are actually in different equivalence classes, which is inconsistent. This issue manifests in a test case (added in D103317): ``` void recurring_symbol(int b) { if (b * b != b) if ((b * b) * b * b != (b * b) * b) if (b * b == 1) } ``` Before the simplification we have these equivalence classes: ``` trivial EQ1: [b * b != b] trivial EQ2: [(b * b) * b * b != (b * b) * b] ``` During the simplification with `b * b == 1`, EQ1 is merged with `1 != b` `EQ1: [b * b != b, 1 != b]` and we remove the complex symbol, so `EQ1: [1 != b]` Then we start to simplify the only symbol in EQ2: `(b * b) * b * b != (b * b) * b --> 1 * b * b != 1 * b --> b * b != b` But `b * b != b` is such a symbol that had been removed previously from EQ1, thus we reach the above mentioned inconsistency. This patch addresses the issue by making it impossible to synthesise a symbol that had been simplified before. We achieve this by simplifying the given symbol to the absolute simplest form. Differential Revision: https://reviews.llvm.org/D114887	2021-12-01 22:23:41 +01:00
Gabor Marton	0a17896fe6	[Analyzer][Core] Make SValBuilder to better simplify svals with 3 symbols in the tree Add the capability to simplify more complex constraints where there are 3 symbols in the tree. In this change I extend simplifySVal to query constraints of children sub-symbols in a symbol tree. (The constraint for the parent is asked in getKnownValue.) Differential Revision: https://reviews.llvm.org/D103317	2021-11-30 11:24:59 +01:00
Gabor Marton	f02c5f3478	[Analyzer][solver] Do not remove the simplified symbol from the eq class Currently, during symbol simplification we remove the original member symbol from the equivalence class (`ClassMembers` trait). However, we keep the reverse link (`ClassMap` trait), in order to be able the query the related constraints even for the old member. This asymmetry can lead to a problem when we merge equivalence classes: ``` ClassA: [a, b] // ClassMembers trait, a->a, b->a // ClassMap trait, a is the representative symbol ``` Now lets delete `a`: ``` ClassA: [b] a->a, b->a ``` Let's merge the trivial class `c` into ClassA: ``` ClassA: [c, b] c->c, b->c, a->a ``` Now after the merge operation, `c` and `a` are actually in different equivalence classes, which is inconsistent. One solution to this problem is to simply avoid removing the original member and this is what this patch does. Other options I have considered: 1) Always merge the trivial class into the non-trivial class. This might work most of the time, however, will fail if we have to merge two non-trivial classes (in that case we no longer can track equivalences precisely). 2) In `removeMember`, update the reverse link as well. This would cease the inconsistency, but we'd loose precision since we could not query the constraints for the removed member. Differential Revision: https://reviews.llvm.org/D114619	2021-11-30 11:13:13 +01:00
Balazs Benics	af37d4b6fe	[analyzer][NFC] Refactor AnalysisConsumer::getModeForDecl() I just read this part of the code, and I found the nested ifs less readable. Reviewed By: martong Differential Revision: https://reviews.llvm.org/D114441	2021-11-29 10:39:36 +01:00
Zarko Todorovski	d42a6432aa	[NFC][clang]Inclusive language: remove remaining uses of sanity Missed some uses of sanity check in previous commits.	2021-11-24 14:20:13 -05:00
Gabor Marton	12887a2024	[Analyzer][Core] Better simplification in SimpleSValBuilder::evalBinOpNN Make the SValBuilder capable to simplify existing SVals based on a newly added constraints when evaluating a BinOp. Before this patch, we called `simplify` only in some edge cases. However, we can and should investigate the constraints in all cases. Differential Revision: https://reviews.llvm.org/D113753	2021-11-23 16:38:01 +01:00
Gabor Marton	ffc32efd1c	[Analyzer][Core] Simplify IntSym in SValBuilder Make the SimpleSValBuilder capable to simplify existing IntSym expressions based on a newly added constraint on the sub-expression. Differential Revision: https://reviews.llvm.org/D113754	2021-11-22 17:33:43 +01:00
Zarko Todorovski	d8e5a0c42b	[clang][NFC] Inclusive terms: replace some uses of sanity in clang Rewording of comments to avoid using `sanity test, sanity check`. Reviewed By: aaron.ballman, Quuxplusone Differential Revision: https://reviews.llvm.org/D114025	2021-11-19 14:58:35 -05:00
Balazs Benics	d5de568cc7	[analyzer][NFC] MaybeUInt -> MaybeCount I forgot to include this in D113594 Differential Revision: https://reviews.llvm.org/D113594	2021-11-19 18:36:55 +01:00
Balazs Benics	e6ef134f3c	[analyzer][NFC] Use enum for CallDescription flags Yeah, let's prefer a slightly stronger type representing this. Reviewed By: martong, xazax.hun Differential Revision: https://reviews.llvm.org/D113595	2021-11-19 18:32:13 +01:00
Balazs Benics	97f1bf15b1	[analyzer][NFC] Consolidate the inner representation of CallDescriptions `CallDescriptions` have a `RequiredArgs` and `RequiredParams` members, but they are of different types, `unsigned` and `size_t` respectively. In the patch I use only `unsigned` for both, that should be large enough anyway. I also introduce the `MaybeUInt` type alias for `Optional<unsigned>`. Additionally, I also avoid the use of the //smart// less-than operator. template <typename T> constexpr bool operator<=(const Optional<T> &X, const T &Y); Which would check if the optional has a value and compare the data only after. I found it surprising, thus I think we are better off without it. Reviewed By: martong, xazax.hun Differential Revision: https://reviews.llvm.org/D113594	2021-11-19 18:32:13 +01:00
Balazs Benics	de9d7e42ac	[analyzer][NFC] CallDescription should own the qualified name parts Previously, CallDescription simply referred to the qualified name parts by `const char` pointers. In the future we might want to dynamically load and populate `CallDescriptionMaps`, hence we will need the `CallDescriptions` to actually own* their qualified name parts. Reviewed By: martong, xazax.hun Differential Revision: https://reviews.llvm.org/D113593	2021-11-19 18:32:13 +01:00
Balazs Benics	9ad0a90baa	[analyzer][NFC] Demonstrate the use of CallDescriptionSet Reviewed By: martong, xazax.hun Differential Revision: https://reviews.llvm.org/D113592	2021-11-19 18:32:13 +01:00
Balazs Benics	f18da190b0	[analyzer][NFC] Switch to using CallDescription::matches() instead of isCalled() This patch replaces each use of the previous API with the new one. In variadic cases, it will use the ADL `matchesAny(Call, CDs...)` variadic function. Also simplifies some code involving such operations. Reviewed By: martong, xazax.hun Differential Revision: https://reviews.llvm.org/D113591	2021-11-19 18:32:13 +01:00
Balazs Benics	6c512703a9	[analyzer][NFC] Introduce CallDescription::matches() in addition to isCalled() This patch introduces `CallDescription::matches()` member function, accepting a `CallEvent`. Semantically, `Call.isCalled(CD)` is the same as `CD.matches(Call)`. The patch also introduces the `matchesAny()` variadic free function template. It accepts a `CallEvent` and at least one `CallDescription` to match against. Reviewed By: martong Differential Revision: https://reviews.llvm.org/D113590	2021-11-19 18:32:13 +01:00
Balazs Benics	d448fcd9b2	[analyzer][NFC] Introduce CallDescriptionSets Sometimes we only want to decide if some function is called, and we don't care which of the set. This `CallDescriptionSet` will have the same behavior, except instead of `lookup()` returning a pointer to the mapped value, the `contains()` returns `bool`. Internally, it uses the `CallDescriptionMap<bool>` for implementing the behavior. It is preferred, to reuse the generic `CallDescriptionMap::lookup()` logic, instead of duplicating it. The generic version might be improved by implementing a hash lookup or something along those lines. Reviewed By: martong, Szelethus Differential Revision: https://reviews.llvm.org/D113589	2021-11-19 18:32:13 +01:00
Kazu Hirata	74115602e8	[clang] Use range-based for loops with llvm::reverse (NFC)	2021-11-17 19:40:48 -08:00
Balazs Benics	0b9d3a6e53	[analyzer][NFC] Separate CallDescription from CallEvent `CallDescriptions` deserve its own translation unit. This patch simply moves the corresponding parts. Also includes the `CallDescription.h` where it's necessary. Reviewed By: martong, xazax.hun, Szelethus Differential Revision: https://reviews.llvm.org/D113587	2021-11-15 19:10:46 +01:00
Denys Petrov	f0bc7d2488	[analyzer] Fix region cast between the same types with different qualifiers. Summary: Specifically, this fixes the case when we get an access to array element through the pointer to element. This covers several FIXME's. in https://reviews.llvm.org/D111654. Example: const int arr[4][2]; const int ptr = arr[1]; // Fixes this. The issue is that `arr[1]` is `int` (&Element{Element{glob_arr5,1 S64b,int[2]},0 S64b,int}), and `ptr` is `const int*`. We don't take qualifiers into account. Consequently, we doesn't match the types as the same ones. Differential Revision: https://reviews.llvm.org/D113480	2021-11-15 19:23:00 +02:00
Kazu Hirata	d0ac215dd5	[clang] Use isa instead of dyn_cast (NFC)	2021-11-14 09:32:40 -08:00
Gabor Marton	01c9700aaa	[analyzer][solver] Remove reference to RangedConstraintManager We no longer need a reference to RangedConstraintManager, we call top level `State->assume` functions. Differential Revision: https://reviews.llvm.org/D113261	2021-11-12 11:44:49 +01:00
Gabor Marton	806329da07	[analyzer][solver] Iterate to a fixpoint during symbol simplification with constants D103314 introduced symbol simplification when a new constant constraint is added. Currently, we simplify existing equivalence classes by iterating over all existing members of them and trying to simplify each member symbol with simplifySVal. At the end of such a simplification round we may end up introducing a new constant constraint. Example: ``` if (a + b + c != d) return; if (c + b != 0) return; // Simplification starts here. if (b != 0) return; ``` The `c == 0` constraint is the result of the first simplification iteration. However, we could do another round of simplification to reach the conclusion that `a == d`. Generally, we could do as many new iterations until we reach a fixpoint. We can reach to a fixpoint by recursively calling `State->assume` on the newly simplified symbol. By calling `State->assume` we re-ignite the whole assume machinery (along e.g with adjustment handling). Why should we do this? By reaching a fixpoint in simplification we are capable of discovering infeasible states at the moment of the introduction of the first constant constraint. Let's modify the previous example just a bit, and consider what happens without the fixpoint iteration. ``` if (a + b + c != d) return; if (c + b != 0) return; // Adding a new constraint. if (a == d) return; // This brings in a contradiction. if (b != 0) return; clang_analyzer_warnIfReached(); // This produces a warning. // The path is already infeasible... if (c == 0) // ...but we realize that only when we evaluate `c == 0`. return; ``` What happens currently, without the fixpoint iteration? As the inline comments suggest, without the fixpoint iteration we are doomed to realize that we are on an infeasible path only after we are already walking on that. With fixpoint iteration we can detect that before stepping on that. With fixpoint iteration, the `clang_analyzer_warnIfReached` does not warn in the above example b/c during the evaluation of `b == 0` we realize the contradiction. The engine and the checkers do rely on that either `assume(Cond)` or `assume(!Cond)` should be feasible. This is in fact assured by the so called expensive checks (LLVM_ENABLE_EXPENSIVE_CHECKS). The StdLibraryFuncionsChecker is notably one of the checkers that has a very similar assertion. Before this patch, we simply added the simplified symbol to the equivalence class. In this patch, after we have added the simplified symbol, we remove the old (more complex) symbol from the members of the equivalence class (`ClassMembers`). Removing the old symbol is beneficial because during the next iteration of the simplification we don't have to consider again the old symbol. Contrary to how we handle `ClassMembers`, we don't remove the old Sym->Class relation from the `ClassMap`. This is important for two reasons: The constraints of the old symbol can still be found via it's equivalence class that it used to be the member of (1). We can spare one removal and thus one additional tree in the forest of `ClassMap` (2). Performance and complexity: Let us assume that in a State we have N non-trivial equivalence classes and that all constraints and disequality info is related to non-trivial classes. In the worst case, we can simplify only one symbol of one class in each iteration. The number of symbols in one class cannot grow b/c we replace the old symbol with the simplified one. Also, the number of the equivalence classes can decrease only, b/c the algorithm does a merge operation optionally. We need N iterations in this case to reach the fixpoint. Thus, the steps needed to be done in the worst case is proportional to `N*N`. Empirical results (attached) show that there is some hardly noticeable run-time and peak memory discrepancy compared to the baseline. In my opinion, these differences could be the result of measurement error. This worst case scenario can be extended to that cases when we have trivial classes in the constraints and in the disequality map are transforming to such a State where there are only non-trivial classes, b/c the algorithm does merge operations. A merge operation on two trivial classes results in one non-trivial class. Differential Revision: https://reviews.llvm.org/D106823	2021-11-12 11:44:49 +01:00
Denys Petrov	a12bfac292	[analyzer] Retrieve a value from list initialization of multi-dimensional array declaration. Summary: Add support of multi-dimensional arrays in `RegionStoreManager::getBindingForElement`. Handle nested ElementRegion's getting offsets and checking for being in bounds. Get values from the nested initialization lists using obtained offsets. Differential Revision: https://reviews.llvm.org/D111654	2021-11-08 16:17:55 +02:00
Balazs Benics	9b5c9c469d	[analyzer] Dump checker name if multiple checkers evaluate the same call Previously, if accidentally multiple checkers `eval::Call`-ed the same `CallEvent`, in debug builds the analyzer detected this and crashed with the message stating this. Unfortunately, the message did not state the offending checkers violating this invariant. This revision addresses this by printing a more descriptive message before aborting. Reviewed By: martong Differential Revision: https://reviews.llvm.org/D112889	2021-11-02 14:42:14 +01:00
Kazu Hirata	4db2e4cebe	Use {DenseSet,SetVector,SmallPtrSet}::contains (NFC)	2021-10-30 19:00:19 -07:00
Zarko Todorovski	8659b241ae	[clang][NFC] Inclusive terms: Replace uses of whitelist in clang/lib/StaticAnalyzer Replace variable and functions names, as well as comments that contain whitelist with more inclusive terms. Reviewed By: aaron.ballman, martong Differential Revision: https://reviews.llvm.org/D112642	2021-10-29 16:51:36 -04:00
Denys Petrov	1deccd05ba	[analyzer] Retrieve a character from StringLiteral as an initializer for constant arrays. Summary: Assuming that values of constant arrays never change, we can retrieve values for specific position(index) right from the initializer, if presented. Retrieve a character code by index from StringLiteral which is an initializer of constant arrays in global scope. This patch has a known issue of getting access to characters past the end of the literal. The declaration, in which the literal is used, is an implicit cast of kind `array-to-pointer`. The offset should be in literal length's bounds. This should be distinguished from the states in the Standard C++20 [dcl.init.string] 9.4.2.3. Example: const char arr[42] = "123"; char c = arr[41]; // OK const char * const str = "123"; char c = str[41]; // NOK Differential Revision: https://reviews.llvm.org/D107339	2021-10-29 19:44:37 +03:00
Mike Rice	6f9c25167d	[OpenMP] Initial parsing/sema for the 'omp loop' construct Adds basic parsing/sema/serialization support for the #pragma omp loop directive. Differential Revision: https://reviews.llvm.org/D112499	2021-10-28 08:26:43 -07:00
Balazs Benics	49285f43e5	[analyzer] sprintf is a taint propagator not a source Due to a typo, `sprintf()` was recognized as a taint source instead of a taint propagator. It was because an empty taint source list - which is the first parameter of the `TaintPropagationRule` - encoded the unconditional taint sources. This typo effectively turned the `sprintf()` into an unconditional taint source. This patch fixes that typo and demonstrated the correct behavior with tests. Reviewed By: martong Differential Revision: https://reviews.llvm.org/D112558	2021-10-28 11:03:02 +02:00
Gabor Marton	a8297ed994	[Analyzer][solver] Handle adjustments in constraint assignor remainder We can reuse the "adjustment" handling logic in the higher level of the solver by calling `State->assume`. Differential Revision: https://reviews.llvm.org/D112296	2021-10-27 17:14:34 +02:00
Gabor Marton	888af47095	[Analyzer][solver] Simplification: reorganize equalities with adjustment Initiate the reorganization of the equality information during symbol simplification. E.g., if we bump into `c + 1 == 0` during simplification then we'd like to express that `c == -1`. It makes sense to do this only with `SymIntExpr`s. Reviewed By: steakhal Differential Revision: https://reviews.llvm.org/D111642	2021-10-27 16:48:55 +02:00
Balazs Benics	c18407217e	[analyzer] Fix StringChecker for Unknown params It seems like protobuf crashed the `std::string` checker. Somehow it acquired `UnknownVal` as the sole `std::string` constructor parameter, causing a crash in the `castAs<Loc>()`. This patch addresses this. Reviewed By: martong Differential Revision: https://reviews.llvm.org/D112551	2021-10-26 18:15:00 +02:00
Denys Petrov	3b1165ba3d	[analyzer] Retrieve incomplete array extent from its redeclaration. Summary: Fix a case when the extent can not be retrieved correctly from incomplete array declaration. Use redeclaration to get the array extent. Differential Revision: https://reviews.llvm.org/D111542	2021-10-25 15:14:10 +03:00
Denys Petrov	44e803ef6d	[analyzer][NFCI] Move a block from `getBindingForElement` to separate functions Summary: 1. Improve readability by moving deeply nested block of code from RegionStoreManager::getBindingForElement to new separate functions: - getConstantValFromConstArrayInitializer; - getSValFromInitListExpr. 2. Handle the case when index is a symbolic value. Write specific test cases. 3. Add test cases when there is no initialization expression presented. This patch implies to make next patches clearer and easier for review process. Differential Revision: https://reviews.llvm.org/D106681	2021-10-25 15:14:10 +03:00
Balazs Benics	e1fdec875f	[analyzer] Add std::string checker This patch adds a checker checking `std::string` operations. At first, it only checks the `std::string` single `const char *` constructor for nullness. If It might be `null`, it will constrain it to non-null and place a note tag there. Reviewed By: martong Differential Revision: https://reviews.llvm.org/D111247	2021-10-25 11:15:40 +02:00
Balazs Benics	f9db6a44eb	Revert "[analyzer][solver] Introduce reasoning for not equal to operator" This reverts commit `cac8808f15`. #5 0x00007f28ec629859 abort (/lib/x86_64-linux-gnu/libc.so.6+0x25859) #6 0x00007f28ec629729 (/lib/x86_64-linux-gnu/libc.so.6+0x25729) #7 0x00007f28ec63af36 (/lib/x86_64-linux-gnu/libc.so.6+0x36f36) #8 0x00007f28ecc2cc46 llvm::APInt::compareSigned(llvm::APInt const&) const (libLLVMSupport.so.14git+0xeac46) #9 0x00007f28e7bbf957 (anonymous namespace)::SymbolicRangeInferrer::VisitBinaryOperator(clang::ento::RangeSet, clang::BinaryOperatorKind, clang::ento::RangeSet, clang::QualType) (libclangStaticAnalyzerCore.so.14git+0x1df957) #10 0x00007f28e7bbf2db (anonymous namespace)::SymbolicRangeInferrer::infer(clang::ento::SymExpr const) (libclangStaticAnalyzerCore.so.14git+0x1df2db) #11 0x00007f28e7bb2b5e (anonymous namespace)::RangeConstraintManager::assumeSymNE(llvm::IntrusiveRefCntPtr<clang::ento::ProgramState const>, clang::ento::SymExpr const, llvm::APSInt const&, llvm::APSInt const&) (libclangStaticAnalyzerCore.so.14git+0x1d2b5e) #12 0x00007f28e7bc67af clang::ento::RangedConstraintManager::assumeSymUnsupported(llvm::IntrusiveRefCntPtr<clang::ento::ProgramState const>, clang::ento::SymExpr const, bool) (libclangStaticAnalyzerCore.so.14git+0x1e67af) #13 0x00007f28e7be3578 clang::ento::SimpleConstraintManager::assumeAux(llvm::IntrusiveRefCntPtr<clang::ento::ProgramState const>, clang::ento::NonLoc, bool) (libclangStaticAnalyzerCore.so.14git+0x203578) #14 0x00007f28e7be33d8 clang::ento::SimpleConstraintManager::assume(llvm::IntrusiveRefCntPtr<clang::ento::ProgramState const>, clang::ento::NonLoc, bool) (libclangStaticAnalyzerCore.so.14git+0x2033d8) #15 0x00007f28e7be32fb clang::ento::SimpleConstraintManager::assume(llvm::IntrusiveRefCntPtr<clang::ento::ProgramState const>, clang::ento::DefinedSVal, bool) (libclangStaticAnalyzerCore.so.14git+0x2032fb) #16 0x00007f28e7b15dbc clang::ento::ConstraintManager::assumeDual(llvm::IntrusiveRefCntPtr<clang::ento::ProgramState const>, clang::ento::DefinedSVal) (libclangStaticAnalyzerCore.so.14git+0x135dbc) #17 0x00007f28e7b4780f clang::ento::ExprEngine::evalEagerlyAssumeBinOpBifurcation(clang::ento::ExplodedNodeSet&, clang::ento::ExplodedNodeSet&, clang::Expr const) (libclangStaticAnalyzerCore.so.14git+0x16780f) This is known to be triggered on curl, tinyxml2, tmux, twin and on xerces. But @bjope also reported similar crashes. So, I'm reverting it to make our internal bots happy again. Differential Revision: https://reviews.llvm.org/D106102	2021-10-23 21:01:59 +02:00
Kazu Hirata	d8e4170b0a	Ensure newlines at the end of files (NFC)	2021-10-23 08:45:29 -07:00
Manas	cac8808f15	[analyzer][solver] Introduce reasoning for not equal to operator Prior to this, the solver was only able to verify whether two symbols are equal/unequal, only when constants were involved. This patch allows the solver to work over ranges as well. Reviewed By: steakhal, martong Differential Revision: https://reviews.llvm.org/D106102 Patch by: @manas (Manas Gupta)	2021-10-22 12:00:08 +02:00
Gabor Marton	5f8dca0235	[Analyzer] Extend ConstraintAssignor to handle remainder op Summary: `a % b != 0` implies that `a != 0` for any `a` and `b`. This patch extends the ConstraintAssignor to do just that. In fact, we could do something similar with division and in case of multiplications we could have some other inferences, but I'd like to keep these for future patches. Fixes https://bugs.llvm.org/show_bug.cgi?id=51940 Reviewers: noq, vsavchenko, steakhal, szelethus, asdenyspetrov Subscribers: Differential Revision: https://reviews.llvm.org/D110357	2021-10-22 10:47:25 +02:00
Gabor Marton	e2a2c8328f	[Analyzer][NFC] Add RangedConstraintManager to ConstraintAssignor In this patch we store a reference to `RangedConstraintManager` in the `ConstraintAssignor`. This way it is possible to call back and reuse some functions of it. This patch is exclusively needed for its child patches, it is not intended to be a standalone patch. Differential Revision: https://reviews.llvm.org/D111640	2021-10-22 10:46:28 +02:00
Gabor Marton	01b4ddbfbb	[Analyzer][NFC] Move RangeConstraintManager's def before ConstraintAssignor's def In this patch we simply move the definition of RangeConstraintManager before the definition of ConstraintAssignor. This patch is exclusively needed for it's child patch, so in the child the diff would be clean and the review would be easier. Differential Revision: https://reviews.llvm.org/D110387	2021-10-22 10:46:28 +02:00
Simon Pilgrim	7562f3df89	InvalidPtrChecker - don't dereference a dyn_cast<> - use cast<> instead. Avoid dereferencing a nullptr returned by dyn_cast<>, by using cast<> instead which asserts that the cast is valid.	2021-10-20 18:06:00 +01:00
Balazs Benics	16be17ad4b	[analyzer][NFC] Refactor llvm::isa<> usages in the StaticAnalyzer It turns out llvm::isa<> is variadic, and we could have used this at a lot of places. The following patterns: x && isa<T1>(x) \|\| isa<T2>(x) ... Will be replaced by: isa_and_non_null<T1, T2, ...>(x) Sometimes it caused further simplifications, when it would cause even more code smell. Aside from this, keep in mind that within `assert()` or any macro functions, we need to wrap the isa<> expression within a parenthesis, due to the parsing of the comma. Reviewed By: martong Differential Revision: https://reviews.llvm.org/D111982	2021-10-20 17:43:31 +02:00
Kazu Hirata	0abb5d293c	[Sema, StaticAnalyzer] Use StringRef::contains (NFC)	2021-10-20 08:02:36 -07:00
Balazs Benics	72d04d7b2b	[analyzer] Allow matching non-CallExprs using CallDescriptions Fallback to stringification and string comparison if we cannot compare the `IdentifierInfo`s, which is the case for C++ overloaded operators, constructors, destructors, etc. Examples: { "std", "basic_string", "basic_string", 2} // match the 2 param std::string constructor { "std", "basic_string", "~basic_string" } // match the std::string destructor { "aaa", "bbb", "operator int" } // matches the struct bbb conversion operator to int Reviewed By: martong Differential Revision: https://reviews.llvm.org/D111535	2021-10-18 14:57:24 +02:00
Balazs Benics	3ec7b91141	[analyzer][NFC] Refactor CallEvent::isCalled() Refactor the code to make it more readable. It will set up further changes, and improvements to this code in subsequent patches. This is a non-functional change. Reviewed By: martong Differential Revision: https://reviews.llvm.org/D111534	2021-10-18 14:57:24 +02:00
Kazu Hirata	d245f2e859	[clang] Use llvm::erase_if (NFC)	2021-10-17 13:50:29 -07:00
Kazu Hirata	6a154e606e	[clang] Use llvm::is_contained (NFC)	2021-10-15 10:07:08 -07:00
Artem Dergachev	12cbc8cbf0	[analyzer] Fix property access kind detection inside parentheses. '(self.prop)' produces a surprising AST where ParenExpr resides inside `PseudoObjectExpr. This breaks ObjCMethodCall::getMessageKind() which in turn causes us to perform unnecessary dynamic dispatch bifurcation when evaluating body-farmed property accessors, which in turn causes us to explore infeasible paths.	2021-10-14 21:07:19 -07:00
Gabor Marton	ac3edc5af0	[analyzer][solver] Handle simplification to ConcreteInt The solver's symbol simplification mechanism was not able to handle cases when a symbol is simplified to a concrete integer. This patch adds the capability. E.g., in the attached lit test case, the original symbol is `c + 1` and it has a `[0, 0]` range associated with it. Then, a new condition `c == 0` is assumed, so a new range constraint `[0, 0]` comes in for `c` and simplification kicks in. `c + 1` becomes `0 + 1`, but the associated range is `[0, 0]`, so now we are able to realize the contradiction. Differential Revision: https://reviews.llvm.org/D110913	2021-10-14 17:53:29 +02:00
Kazu Hirata	e567f37dab	[clang] Use llvm::is_contained (NFC)	2021-10-13 20:41:55 -07:00
Balazs Benics	edde4efc66	[analyzer] Introduce the assume-controlled-environment config option If the `assume-controlled-environment` is `true`, we should expect `getenv()` to succeed, and the result should not be considered tainted. By default, the option will be `false`. Reviewed By: NoQ, martong Differential Revision: https://reviews.llvm.org/D111296	2021-10-13 10:50:26 +02:00
Balazs Benics	7fc150309d	[analyzer] Bifurcate on getenv() calls The `getenv()` function might return `NULL` just like any other function. However, in case of `getenv()` a state-split seems justified since the programmer should expect the failure of this function. `secure_getenv(const char *name)` behaves the same way but is not handled right now. Note that `std::getenv()` is also not handled. Reviewed By: martong Differential Revision: https://reviews.llvm.org/D111245	2021-10-13 10:50:26 +02:00
Artem Dergachev	f3ec9d8501	[analyzer] Fix non-obvious analyzer warning: Use of zero-allocated memory. Clarify the message provided when the analyzer catches the use of memory that is allocated with size zero. Differential Revision: https://reviews.llvm.org/D111655	2021-10-12 10:41:00 -07:00
Gabor Marton	b8f6c85a83	[analyzer][NFC] Add RangeSet::dump This tiny change improves the debugging experience of the solver a lot! Differential Revision: https://reviews.llvm.org/D110911	2021-10-06 18:45:07 +02:00
Gabor Marton	792be5df92	[analyzer][solver] Fix CmpOpTable handling bug There is an error in the implementation of the logic of reaching the `Unknonw` tristate in CmpOpTable. ``` void cmp_op_table_unknownX2(int x, int y, int z) { if (x >= y) { // x >= y [1, 1] if (x + z < y) return; // x + z < y [0, 0] if (z != 0) return; // x < y [0, 0] clang_analyzer_eval(x > y); // expected-warning{{TRUE}} expected-warning{{FALSE}} } } ``` We miss the `FALSE` warning because the false branch is infeasible. We have to exploit simplification to discover the bug. If we had `x < y` as the second condition then the analyzer would return the parent state on the false path and the new constraint would not be part of the State. But adding `z` to the condition makes both paths feasible. The root cause of the bug is that we reach the `Unknown` tristate twice, but in both occasions we reach the same `Op` that is `>=` in the test case. So, we reached `>=` twice, but we never reached `!=`, thus querying the `Unknonw2x` column with `getCmpOpStateForUnknownX2` is wrong. The solution is to ensure that we reached both different `Op`s once. Differential Revision: https://reviews.llvm.org/D110910	2021-10-06 18:28:03 +02:00
Vince Bridgers	b29186c08a	[analyzer] canonicalize special case of structure/pointer deref This simple change addresses a special case of structure/pointer aliasing that produced different symbolvals, leading to false positives during analysis. The reproducer is as simple as this. ```lang=C++ struct s { int v; }; void foo(struct s ps) { struct s ss = ps; clang_analyzer_dump(ss.v); // reg_$1<int Element{SymRegion{reg_$0<struct s ps>},0 S64b,struct s}.v> clang_analyzer_dump(ps->v); //reg_$3<int SymRegion{reg_$0<struct s ps>}.v> clang_analyzer_eval(ss.v == ps->v); // UNKNOWN } ``` Acks: Many thanks to @steakhal and @martong for the group debug session. Reviewed By: steakhal, martong Differential Revision: https://reviews.llvm.org/D110625	2021-10-06 05:18:27 -05:00
Corentin Jabot	424733c12a	Implement if consteval (P1938) Modify the IfStmt node to suppoort constant evaluated expressions. Add a new ExpressionEvaluationContext::ImmediateFunctionContext to keep track of immediate function contexts. This proved easier/better/probably more efficient than walking the AST backward as it allows diagnosing nested if consteval statements.	2021-10-05 08:04:14 -04:00
Zurab Tsinadze	811b1736d9	[analyzer] Add InvalidPtrChecker This patch introduces a new checker: `alpha.security.cert.env.InvalidPtr` Checker finds usage of invalidated pointers related to environment. Based on the following SEI CERT Rules: ENV34-C: https://wiki.sei.cmu.edu/confluence/x/8tYxBQ ENV31-C: https://wiki.sei.cmu.edu/confluence/x/5NUxBQ Reviewed By: martong Differential Revision: https://reviews.llvm.org/D97699	2021-10-04 17:08:34 +02:00
Jay Foad	d933adeaca	[APInt] Stop using soft-deprecated constructors and methods in clang. NFC. Stop using APInt constructors and methods that were soft-deprecated in D109483. This fixes all the uses I found in clang. Differential Revision: https://reviews.llvm.org/D110808	2021-10-04 09:38:11 +01:00
Denys Petrov	98a95d4844	[analyzer] Retrieve a value from list initialization of constant array declaration in a global scope. Summary: Fix the point that we didn't take into account array's dimension. Retrieve a value of global constant array by iterating through its initializer list. Differential Revision: https://reviews.llvm.org/D104285 Fixes: https://bugs.llvm.org/show_bug.cgi?id=50604	2021-09-24 12:37:58 +03:00
Nico Weber	9197834535	Revert "Fix CLANG_ENABLE_STATIC_ANALYZER=OFF building all analyzer source" This reverts commit `6d7b3d6b3a`. Breaks running cmake with `-DCLANG_ENABLE_STATIC_ANALYZER=OFF` without turning off CLANG_TIDY_ENABLE_STATIC_ANALYZER. See comments on https://reviews.llvm.org/D109611 for details.	2021-09-20 16:18:03 -04:00
Alex Richardson	6d7b3d6b3a	Fix CLANG_ENABLE_STATIC_ANALYZER=OFF building all analyzer source Since https://reviews.llvm.org/D87118, the StaticAnalyzer directory is added unconditionally. In theory this should not cause the static analyzer sources to be built unless they are referenced by another target. However, the clang-cpp target (defined in clang/tools/clang-shlib) uses the CLANG_STATIC_LIBS global property to determine which libraries need to be included. To solve this issue, this patch avoids adding libraries to that property if EXCLUDE_FROM_ALL is set. In case something like this comes up again: `cmake --graphviz=targets.dot` is quite useful to see why a target is included as part of `ninja all`. Reviewed By: thakis Differential Revision: https://reviews.llvm.org/D109611	2021-09-20 12:55:56 +01:00
alokmishra.besu	000875c127	OpenMP 5.0 metadirective This patch supports OpenMP 5.0 metadirective features. It is implemented keeping the OpenMP 5.1 features like dynamic user condition in mind. A new function, getBestWhenMatchForContext, is defined in llvm/Frontend/OpenMP/OMPContext.h Currently this function return the index of the when clause with the highest score from the ones applicable in the Context. But this function is declared with an array which can be used in OpenMP 5.1 implementation to select all the valid when clauses which can be resolved in runtime. Currently this array is set to null by default and its implementation is left for future. Reviewed By: jdoerfert Differential Revision: https://reviews.llvm.org/D91944	2021-09-18 13:40:44 -05:00
Nico Weber	31cca21565	Revert "OpenMP 5.0 metadirective" This reverts commit `c7d7b98e52`. Breaks tests on macOS, see comment on https://reviews.llvm.org/D91944	2021-09-18 09:10:37 -04:00
alokmishra.besu	347f3c186d	OpenMP 5.0 metadirective This patch supports OpenMP 5.0 metadirective features. It is implemented keeping the OpenMP 5.1 features like dynamic user condition in mind. A new function, getBestWhenMatchForContext, is defined in llvm/Frontend/OpenMP/OMPContext.h Currently this function return the index of the when clause with the highest score from the ones applicable in the Context. But this function is declared with an array which can be used in OpenMP 5.1 implementation to select all the valid when clauses which can be resolved in runtime. Currently this array is set to null by default and its implementation is left for future. Reviewed By: jdoerfert Differential Revision: https://reviews.llvm.org/D91944	2021-09-17 16:30:06 -05:00
cchen	7efb825382	Revert "OpenMP 5.0 metadirective" This reverts commit `c7d7b98e52`.	2021-09-17 16:14:16 -05:00
cchen	c7d7b98e52	OpenMP 5.0 metadirective This patch supports OpenMP 5.0 metadirective features. It is implemented keeping the OpenMP 5.1 features like dynamic user condition in mind. A new function, getBestWhenMatchForContext, is defined in llvm/Frontend/OpenMP/OMPContext.h Currently this function return the index of the when clause with the highest score from the ones applicable in the Context. But this function is declared with an array which can be used in OpenMP 5.1 implementation to select all the valid when clauses which can be resolved in runtime. Currently this array is set to null by default and its implementation is left for future. Reviewed By: jdoerfert Differential Revision: https://reviews.llvm.org/D91944	2021-09-17 16:03:13 -05:00
Gabor Marton	96ec9b6ff2	[Analyzer] ConversionChecker: track back the cast expression Adding trackExpressionValue to the checker so it tracks the value of the implicit cast's DeclRefExpression up to initialization/assignment. This way the report becomes cleaner. Differential Revision: https://reviews.llvm.org/D109836	2021-09-16 11:42:54 +02:00
Kristóf Umann	9d359f6c73	[analyzer] MallocChecker: Add notes from NoOwnershipChangeVisitor only when a function "intents", but doesn't change ownership, enable by default D105819 Added NoOwnershipChangeVisitor, but it is only registered when an off-by-default, hidden checker option was enabled. The reason behind this was that it grossly overestimated the set of functions that really needed a note: std::string getTrainName(const Train T) { return T->name; } // note: Retuning without changing the ownership of or deallocating memory // Umm... I mean duh? Nor would I expect this function to do anything like that... void foo() { Train T = new Train("Land Plane"); print(getTrainName(T)); // note: calling getTrainName / returning from getTrainName } // warn: Memory leak This patch adds a heuristic that guesses that any function that has an explicit operator delete call could have be responsible for deallocating the memory that ended up leaking. This is waaaay too conservative (see the TODOs in the new function), but it safer to err on the side of too little than too much, and would allow us to enable the option by default now, and add refinements one-by-one. Differential Revision: https://reviews.llvm.org/D108753	2021-09-13 15:01:20 +02:00
Kristóf Umann	0213d7ec0c	[analyzer][NFCI] Allow clients of NoStateChangeFuncVisitor to check entire function calls, rather than each ExplodedNode in it Fix a compilation error due to a missing 'template' keyword. Differential Revision: https://reviews.llvm.org/D108695	2021-09-13 13:50:01 +02:00
Chris Lattner	735f46715d	[APInt] Normalize naming on keep constructors / predicate methods. This renames the primary methods for creating a zero value to `getZero` instead of `getNullValue` and renames predicates like `isAllOnesValue` to simply `isAllOnes`. This achieves two things: 1) This starts standardizing predicates across the LLVM codebase, following (in this case) ConstantInt. The word "Value" doesn't convey anything of merit, and is missing in some of the other things. 2) Calling an integer "null" doesn't make any sense. The original sin here is mine and I've regretted it for years. This moves us to calling it "zero" instead, which is correct! APInt is widely used and I don't think anyone is keen to take massive source breakage on anything so core, at least not all in one go. As such, this doesn't actually delete any entrypoints, it "soft deprecates" them with a comment. Included in this patch are changes to a bunch of the codebase, but there are more. We should normalize SelectionDAG and other APIs as well, which would make the API change more mechanical. Differential Revision: https://reviews.llvm.org/D109483	2021-09-09 09:50:24 -07:00
Balazs Benics	b97a96400a	[analyzer] SValBuilder should have an easy access to AnalyzerOptions `SVB.getStateManager().getOwningEngine().getAnalysisManager().getAnalyzerOptions()` is quite a mouthful and might involve a few pointer indirections to get such a simple thing like an analyzer option. This patch introduces an `AnalyzerOptions` reference to the `SValBuilder` abstract class, while refactors a few cases to use this /simpler/ accessor. Reviewed By: martong, Szelethus Differential Revision: https://reviews.llvm.org/D108824	2021-09-04 10:19:57 +02:00
Balazs Benics	91c07eb8ee	[analyzer] Ignore single element arrays in getStaticSize() conditionally Quoting https://gcc.gnu.org/onlinedocs/gcc/Zero-Length.html: > In the absence of the zero-length array extension, in ISO C90 the contents > array in the example above would typically be declared to have a single > element. We should not assume that the size of the //flexible array member// field has a single element, because in some cases they use it as a fallback for not having the //zero-length array// language extension. In this case, the analyzer should return `Unknown` as the extent of the field instead. Reviewed By: martong Differential Revision: https://reviews.llvm.org/D108230	2021-09-04 10:19:57 +02:00
Jessica Paquette	b9e57e0305	Revert "[analyzer][NFCI] Allow clients of NoStateChangeFuncVisitor to check entire function calls, rather than each ExplodedNode in it" This reverts commit `a375bfb5b7`. This was causing a bot to crash: https://green.lab.llvm.org/green/job/clang-stage1-cmake-RA-incremental/23380/	2021-09-03 10:28:07 -07:00
Kristóf Umann	a375bfb5b7	[analyzer][NFCI] Allow clients of NoStateChangeFuncVisitor to check entire function calls, rather than each ExplodedNode in it D105553 added NoStateChangeFuncVisitor, an abstract class to aid in creating notes such as "Returning without writing to 'x'", or "Returning without changing the ownership status of allocated memory". Its clients need to define, among other things, what a change of state is. For code like this: f() { g(); } foo() { f(); h(); } We'd have a path in the ExplodedGraph that looks like this: -- <g> --> / \ --- <f> --------> --- <h> ---> / \ / \ -------- <foo> ------ <foo> --> When we're interested in whether f neglected to change some property, NoStateChangeFuncVisitor asks these questions: ÷×~ -- <g> --> ß / \$ @&#* --- <f> --------> --- <h> ---> / \ / \ -------- <foo> ------ <foo> --> Has anything changed in between # and ? Has anything changed in between & and ? Has anything changed in between @ and ? ... Has anything changed in between $ and ? Has anything changed in between × and ~? Has anything changed in between ÷ and ~? ... Has anything changed in between ß and ? ... This is a rather thorough line of questioning, which is why in D105819, I was only interested in whether state right before* and right after a function call changed, and early returned to the CallEnter location: if (!CurrN->getLocationAs<CallEnter>()) return; Except that I made a typo, and forgot to negate the condition. So, in this patch, I'm fixing that, and under the same hood allow all clients to decide to do this whole-function check instead of the thorough one. Differential Revision: https://reviews.llvm.org/D108695	2021-09-03 13:50:18 +02:00
Kristóf Umann	3891b45a06	Revert "[analyzer][NFCI] Allow clients of NoStateChangeFuncVisitor to check entire function calls, rather than each ExplodedNode in it" This reverts commit `7d0e62bfb7`.	2021-09-02 17:19:49 +02:00
Kristóf Umann	7d0e62bfb7	[analyzer][NFCI] Allow clients of NoStateChangeFuncVisitor to check entire function calls, rather than each ExplodedNode in it D105553 added NoStateChangeFuncVisitor, an abstract class to aid in creating notes such as "Returning without writing to 'x'", or "Returning without changing the ownership status of allocated memory". Its clients need to define, among other things, what a change of state is. For code like this: f() { g(); } foo() { f(); h(); } We'd have a path in the ExplodedGraph that looks like this: -- <g> --> / \ --- <f> --------> --- <h> ---> / \ / \ -------- <foo> ------ <foo> --> When we're interested in whether f neglected to change some property, NoStateChangeFuncVisitor asks these questions: ÷×~ -- <g> --> ß / \$ @&#* --- <f> --------> --- <h> ---> / \ / \ -------- <foo> ------ <foo> --> Has anything changed in between # and ? Has anything changed in between & and ? Has anything changed in between @ and ? ... Has anything changed in between $ and ? Has anything changed in between × and ~? Has anything changed in between ÷ and ~? ... Has anything changed in between ß and ? ... This is a rather thorough line of questioning, which is why in D105819, I was only interested in whether state right before* and right after a function call changed, and early returned to the CallEnter location: if (!CurrN->getLocationAs<CallEnter>()) return; Except that I made a typo, and forgot to negate the condition. So, in this patch, I'm fixing that, and under the same hood allow all clients to decide to do this whole-function check instead of the thorough one. Differential Revision: https://reviews.llvm.org/D108695	2021-09-02 16:56:32 +02:00
Fanbo Meng	ae206db2d6	[SystemZ][z/OS] Create html report file with text flag Change OF_None to OF_Text flag in file creation, same reasoning as https://reviews.llvm.org/D97785 Reviewed By: abhina.sreeskantharajan Differential Revision: https://reviews.llvm.org/D108998	2021-08-31 11:52:04 -04:00
Balazs Benics	68088563fb	[analyzer] MallocOverflow should consider comparisons only preceding malloc MallocOverflow works in two phases: 1) Collects suspicious malloc calls, whose argument is a multiplication 2) Filters the aggregated list of suspicious malloc calls by iterating over the BasicBlocks of the CFG looking for comparison binary operators over the variable constituting in any suspicious malloc. Consequently, it suppressed true-positive cases when the comparison check was after the malloc call. In this patch the checker will consider the relative position of the relation check to the malloc call. E.g.: ```lang=C++ void check_after_malloc(int n, int x) { int p = NULL; if (x == 42) p = malloc(n * sizeof(int)); // Previously no warning, now it // warns about this. // The check is after the allocation! if (n > 10) { // Do something conditionally. } return p; } ``` Reviewed By: martong Differential Revision: https://reviews.llvm.org/D107804	2021-08-27 14:41:26 +02:00
Balazs Benics	6ad47e1c4f	[analyzer] Catch leaking stack addresses via stack variables Not only global variables can hold references to dead stack variables. Consider this example: void write_stack_address_to(char *q) { char local; q = &local; } void test_stack() { char p; write_stack_address_to(&p); } The address of 'local' is assigned to 'p', which becomes a dangling pointer after 'write_stack_address_to()' returns. The StackAddrEscapeChecker was looking for bindings in the store which referred to variables of the popped stack frame, but it only considered global variables in this regard. This patch relaxes this, catching stack variable bindings as well. --- This patch also works for temporary objects like: struct Bar { const int &ref; explicit Bar(int y) : ref(y) { // Okay. } // End of the constructor call, `ref` is dangling now. Warning! }; void test() { Bar{33}; // Temporary object, so the corresponding memregion is // not* a VarRegion. } --- The return value optimization aka. copy-elision might kick in but that is modeled by passing an imaginary CXXThisRegion which refers to the parent stack frame which is supposed to be the 'return slot'. Objects residing in the 'return slot' outlive the scope of the inner call, thus we should expect no warning about them - except if we explicitly disable copy-elision. Reviewed By: NoQ, martong Differential Revision: https://reviews.llvm.org/D107078	2021-08-27 11:31:16 +02:00
Artem Dergachev	7309359928	[analyzer] Fix scan-build report deduplication. The previous behavior was to deduplicate reports based on md5 of the html file. This algorithm might have worked originally but right now HTML reports contain information rich enough to make them virtually always distinct which breaks deduplication entirely. The new strategy is to (finally) take advantage of IssueHash - the stable report identifier provided by clang that is the same if and only if the reports are duplicates of each other. Additionally, scan-build no longer performs deduplication on its own. Instead, the report file name is now based on the issue hash, and clang instances will silently refuse to produce a new html file when a duplicate already exists. This eliminates the problem entirely. The '-analyzer-config stable-report-filename' option is deprecated because report filenames are no longer unstable. A new option is introduced, '-analyzer-config verbose-report-filename', to produce verbose file names that look similar to the old "stable" file names. The old option acts as an alias to the new option. Differential Revision: https://reviews.llvm.org/D105167	2021-08-26 13:34:29 -07:00
Balazs Benics	e5646b9254	Revert "Revert "[analyzer] Ignore IncompleteArrayTypes in getStaticSize() for FAMs"" This reverts commit `df1f4e0cc6`. Now the test case explicitly specifies the target triple. I decided to use x86_64 for that matter, to have a fixed bitwidth for `size_t`. Aside from that, relanding the original changes of: https://reviews.llvm.org/D105184	2021-08-25 17:19:06 +02:00
Balazs Benics	df1f4e0cc6	Revert "[analyzer] Ignore IncompleteArrayTypes in getStaticSize() for FAMs" This reverts commit `360ced3b8f`.	2021-08-25 16:43:25 +02:00
Balazs Benics	360ced3b8f	[analyzer] Ignore IncompleteArrayTypes in getStaticSize() for FAMs Currently only `ConstantArrayType` is considered for flexible array members (FAMs) in `getStaticSize()`. However, `IncompleteArrayType` also shows up in practice as FAMs. This patch will ignore the `IncompleteArrayType` and return Unknown for that case as well. This way it will be at least consistent with the current behavior until we start modeling them accurately. I'm expecting that this will resolve a bunch of false-positives internally, caused by the `ArrayBoundV2`. Reviewed By: ASDenysPetrov Differential Revision: https://reviews.llvm.org/D105184	2021-08-25 16:12:17 +02:00
Simon Pilgrim	ae691648b4	Fix unknown parameter Wdocumentation warning. NFC.	2021-08-19 15:40:10 +01:00
Denys Petrov	9dabacd09f	[analyzer] Adjust JS code of analyzer's HTML report for IE support. Summary: Change and replace some functions which IE does not support. This patch is made as a continuation of D92928 revision. Also improve hot keys behavior. Differential Revision: https://reviews.llvm.org/D107366	2021-08-17 19:32:34 +03:00
Rong Xu	9b8425e42c	Reapply commit `b7425e956` The commit `b7425e956`: [NFC] fix typos is harmless but was reverted by accident. Reapply.	2021-08-16 12:18:40 -07:00
Kostya Kortchinsky	80ed75e7fb	Revert "[NFC] Fix typos" This reverts commit `b7425e956b`.	2021-08-16 11:13:05 -07:00
Rong Xu	b7425e956b	[NFC] Fix typos s/senstive/senstive/g	2021-08-16 10:15:30 -07:00
Kristóf Umann	2d3668c997	[analyzer] MallocChecker: Add a visitor to leave a note on functions that could have, but did not change ownership on leaked memory This is a rather common feedback we get from out leak checkers: bug reports are really short, and are contain barely any usable information on what the analyzer did to conclude that a leak actually happened. This happens because of our bug report minimizing effort. We construct bug reports by inspecting the ExplodedNodes that lead to the error from the bottom up (from the error node all the way to the root of the exploded graph), and mark entities that were the cause of a bug, or have interacted with it as interesting. In order to make the bug report a bit less verbose, whenever we find an entire function call (from CallEnter to CallExitEnd) that didn't talk about any interesting entity, we prune it (click here for more info on bug report generation). Even if the event to highlight is exactly this lack of interaction with interesting entities. D105553 generalized the visitor that creates notes for these cases. This patch adds a new kind of NoStateChangeVisitor that leaves notes in functions that took a piece of dynamically allocated memory that later leaked as parameter, and didn't change its ownership status. Differential Revision: https://reviews.llvm.org/D105553	2021-08-16 16:19:00 +02:00
Kristóf Umann	c019142a89	[analyzer][NFC] Split the main logic of NoStoreFuncVisitor to an abstract NoStateChangeVisitor class Preceding discussion on cfe-dev: https://lists.llvm.org/pipermail/cfe-dev/2021-June/068450.html NoStoreFuncVisitor is a rather unique visitor. As VisitNode is invoked on most other visitors, they are looking for the point where something changed -- change on a value, some checker-specific GDM trait, a new constraint. NoStoreFuncVisitor, however, looks specifically for functions that didn't write to a MemRegion of interesting. Quoting from its comments: /// Put a diagnostic on return statement of all inlined functions /// for which the region of interest \p RegionOfInterest was passed into, /// but not written inside, and it has caused an undefined read or a null /// pointer dereference outside. It so happens that there are a number of other similar properties that are worth checking. For instance, if some memory leaks, it might be interesting why a function didn't take ownership of said memory: void sink(int P) {} // no notes void f() { sink(new int(5)); // note: Memory is allocated // Well hold on, sink() was supposed to deal with // that, this must be a false positive... } // warning: Potential memory leak [cplusplus.NewDeleteLeaks] In here, the entity of interest isn't a MemRegion, but a symbol. The property that changed here isn't a change of value, but rather liveness and GDM traits managed by MalloChecker. This patch moves some of the logic of NoStoreFuncVisitor to a new abstract class, NoStateChangeFuncVisitor. This is mostly calculating and caching the stack frames in which the entity of interest wasn't changed. Descendants of this interface have to define 3 things: What constitutes as a change to an entity (this is done by overriding wasModifiedBeforeCallExit) * What the diagnostic message should be (this is done by overriding maybeEmitNoteFor.) What constitutes as the entity of interest being passed into the function (this is also done by overriding maybeEmitNoteFor.*) Differential Revision: https://reviews.llvm.org/D105553	2021-08-16 15:03:22 +02:00
Balázs Kéri	9f517fd11e	[clang][analyzer] Improve bug report in alpha.security.ReturnPtrRange Add some notes and track of bad return value. Reviewed By: steakhal Differential Revision: https://reviews.llvm.org/D107051	2021-08-11 13:04:55 +02:00
Christopher Di Bella	c874dd5362	[llvm][clang][NFC] updates inline licence info Some files still contained the old University of Illinois Open Source Licence header. This patch replaces that with the Apache 2 with LLVM Exception licence. Differential Revision: https://reviews.llvm.org/D107528	2021-08-11 02:48:53 +00:00
Vince Bridgers	d39ebdae67	[analyzer] Cleanup a FIXME in SValBuilder.cpp This change follows up on a FIXME submitted with D105974. This change simply let's the reference case fall through to return a concrete 'true' instead of a nonloc pointer of appropriate length set to NULL. Reviewed By: NoQ Differential Revision: https://reviews.llvm.org/D107720	2021-08-10 16:12:52 -05:00
Valeriy Savchenko	9e02f58780	[analyzer] Highlight arrows for currently selected event In some cases, when the execution path of the diagnostic goes back and forth, arrows can overlap and create a mess. Dimming arrows that are not relevant at the moment, solves this issue. They are still visible, but don't draw too much attention. Differential Revision: https://reviews.llvm.org/D92928	2021-08-02 19:15:01 +03:00
Valeriy Savchenko	97bcafa28d	[analyzer] Add control flow arrows to the analyzer's HTML reports This commit adds a very first version of this feature. It is off by default and has to be turned on by checking the corresponding box. For this reason, HTML reports still keep control notes (aka grey bubbles). Further on, we plan on attaching arrows to events and having all arrows not related to a currently selected event barely visible. This will help with reports where control flow goes back and forth (eg in loops). Right now, it can get pretty crammed with all the arrows. Differential Revision: https://reviews.llvm.org/D92639	2021-08-02 19:15:00 +03:00
Matheus Izvekov	0c7cd4a873	[clang] NFC: refactor multiple implementations of getDecltypeForParenthesizedExpr This cleanup patch refactors a bunch of functional duplicates of getDecltypeForParenthesizedExpr into a common implementation. Signed-off-by: Matheus Izvekov <mizvekov@gmail.com> Reviewed By: aaronpuchert Differential Revision: https://reviews.llvm.org/D100713	2021-07-28 23:27:43 +02:00
Gabor Marton	4761321d49	[Analyzer][solver][NFC] print constraints deterministically (ordered by their string representation) This change is an extension to D103967 where I added dump methods for (dis)equality classes of the State. There, the (dis)equality classes and their contents are dumped in an ordered fashion, they are ordered based on their string representation. This is very useful once we start to use FileCheck to test the State dump in certain tests. Differential Revision: https://reviews.llvm.org/D106642	2021-07-26 16:27:23 +02:00
Gabor Marton	44fa31fa6d	[Analyzer][solver] Fix inconsistent equivalence class data https://bugs.llvm.org/show_bug.cgi?id=51109 When we merged two classes, `this` became an obsolete representation of the new `State`. This is b/c the member relations had changed during the previous merge of another member of the same class in a way that `this` had no longer any members. (`mergeImpl` might keep the member relations to `Other` and could dissolve `*this`.) Differential Revision: https://reviews.llvm.org/D106285	2021-07-23 14:25:32 +02:00
Deep Majumder	80068ca623	[analyzer] Fix for faulty namespace test in SmartPtrModelling This patch: - Fixes how the std-namespace test is written in SmartPtrModelling (now accounts for functions with no Decl available) - Adds the smart pointer checker flag check where it was missing Differential Revision: https://reviews.llvm.org/D106296	2021-07-21 18:23:35 +05:30
Gabor Marton	732a8a9dfb	[Analyzer][solver][NFC] Add explanatory comments to trivial eq classes Differential Revision: https://reviews.llvm.org/D106370	2021-07-21 11:59:56 +02:00
Balázs Kéri	90cb5297ad	[clang][analyzer] Improve report of file read at EOF condition (alpha.unix.Stream checker). The checker warns if a stream is read that is already in end-of-file (EOF) state. The commit adds indication of the last location where the EOF flag is set on the stream. Reviewed By: Szelethus Differential Revision: https://reviews.llvm.org/D104925	2021-07-21 08:54:11 +02:00
Deep Majumder	d825309352	[analyzer] Handle std::make_unique Differential Revision: https://reviews.llvm.org/D103750	2021-07-18 19:54:28 +05:30
Deep Majumder	0cd98bef1b	[analyzer] Handle std::swap for std::unique_ptr This patch handles the `std::swap` function specialization for `std::unique_ptr`. Implemented to be very similar to how `swap` method is handled Differential Revision: https://reviews.llvm.org/D104300	2021-07-18 14:38:55 +05:30
Vince Bridgers	918bda1241	[analyzer] Do not assume that all pointers have the same bitwidth as void* This change addresses this assertion that occurs in a downstream compiler with a custom target. ```APInt.h:1151: bool llvm::APInt::operator==(const llvm::APInt &) const: Assertion `BitWidth == RHS.BitWidth && "Comparison requires equal bit widths"'``` No covering test case is susbmitted with this change since this crash cannot be reproduced using any upstream supported target. The test case that exposes this issue is as simple as: ```lang=c++ void test(int * p) { int * q = p-1; if (q) {} if (q) {} // crash (void)q; } ``` The custom target that exposes this problem supports two address spaces, 16-bit `char`s, and a `_Bool` type that maps to 16-bits. There are no upstream supported targets with similar attributes. The assertion appears to be happening as a result of evaluating the `SymIntExpr` `(reg_$0<int * p>) != 0U` in `VisitSymIntExpr` located in `SimpleSValBuilder.cpp`. The `LHS` is evaluated to `32b` and the `RHS` is evaluated to `16b`. This eventually leads to the assertion in `APInt.h`. While this change addresses the crash and passes LITs, two follow-ups are required: 1) The remainder of `getZeroWithPtrWidth()` and `getIntWithPtrWidth()` should be cleaned up following this model to prevent future confusion. 2) We're not sure why references are found along with the modified code path, that should not be the case. A more principled fix may be found after some further comprehension of why this is the case. Acks: Thanks to @steakhal and @martong for the discussions leading to this fix. Reviewed By: NoQ Differential Revision: https://reviews.llvm.org/D105974	2021-07-16 03:22:57 -05:00
Deep Majumder	13fe78212f	[analyzer] Handle << operator for std::unique_ptr This patch handles the `<<` operator defined for `std::unique_ptr` in the std namespace (ignores custom overloads of the operator). Differential Revision: https://reviews.llvm.org/D105421	2021-07-16 12:34:30 +05:30
Deep Majumder	48688257c5	[analyzer] Model comparision methods of std::unique_ptr This patch handles all the comparision methods (defined via overloaded operators) on std::unique_ptr. These operators compare the underlying pointers, which is modelled by comparing the corresponding inner-pointer SVal. There is also a special case for comparing the same pointer. Differential Revision: https://reviews.llvm.org/D104616	2021-07-16 09:54:05 +05:30
Gabor Marton	d0d37fcc4e	[Analyzer][solver] Remove unused functions ../../git/llvm-project/clang/lib/StaticAnalyzer/Core/RangeConstraintManager.cpp:2395:17: warning: 'clang::ento::ProgramStateRef {anonymous}::RangeConstraintManager::setRange(clang::ento::ProgramStateRef, {anonymous}::EquivalenceClass, clang::ento::RangeSet)' defined but not used [-Wunused-function] ../../git/llvm-project/clang/lib/StaticAnalyzer/Core/RangeConstraintManager.cpp:2384:10: warning: 'clang::ento::RangeSet {anonymous}::RangeConstraintManager::getRange(clang::ento::ProgramStateRef, {anonymous}::EquivalenceClass)' defined but not used [-Wunused-function] Differential Revision: https://reviews.llvm.org/D106063	2021-07-15 16:36:01 +02:00
Balázs Kéri	b0d38ad0bc	[clang][Analyzer] Add symbol uninterestingness to bug report. `PathSensitiveBughReport` has a function to mark a symbol as interesting but it was not possible to clear this flag. This can be useful in some cases, so the functionality is added. Reviewed By: NoQ Differential Revision: https://reviews.llvm.org/D105637	2021-07-15 10:02:18 +02:00
Gabor Marton	bdf31471c7	[Analyzer][solver] Add dump methods for (dis)equality classes. This proved to be very useful during debugging. Differential Revision: https://reviews.llvm.org/D103967	2021-07-14 13:45:02 +02:00
Valeriy Savchenko	60bd8cbc0c	[analyzer][solver][NFC] Refactor how we detect (dis)equalities This patch simplifies the way we deal with (dis)equalities. Due to the symmetry between constraint handler and range inferrer, we can have very similar implementations of logic handling questions about (dis)equality and assumptions involving (dis)equality. It also helps us to remove one more visitor, and removes uncertainty that we got all the right places to put `trackNE` and `trackEQ`. Differential Revision: https://reviews.llvm.org/D105693	2021-07-13 21:00:30 +03:00
Valeriy Savchenko	f26deb4e6b	[analyzer][solver][NFC] Introduce ConstraintAssignor The new component is a symmetric response to SymbolicRangeInferrer. While the latter is the unified component, which answers all the questions what does the solver knows about a particular symbolic expression, assignor associates new constraints (aka "assumes") with symbolic expressions and can imply additional knowledge that the solver can extract and use later on. - Why do we need it and why is SymbolicRangeInferrer not enough? As it is noted before, the inferrer only helps us to get the most precise range information based on the existing knowledge and on the mathematical foundations of different operations that symbolic expressions actually represent. It doesn't introduce new constraints. The assignor, on the other hand, can impose constraints on other symbols using the same domain knowledge. - But for some expressions, SymbolicRangeInferrer looks into constraints for similar expressions, why can't we do that for all the cases? That's correct! But in order to do something like this, we should have a finite number of possible "similar expressions". Let's say we are asked about `$a - $b` and we know something about `$b - $a`. The inferrer can invert this expression and check constraints for `$b - $a`. This is simple! But let's say we are asked about `$a` and we know that `$a * $b != 0`. In this situation, we can imply that `$a != 0`, but the inferrer shouldn't try every possible symbolic expression `X` to check if `$a * X` or `X * $a` is constrained to non-zero. With the assignor mechanism, we can catch this implication right at the moment we associate `$a * $b` with non-zero range, and set similar constraints for `$a` and `$b` as well. Differential Revision: https://reviews.llvm.org/D105692	2021-07-13 21:00:30 +03:00
SharmaRithik	cad9b7f708	[analyzer] Print time taken to analyze each function Summary: This patch is a part of an attempt to obtain more timer data from the analyzer. In this patch, we try to use LLVM::TimeRecord to save time before starting the analysis and to print the time that a specific function takes while getting analyzed. The timer data is printed along with the -analyzer-display-progress outputs. ANALYZE (Syntax): test.c functionName : 0.4 ms ANALYZE (Path, Inline_Regular): test.c functionName : 2.6 ms Authored By: RithikSharma Reviewer: NoQ, xazax.hun, teemperor, vsavchenko Reviewed By: NoQ Differential Revision: https://reviews.llvm.org/D105565	2021-07-13 04:52:47 +00:00
Abbas Sabra	1af97c9d0b	[analyzer] LoopUnrolling: fix crash when a loop counter is captured in a lambda by reference Reviewed By: vsavchenko Differential Revision: https://reviews.llvm.org/D102273	2021-07-12 17:06:07 +03:00
Balazs Benics	d3e14fafc6	[analyzer][NFC] Display the correct function name even in crash dumps The `-analyzer-display-progress` displayed the function name of the currently analyzed function. It differs in C and C++. In C++, it prints the argument types as well in a comma-separated list. While in C, only the function name is displayed, without the brackets. E.g.: C++: foo(), foo(int, float) C: foo In crash traces, the analyzer dumps the location contexts, but the string is not enough for `-analyze-function` in C++ mode. This patch addresses the issue by dumping the proper function names even in stack traces. Reviewed By: NoQ Differential Revision: https://reviews.llvm.org/D105708	2021-07-12 09:06:46 +02:00
David Blaikie	1def2579e1	PR51018: Remove explicit conversions from SmallString to StringRef to future-proof against C++23 C++23 will make these conversions ambiguous - so fix them to make the codebase forward-compatible with C++23 (& a follow-up change I've made will make this ambiguous/invalid even in <C++23 so we don't regress this & it generally improves the code anyway)	2021-07-08 13:37:57 -07:00
Valeriy Savchenko	6017cb31bb	[analyzer][solver] Use all sources of constraints Prior to this patch, we always gave priority to constraints that we actually know about symbols in question. However, these can get outdated and we can get better results if we look at all possible sources of knowledge, including sub-expressions. Differential Revision: https://reviews.llvm.org/D105436	2021-07-06 11:09:08 +03:00
Georgy Komarov	c558b1fca7	[analyzer] Fix calculating offset for fields with an empty type Fix offset calculation routines in padding checker to avoid assertion errors described in bugzilla issue 50426. The fields that are subojbects of zero size, marked with [[no_unique_address]] or empty bitfields will be excluded from padding calculation routines. Reviewed By: NoQ Differential Revision: https://reviews.llvm.org/D104097	2021-07-04 06:57:11 +03:00
Balazs Benics	55662b24a4	[analyzer][NFC] Inline ExprEngine::handleLVectorSplat() It seems like ExprEngine::handleLVectorSplat() was used at only 2 places. It might be better to directly inline them for readability. It seems like these cases were not covered by tests according to my coverage measurement, so I'm adding tests as well, demonstrating that no behavior changed. Besides that, I'm handling CK_MatrixCast similarly to how the rest of the unhandled casts are evaluated. Differential Revision: https://reviews.llvm.org/D105125 Reviewed by: NoQ	2021-07-01 10:54:28 +02:00
Balazs Benics	aa454dda2e	[analyzer] LValueToRValueBitCasts should evaluate to an r-value Previously `LValueToRValueBitCast`s were modeled in the same way how a regular `BitCast` was. However, this should not produce an l-value. Modeling bitcasts accurately is tricky, so it's probably better to model this expression by binding a fresh conjured value. The following code should not result in a diagnostic: ```lang=C++ __attribute__((always_inline)) static inline constexpr unsigned int_castf32_u32(float __A) { return __builtin_bit_cast(unsigned int, __A); // no-warning } ``` Previously, it reported `Address of stack memory associated with local variable '__A' returned to caller [core.StackAddressEscape]`. Differential Revision: https://reviews.llvm.org/D105017 Reviewed by: NoQ, vsavchenko	2021-07-01 10:54:22 +02:00
Balazs Benics	3dae01911b	[analyzer] Make CheckerManager::hasPathSensitiveCheckers() complete again It turns out that the CheckerManager::hasPathSensitiveCheckers() missed checking for the BeginFunctionCheckers. It seems like other callbacks are also missing: - ObjCMessageNilCheckers - BeginFunctionCheckers - NewAllocatorCheckers - PointerEscapeCheckers - EndOfTranslationUnitCheckers In this patch, I wanted to use a fold-expression, but until C++17 arrives we are left with the old-school method. When I tried to write a unittest I observed an interesting behavior. I subscribed only to the BeginFunction event, it was not fired. However, when I also defined the PreCall with an empty handler, suddenly both fired. I could add this test demonstrating the issue, but I don't think it would serve much value in a long run. I don't expect regressions for this. However, I think it would be great to enforce the completeness of this list in a runtime check. I could not come up with a solution for this though. PS: Thank you @Szelethus for helping me debugging this. Differential Revision: https://reviews.llvm.org/D105101 Reviewed by: vsavchenko	2021-06-29 16:35:07 +02:00
Valeriy Savchenko	159024ce23	[analyzer] Implement getType for SVal This commit adds a function to the top-class of SVal hierarchy to provide type information about the value. That can be extremely useful when this is the only piece of information that the user is actually caring about. Additionally, this commit introduces a testing framework for writing unit-tests for symbolic values. Differential Revision: https://reviews.llvm.org/D104550	2021-06-29 12:11:19 +03:00
Nico Weber	d5402a2fee	Revert "[Analyzer][solver] Add dump methods for (dis)equality classes." This reverts commit `6f3b775c3e`. Test fails flakily, see comments on https://reviews.llvm.org/D103967 Also revert follow-up "[Analyzer] Attempt to fix windows bots test failure b/c of new-line" This reverts commit `fe0e861a4d`.	2021-06-28 11:32:57 -04:00
Valeriy Savchenko	8474bb13c3	[analyzer][solver][NFC] Simplify function signatures Since RangeSet::Factory actually contains BasicValueFactory, we can remove value factory from many function signatures inside the solver. Differential Revision: https://reviews.llvm.org/D105005	2021-06-28 14:20:06 +03:00
Gabor Marton	6f3b775c3e	[Analyzer][solver] Add dump methods for (dis)equality classes. This proved to be very useful during debugging. Differential Revision: https://reviews.llvm.org/D103967	2021-06-28 12:57:14 +02:00
Valeriy Savchenko	d646157146	[analyzer] Fix assertion failure on code with transparent unions rdar://76948312 Differential Revision: https://reviews.llvm.org/D104716	2021-06-25 23:09:16 +03:00
Gabor Marton	0646e36254	[Analyzer][solver] Fix crashes during symbol simplification Consider the code ``` void f(int a0, int b0, int c) { int a1 = a0 - b0; int b1 = (unsigned)a1 + c; if (c == 0) { int d = 7L / b1; } } ``` At the point of divisiion by `b1` that is considered to be non-zero, which results in a new constraint for `$a0 - $b0 + $c`. The type of this sym is unsigned, however, the simplified sym is `$a0 - $b0` and its type is signed. This is probably the result of the inherent improper handling of casts. Anyway, Range assignment for constraints use this type information. Therefore, we must make sure that first we simplify the symbol and only then we assign the range. Differential Revision: https://reviews.llvm.org/D104844	2021-06-25 11:49:26 +02:00
Martin Storsjö	e5c7c171e5	[clang] Rename StringRef _lower() method calls to _insensitive() This is mostly a mechanical change, but a testcase that contains parts of the StringRef class (clang/test/Analysis/llvm-conventions.cpp) isn't touched.	2021-06-25 00:22:01 +03:00
Balázs Kéri	d7227a5bc7	[clang][Analyzer] Track null stream argument in alpha.unix.Stream . The checker contains check for passing a NULL stream argument. This change should make more easy to identify where the passed pointer becomes NULL. Reviewed By: NoQ Differential Revision: https://reviews.llvm.org/D104640	2021-06-22 11:16:56 +02:00
Tomasz Kamiński	cc2ef19556	[analyzer] Handle NTTP invocation in CallContext.getCalleeDecl() This fixes a crash in MallocChecker for the situation when operator new (delete) is invoked via NTTP and makes the behavior of CallContext.getCalleeDecl(Expr) identical to CallEvent.getDecl(). Reviewed By: vsavchenko Differential Revision: https://reviews.llvm.org/D103025	2021-06-18 16:32:19 +03:00
Kirstóf Umann	9cca5c1391	[analyzer] Make checker silencing work for non-pathsensitive bug reports D66572 separated BugReport and BugReporter into basic and path sensitive versions. As a result, checker silencing, which worked deep in the path sensitive report generation facilities became specific to it. DeadStoresChecker, for instance, despite being in the static analyzer, emits non-pathsensitive reports, and was impossible to silence. This patch moves the corresponding code before the call to the virtual function generateDiagnosticForConsumerMap (which is overriden by the specific kinds of bug reporters). Although we see bug reporting as relatively lightweight compared to the analysis, this will get rid of several steps we used to throw away. Quoting from D65379: At a very high level, this consists of 3 steps: For all BugReports in the same BugReportEquivClass, collect all their error nodes in a set. With that set, create a new, trimmed ExplodedGraph whose leafs are all error nodes. Until a valid report is found, construct a bug path, which is yet another ExplodedGraph, that is linear from a given error node to the root of the graph. Run all visitors on the constructed bug path. If in this process the report got invalidated, start over from step 2. Checker silencing used to kick in after all of these. Now it does before any of them :^) Differential Revision: https://reviews.llvm.org/D102914 Change-Id: Ice42939304516f2bebd05a1ea19878b89c96a25d	2021-06-17 10:27:34 +02:00
Valeriy Savchenko	eadd54f274	[analyzer] Decouple NoteTag from its Factory This allows us to create other types of tags that carry useful bits of information alongside. Differential Revision: https://reviews.llvm.org/D104135	2021-06-15 11:58:13 +03:00
Valeriy Savchenko	16f7a952ec	[analyzer] Simplify the process of producing notes for stores Differential Revision: https://reviews.llvm.org/D104046	2021-06-15 11:37:36 +03:00
Valeriy Savchenko	6e6a26b8f0	[analyzer] Extract InlinedFunctionCallHandler Differential Revision: https://reviews.llvm.org/D103961	2021-06-15 11:37:36 +03:00
Valeriy Savchenko	2e490676ea	[analyzer] Extract InterestingLValueHandler Differential Revision: https://reviews.llvm.org/D103917	2021-06-15 11:37:36 +03:00
Valeriy Savchenko	40cb73bd20	[analyzer] Extract ArrayIndexHandler One interesting problem was discovered here. When we do interrupt Tracker's track flow, we want to interrupt only it and not all the other flows recursively. Differential Revision: https://reviews.llvm.org/D103914	2021-06-15 11:37:36 +03:00
Valeriy Savchenko	1639dcb279	[analyzer] Extract NilReceiverHandler Differential Revision: https://reviews.llvm.org/D103902	2021-06-15 11:37:36 +03:00
Valeriy Savchenko	85f475c979	[analyzer] Extract ControlDependencyHandler Differential Revision: https://reviews.llvm.org/D103677	2021-06-15 11:37:36 +03:00
Valeriy Savchenko	bbebf38b73	[analyzer] Refactor StoreSiteFinder and extract DefaultStoreHandler After this patch, custom StoreHandlers will also work as expected. Differential Revision: https://reviews.llvm.org/D103644	2021-06-15 11:37:35 +03:00
Gabor Marton	8ddbb442b6	[Analyzer][solver] Simplify existing eq classes and constraints when a new constraint is added Update `setConstraint` to simplify existing equivalence classes when a new constraint is added. In this patch we iterate over all existing equivalence classes and constraints and try to simplfy them with simplifySVal. This solves problematic cases where we have two symbols in the tree, e.g.: ``` int test_rhs_further_constrained(int x, int y) { if (x + y != 0) return 0; if (y != 0) return 0; clang_analyzer_eval(x + y == 0); // expected-warning{{TRUE}} clang_analyzer_eval(y == 0); // expected-warning{{TRUE}} return 0; } ``` Differential Revision: https://reviews.llvm.org/D103314	2021-06-14 12:19:09 +02:00
Simon Pilgrim	61cdaf66fe	[ADT] Remove APInt/APSInt toString() std::string variants <string> is currently the highest impact header in a clang+llvm build: https://commondatastorage.googleapis.com/chromium-browser-clang/llvm-include-analysis.html One of the most common places this is being included is the APInt.h header, which needs it for an old toString() implementation that returns std::string - an inefficient method compared to the SmallString versions that it actually wraps. This patch replaces these APInt/APSInt methods with a pair of llvm::toString() helpers inside StringExtras.h, adjusts users accordingly and removes the <string> from APInt.h - I was hoping that more of these users could be converted to use the SmallString methods, but it appears that most end up creating a std::string anyhow. I avoided trying to use the raw_ostream << operators as well as I didn't want to lose having the integer radix explicit in the code. Differential Revision: https://reviews.llvm.org/D103888	2021-06-11 13:19:15 +01:00
Valeriy Savchenko	57006d2f6d	[analyzer] Refactor trackExpressionValue to accept TrackingOptions Differential Revision: https://reviews.llvm.org/D103633	2021-06-11 12:49:04 +03:00
Valeriy Savchenko	51d4704d5e	[analyzer] Turn TrackControlDependencyCond into a tracking visitor Differential Revision: https://reviews.llvm.org/D103631	2021-06-11 12:49:04 +03:00
Valeriy Savchenko	3fc8d943c3	[analyzer] Refactor trackRValueExpression into ExpressionHandler Differential Revision: https://reviews.llvm.org/D103630	2021-06-11 12:49:04 +03:00
Valeriy Savchenko	f853d2601a	[analyzer] Turn ReturnVisitor into a tracking visitor Whenever Tracker spawns a visitor that needs to call tracker back, we have to use TrackingBugReporterVisitor in order to maintain all the hooks that the checker might've used. Differential Revision: https://reviews.llvm.org/D103628	2021-06-11 12:49:03 +03:00
Valeriy Savchenko	87a5c4d374	[analyzer] Hide and rename FindLastStoreBRVisitor This component should not be used directly at this point and it is simply an implementation detail, that's why StoreSiteFinder is out of the header file. Differential Revision: https://reviews.llvm.org/D103624	2021-06-11 12:49:03 +03:00
Valeriy Savchenko	b6bcf95322	[analyzer] Change FindLastStoreBRVisitor to use Tracker Additionally, this commit completely removes any uses of FindLastStoreBRVisitor from the analyzer except for the one in Tracker. The next step is actually removing this class altogether from the header file. Differential Revision: https://reviews.llvm.org/D103618	2021-06-11 12:49:03 +03:00
Valeriy Savchenko	967c06b3e9	[analyzer] Reimplement trackExpressionValue as ExpressionHandler This commit moves trackExpressionValue into the Tracker interface as DefaultExpressionHandler. It still can be split into smaller handlers, but that can be a future change. Additionally, this commit doesn't remove the original trackExpressionValue interface, so it's not too big. One of the next commits will address it. Differential Revision: https://reviews.llvm.org/D103616	2021-06-11 12:49:03 +03:00
Valeriy Savchenko	0cc3100bf8	[analyzer] Introduce a new interface for tracking Tracking values through expressions and the stores is fundamental for producing clear diagnostics. However, the main components participating in this process, namely `trackExpressionValue` and `FindLastStoreBRVisitor`, became pretty bloated. They have an interesting dynamic between them (and some other visitors) that one might call a "chain reaction". `trackExpressionValue` adds `FindLastStoreBRVisitor`, and the latter calls `trackExpressionValue`. Because of this design, individual checkers couldn't affect what's going to happen somewhere in the middle of that chain. Whether they want to produce a more informative note or keep the overall tracking going by utilizing some of the domain expertise. This all lead to two biggest problems that I see: * Some checkers don't use it This should probably never be the case for path-sensitive checks. * Some checkers incorporated their logic directly into those components This doesn't make the maintenance easier, breaks multiple architecture principles, and makes the code harder to read adn understand, thus, increasing the probability of the first case. This commit introduces a prototype for a new interface that will be responsible for tracking. My main idea here was to make operations that I want have as a checker developer easy to implement and hook directly into the tracking process. Differential Revision: https://reviews.llvm.org/D103605	2021-06-11 12:49:03 +03:00
Michael Kruse	a22236120f	[OpenMP] Implement '#pragma omp unroll'. Implementation of the unroll directive introduced in OpenMP 5.1. Follows the approach from D76342 for the tile directive (i.e. AST-based, not using the OpenMPIRBuilder). Tries to use `llvm.loop.unroll.*` metadata where possible, but has to fall back to an AST representation of the outer loop if the partially unrolled generated loop is associated with another directive (because it needs to compute the number of iterations). Reviewed By: ABataev Differential Revision: https://reviews.llvm.org/D99459	2021-06-10 14:30:17 -05:00
Matheus Izvekov	aef5d8fdc7	[clang] NFC: Rename rvalue to prvalue This renames the expression value categories from rvalue to prvalue, keeping nomenclature consistent with C++11 onwards. C++ has the most complicated taxonomy here, and every other language only uses a subset of it, so it's less confusing to use the C++ names consistently, and mentally remap to the C names when working on that context (prvalue -> rvalue, no xvalues, etc). Renames: * VK_RValue -> VK_PRValue * Expr::isRValue -> Expr::isPRValue * SK_QualificationConversionRValue -> SK_QualificationConversionPRValue * JSON AST Dumper Expression nodes value category: "rvalue" -> "prvalue" Signed-off-by: Matheus Izvekov <mizvekov@gmail.com> Reviewed By: rsmith Differential Revision: https://reviews.llvm.org/D103720	2021-06-09 12:27:10 +02:00
Denys Petrov	d3a6181e82	[analyzer] [NFC] Implement a wrapper SValBuilder::getCastedMemRegionVal for similar functionality on region cast Summary: Replaced code on region cast with a function-wrapper SValBuilder::getCastedMemRegionVal. This is a next step of code refining due to suggestions in D103319. Differential Revision: https://reviews.llvm.org/D103803	2021-06-08 10:43:43 +03:00
Valeriy Savchenko	92d03c20ea	[analyzer] Add forwarding `addVisitor` method The majority of all `addVisitor` callers follow the same pattern: addVisitor(std::make_unique<SomeVisitor>(arg1, arg2, ...)); This patches introduces additional overload for `addVisitor` to simplify that pattern: addVisitor<SomeVisitor>(arg1, arg2, ...); Differential Revision: https://reviews.llvm.org/D103457	2021-06-03 17:10:16 +03:00
Xuanda Yang	620cef9120	[analyzer] MallocSizeof: sizeof pointer type is compatible with void* source: https://bugs.llvm.org/show_bug.cgi?id=50214 Make sizeof pointer type compatible with void* in MallocSizeofChecker. Reviewed By: NoQ Differential Revision: https://reviews.llvm.org/D103358	2021-05-30 09:51:41 +08:00
Denys Petrov	fae3534b30	[analyzer] Use Optional as a return type of StoreManager::castRegion Summary: Make StoreManager::castRegion function usage safier. Replace `const MemRegion ` with `Optional<const MemRegion >`. Simplified one of related test cases due to suggestions in D101635. Differential Revision: https://reviews.llvm.org/D103319	2021-05-29 15:16:56 +03:00
Erich Keane	eba69b59d1	Reimplement __builtin_unique_stable_name- The original version of this was reverted, and @rjmcall provided some advice to architect a new solution. This is that solution. This implements a builtin to provide a unique name that is stable across compilations of this TU for the purposes of implementing the library component of the unnamed kernel feature of SYCL. It does this by running the Itanium mangler with a few modifications. Because it is somewhat common to wrap non-kernel-related lambdas in macros that aren't present on the device (such as for logging), this uniquely generates an ID for all lambdas involved in the naming of a kernel. It uses the lambda-mangling number to do this, except replaces this with its own number (starting at 10000 for readabililty reasons) for lambdas used to name a kernel. Additionally, this implements itself as constexpr with a slight catch: if a name would be invalidated by the use of this lambda in a later kernel invocation, it is diagnosed as an error (see the Sema tests). Differential Revision: https://reviews.llvm.org/D103112	2021-05-27 07:12:20 -07:00
Kristóf Umann	479ea2a8ed	[analyzer] Check the checker name, rather than the ProgramPointTag when silencing a checker The program point created by the checker, even if it is an error node, might not be the same as the name under which the report is emitted. Make sure we're checking the name of the checker, because thats what we're silencing after all. Differential Revision: https://reviews.llvm.org/D102683	2021-05-19 12:40:09 +02:00
Abbas Sabra	ebcf030efc	[analyzer] Engine: fix crash with SEH __leave keyword MSVC has a `try-except` statement. This statement could containt a `__leave` keyword, which is similar to `goto` to the end of the try block. The semantic of this keyword is not implemented. We should at least parse such code without crashing. https://docs.microsoft.com/en-us/cpp/cpp/try-except-statement?view=msvc-160 Patch By: AbbasSabra! Reviewed By: steakhal Differential Revision: https://reviews.llvm.org/D102280	2021-05-17 20:10:26 +02:00
Valeriy Savchenko	45212dec01	[analyzer][solver] Prevent use of a null state rdar://77686137 Differential Revision: https://reviews.llvm.org/D102240	2021-05-13 20:16:29 +03:00
Vince Bridgers	a27af1d816	[analyzer] Fix assertion in SVals.h Fix assertion in SVals.h apparently caused by https://reviews.llvm.org/D89055. clang:clang/include/clang/StaticAnalyzer/Core/PathSensitive/SVals.h:596: clang::ento::loc::MemRegionVal::MemRegionVal(const clang::ento::MemRegion *): Assertion `r' failed. Backtrace: ... clang/include/clang/StaticAnalyzer/Core/PathSensitive/SVals.h:597:3 clang::QualType, clang::QualType) clang/lib/StaticAnalyzer/Core/SValBuilder.cpp:773:18 clang::QualType, clang::QualType) clang/lib/StaticAnalyzer/Core/SValBuilder.cpp:612:12 clang::QualType) clang/lib/StaticAnalyzer/Core/SValBuilder.cpp:587:12 namespace)::RegionBindingsRef const&, clang::ento::Loc, clang::QualType) clang/lib/StaticAnalyzer/Core/RegionStore.cpp:1510:24 ... Reviewed By: ASDenysPetrov Differential Revision: https://reviews.llvm.org/D101635	2021-04-30 11:00:43 -05:00
Denys Petrov	b30521c28a	[analyzer] Wrong type cast occurs during pointer dereferencing after type punning Summary: During pointer dereferencing CastRetrievedVal uses wrong type from the Store after type punning. Namely, the pointer casts to another type and then assigns with a value of one more another type. It produces NonLoc value when Loc is expected. Differential Revision: https://reviews.llvm.org/D89055 Fixes: https://bugs.llvm.org/show_bug.cgi?id=37503 https://bugs.llvm.org/show_bug.cgi?id=49007	2021-04-29 01:03:38 +03:00
Valeriy Savchenko	ab5823867c	[analyzer] Find better description for tracked symbolic values When searching for stores and creating corresponding notes, the analyzer is more specific about the target region of the store as opposed to the stored value. While this description was tweaked for constant and undefined values, it lacked in the most general case of symbolic values. This patch tries to find a memory region, where this value is stored, to use it as a better alias for the value. rdar://76645710 Differential Revision: https://reviews.llvm.org/D101041	2021-04-28 18:37:38 +03:00
Valeriy Savchenko	e273918038	[analyzer] Track leaking object through stores Since we can report memory leaks on one variable, while the originally allocated object was stored into another one, we should explain how did it get there. rdar://76645710 Differential Revision: https://reviews.llvm.org/D100852	2021-04-28 18:37:38 +03:00
Valeriy Savchenko	61ae2db2d7	[analyzer] Adjust the reported variable name in retain count checker When reporting leaks, we try to attach the leaking object to some variable, so it's easier to understand. Before the patch, we always tried to use the first variable that stored the object in question. This can get very confusing for the user, if that variable doesn't contain that object at the moment of the actual leak. In many cases, the warning is dismissed as false positive and it is effectively a false positive when we fail to properly explain the warning to the user. This patch addresses the bigest issue in cases like this. Now we check if the variable still contains the leaking symbolic object. If not, we look for the last variable to actually hold it and use that variable instead. rdar://76645710 Differential Revision: https://reviews.llvm.org/D100839	2021-04-28 18:37:37 +03:00
Valeriy Savchenko	1dad8c5036	[analyzer][NFC] Remove duplicated work from retain count leak report Allocation site is the key location for the leak checker. It is a uniqueing location for the report and a source of information for the warning's message. Before this patch, we calculated and used it twice in bug report and in bug report visitor. Such duplication is not only harmful performance-wise (not much, but still), but also design-wise. Because changing something about the end piece of the report should've been repeated for description as well. Differential Revision: https://reviews.llvm.org/D100626	2021-04-28 18:37:37 +03:00
Gabor Marton	4b99f9c7db	[analyzer][StdLibraryFunctionsChecker] Track dependent arguments When we report an argument constraint violation, we should track those other arguments that participate in the evaluation of the violation. By default, we depend only on the argument that is constrained, however, there are some special cases like the buffer size constraint that might be encoded in another argument(s). Differential Revision: https://reviews.llvm.org/D101358	2021-04-27 15:35:58 +02:00
Gabor Marton	a7cb951fa4	[Analyzer][StdLibraryFunctionsChecker] Describe arg constraints In this patch, I provide a detailed explanation for each argument constraint. This explanation is added in an extra 'note' tag, which is displayed alongside the warning. Since these new notes describe clearly the constraint, there is no need to provide the number of the argument (e.g. 'Arg3') within the warning. However, I decided to keep the name of the constraint in the warning (but this could be a subject of discussion) in order to be able to identify the different kind of constraint violations easily in a bug database (e.g. CodeChecker). Differential Revision: https://reviews.llvm.org/D101060	2021-04-23 17:27:54 +02:00
Denys Petrov	01ddfa95bd	[analyzer] [NFC] Eliminate dispatchCast, evalCastFromNonLoc and evalCastFromLoc functions from SValBuilder Summary: Remove dispatchCast, evalCastFromNonLoc and evalCastFromLoc functions since their functionality has been moved to common evalCast function. Use evalCast instead. Post-clean up patch for https://reviews.llvm.org/D96090 patch. The patch shall not change any behavior. Differential Revision: https://reviews.llvm.org/D97277	2021-04-13 18:56:04 +03:00
Denys Petrov	7736b08c28	[analyzer] Replace StoreManager::CastRetrievedVal with SValBuilder::evalCast Summary: Move logic from CastRetrievedVal to evalCast and replace CastRetrievedVal with evalCast. Also move guts from SimpleSValBuilder::dispatchCast inside evalCast. evalCast intends to substitute dispatchCast, evalCastFromNonLoc and evalCastFromLoc in the future. OriginalTy provides additional information for casting, which is useful for some cases and useless for others. If `OriginalTy.isNull()` is true, then cast performs based on CastTy only. Now evalCast operates in two ways. It retains all previous behavior and take over dispatchCast behavior. dispatchCast, evalCastFromNonLoc and evalCastFromLoc is considered as buggy since it doesn't take into account OriginalTy of the SVal and should be improved. From this patch use evalCast instead of dispatchCast, evalCastFromNonLoc and evalCastFromLoc functions. dispatchCast redirects to evalCast. This patch shall not change any behavior. Differential Revision: https://reviews.llvm.org/D96090	2021-04-13 18:10:06 +03:00
Saurabh Jha	71ab6c98a0	[Matrix] Implement C-style explicit type conversions for matrix types. This implements C-style type conversions for matrix types, as specified in clang/docs/MatrixTypes.rst. Fixes PR47141. Reviewed By: fhahn Differential Revision: https://reviews.llvm.org/D99037	2021-04-10 11:48:41 +01:00
cchen	1a43fd2769	[OpenMP51] Initial support for masked directive and filter clause Adds basic parsing/sema/serialization support for the #pragma omp masked directive. Reviewed By: ABataev Differential Revision: https://reviews.llvm.org/D99995	2021-04-09 14:00:36 -05:00
Valeriy Savchenko	663ac91ed1	[analyzer] Fix false positives in inner pointer checker (PR49628) This patch supports std::data and std::addressof functions. rdar://73463300 Differential Revision: https://reviews.llvm.org/D99260	2021-04-08 20:30:12 +03:00
Valeriy Savchenko	4b958dd6bc	[analyzer] Fix crash on spaceship operator (PR47511) rdar://68954187 Differential Revision: https://reviews.llvm.org/D99181	2021-04-08 20:28:05 +03:00
Valeriy Savchenko	9f0d8bac14	[analyzer] Fix dead store checker false positive It is common to zero-initialize not only scalar variables, but also structs. This is also defensive programming and we shouldn't complain about that. rdar://34122265 Differential Revision: https://reviews.llvm.org/D99262	2021-04-08 16:12:42 +03:00
Abhina Sreeskantharajan	82b3e28e83	[SystemZ][z/OS][Windows] Add new OF_TextWithCRLF flag and use this flag instead of OF_Text Problem: On SystemZ we need to open text files in text mode. On Windows, files opened in text mode adds a CRLF '\r\n' which may not be desirable. Solution: This patch adds two new flags - OF_CRLF which indicates that CRLF translation is used. - OF_TextWithCRLF = OF_Text \| OF_CRLF indicates that the file is text and uses CRLF translation. Developers should now use either the OF_Text or OF_TextWithCRLF for text files and OF_None for binary files. If the developer doesn't want carriage returns on Windows, they should use OF_Text, if they do want carriage returns on Windows, they should use OF_TextWithCRLF. So this is the behaviour per platform with my patch: z/OS: OF_None: open in binary mode OF_Text : open in text mode OF_TextWithCRLF: open in text mode Windows: OF_None: open file with no carriage return OF_Text: open file with no carriage return OF_TextWithCRLF: open file with carriage return The Major change is in llvm/lib/Support/Windows/Path.inc to only set text mode if the OF_CRLF is set. ``` if (Flags & OF_CRLF) CrtOpenFlags \|= _O_TEXT; ``` These following files are the ones that still use OF_Text which I left unchanged. I modified all these except raw_ostream.cpp in recent patches so I know these were previously in Binary mode on Windows. ./llvm/lib/Support/raw_ostream.cpp ./llvm/lib/TableGen/Main.cpp ./llvm/tools/dsymutil/DwarfLinkerForBinary.cpp ./llvm/unittests/Support/Path.cpp ./clang/lib/StaticAnalyzer/Core/HTMLDiagnostics.cpp ./clang/lib/Frontend/CompilerInstance.cpp ./clang/lib/Driver/Driver.cpp ./clang/lib/Driver/ToolChains/Clang.cpp Reviewed By: MaskRay Differential Revision: https://reviews.llvm.org/D99426	2021-04-06 07:23:31 -04:00
Balázs Kéri	bee4813789	[clang][Checkers] Fix PthreadLockChecker state cleanup at dead symbol. It is possible that an entry in 'DestroyRetVal' lives longer than an entry in 'LockMap' if not removed at checkDeadSymbols. The added test case demonstrates this. Reviewed By: NoQ Differential Revision: https://reviews.llvm.org/D98504	2021-04-06 11:15:29 +02:00
Charusso	9b3df78b4c	[analyzer] DynamicSize: Rename 'size' to 'extent'	2021-04-05 19:20:43 +02:00
Charusso	89d210fe1a	[analyzer] DynamicSize: Debug facility This patch adds two debug functions to ExprInspectionChecker to dump out the dynamic extent and element count of symbolic values: dumpExtent(), dumpElementCount().	2021-04-05 19:17:52 +02:00
Charusso	df64f471d1	[analyzer] DynamicSize: Store the dynamic size This patch introduces a way to store the size. Reviewed By: NoQ Differential Revision: https://reviews.llvm.org/D69726	2021-04-05 19:04:53 +02:00
Balázs Kéri	df4fa53fdd	[clang][Checkers] Extend PthreadLockChecker state dump (NFC). Add printing of map 'DestroyRetVal'. Reviewed By: steakhal Differential Revision: https://reviews.llvm.org/D98502	2021-04-01 11:59:00 +02:00
Balázs Kéri	ffcb4b43b7	Revert "[clang][Checkers] Extend PthreadLockChecker state dump (NFC)." This reverts commit `49c0ab6d76`. Test failures showed up because non-deterministic output.	2021-03-31 15:28:53 +02:00
Balázs Kéri	49c0ab6d76	[clang][Checkers] Extend PthreadLockChecker state dump (NFC). Add printing of map 'DestroyRetVal'. Reviewed By: steakhal Differential Revision: https://reviews.llvm.org/D98502	2021-03-31 11:19:42 +02:00
Mike Rice	b7899ba0e8	[OPENMP51]Initial support for the dispatch directive. Added basic parsing/sema/serialization support for dispatch directive. Differential Revision: https://reviews.llvm.org/D99537	2021-03-30 14:12:53 -07:00
Valeriy Savchenko	90377308de	[analyzer] Support allocClassWithName in OSObjectCStyleCast checker `allocClassWithName` allocates an object with the given type. The type is actually provided as a string argument (type's name). This creates a possibility for not particularly useful warnings from the analyzer. In order to combat with those, this patch checks for casts of the `allocClassWithName` results to types mentioned directly as its argument. All other uses of this method should be reasoned about as before. rdar://72165694 Differential Revision: https://reviews.llvm.org/D99500	2021-03-30 15:58:06 +03:00
Gabor Marton	efa7df1682	[Analyzer] Track RValue expressions It makes sense to track rvalue expressions in the case of special concrete integer values. The most notable special value is zero (later we may find other values). By tracking the origin of 0, we can provide a better explanation for users e.g. in case of division by 0 warnings. When the divisor is a product of a multiplication then now we can show which operand (or both) was (were) zero and why. Differential Revision: https://reviews.llvm.org/D99344	2021-03-30 14:48:38 +02:00
Gabor Marton	015c39882e	[Analyzer] Infer 0 value when the divisible is 0 (bug fix) Currently, we infer 0 if the divisible of the modulo op is 0: int a = x < 0; // a can be 0 int b = a % y; // b is either 1 % sym or 0 However, we don't when the op is / : int a = x < 0; // a can be 0 int b = a / y; // b is either 1 / sym or 0 / sym This commit fixes the discrepancy. Differential Revision: https://reviews.llvm.org/D99343	2021-03-25 18:25:06 +01:00
Gabor Marton	f8a850ccf4	[Analyzer][NFC] Fix typos in comments	2021-03-24 11:46:10 +01:00
Valeriy Savchenko	02b51e5316	[analyzer][solver] Redesign constraint ranges data structure ImmutableSet doesn't seem like the perfect fit for the RangeSet data structure. It is good for saving memory in a persistent setting, but not for the case when the population of the container is tiny. This commit replaces RangeSet implementation and redesigns the most common operations to be more efficient. Differential Revision: https://reviews.llvm.org/D86465	2021-03-22 13:52:35 +03:00
Valeriy Savchenko	3085bda2b3	[analyzer][solver] Fix infeasible constraints (PR49642) Additionally, this patch puts an assertion checking for feasible constraints in every place where constraints are assigned to states. Differential Revision: https://reviews.llvm.org/D98948	2021-03-22 11:02:02 +03:00
Abhina Sreeskantharajan	4f750f6ebc	[SystemZ][z/OS] Distinguish between text and binary files on z/OS This patch consists of the initial changes to help distinguish between text and binary content correctly on z/OS. I would like to get feedback from Windows users on setting OF_None for all ToolOutputFiles. This seems to have been done as an optimization to prevent CRLF translation on Windows in the past. Reviewed By: zibi Differential Revision: https://reviews.llvm.org/D97785	2021-03-19 08:09:57 -04:00
Artem Dergachev	c75b2261a0	[analyzer] Introduce common bug category "Unused code". This category is generic enough to hold a variety of checkers. Currently it contains the Dead Stores checker and an alpha unreachable code checker. Differential Revision: https://reviews.llvm.org/D98741	2021-03-17 20:58:27 -07:00
Mike Rice	410f09af09	[OPENMP51]Initial support for the interop directive. Added basic parsing/sema/serialization support for interop directive. Support for the 'init' clause. Differential Revision: https://reviews.llvm.org/D98558	2021-03-17 09:42:07 -07:00
Vassil Vassilev	0cb7e7ca0c	Make iteration over the DeclContext::lookup_result safe. The idiom: ``` DeclContext::lookup_result R = DeclContext::lookup(Name); for (auto D : R) {...} ``` is not safe when in the loop body we trigger deserialization from an AST file. The deserialization can insert new declarations in the StoredDeclsList whose underlying type is a vector. When the vector decides to reallocate its storage the pointer we hold becomes invalid. This patch replaces a SmallVector with an singly-linked list. The current approach stores a SmallVector<NamedDecl, 4> which is around 8 pointers. The linked list is 3, 5, or 7. We do better in terms of memory usage for small cases (and worse in terms of locality -- the linked list entries won't be near each other, but will be near their corresponding declarations, and we were going to fetch those memory pages anyway). For larger cases: the vector uses a doubling strategy for reallocation, so will generally be between half-full and full. Let's say it's 75% full on average, so there's N * 4/3 + 4 pointers' worth of space allocated currently and will be 2N pointers with the linked list. So we break even when there are N=6 entries and slightly lose in terms of memory usage after that. We suspect that's still a win on average. Thanks to @rsmith! Differential revision: https://reviews.llvm.org/D91524	2021-03-17 08:59:04 +00:00
Aaron Puchert	1cb15b10ea	Correct Doxygen syntax for inline code There is no syntax like {@code ...} in Doxygen, @code is a block command that ends with @endcode, and generally these are not enclosed in braces. The correct syntax for inline code snippets is @c <code>. Reviewed By: aaron.ballman Differential Revision: https://reviews.llvm.org/D98665	2021-03-16 15:17:45 +01:00
Valeriy Savchenko	6dc1523508	[analyzer][solver] Prevent infeasible states (PR49490) This patch fixes the situation when our knowledge of disequalities can help us figuring out that some assumption is infeasible, but the solver still produces a state with inconsistent constraints. Additionally, this patch adds a couple of assertions to catch this type of problems easier. Differential Revision: https://reviews.llvm.org/D98341	2021-03-12 15:56:48 +03:00
Adam Balogh	bcc662484a	[analyzer] Crash fix for alpha.cplusplus.IteratorRange If the non-iterator side of an iterator operation `+`, `+=`, `-` or `-=` is `UndefinedVal` an assertions happens. This small fix prevents this. Patch by Adam Balogh. Reviewed By: NoQ Differential Revision: https://reviews.llvm.org/D85424	2021-03-10 12:42:24 +01:00
Valeriy Savchenko	c7635040ce	[analyzer] Fix StdLibraryFunctionsChecker performance issue `initFunctionSummaries` lazily initializes a data structure with function summaries for standard library functions. It is called for every pre-, post-, and eval-call events, i.e. 3 times for each call on the path. If the initialization doesn't find any standard library functions in the translation unit, it will get re-tried (with the same effect) many times even for small translation units. For projects not using standard libraries, the speed-up can reach 50% after this patch. Differential Revision: https://reviews.llvm.org/D98244	2021-03-10 10:44:04 +03:00
Michael Kruse	bc172e532a	[clang][StaticAnalyzer] Compilation fix. An enum was unhandled after landing of D94973. Add the new OMPCanonicalLoopClass to the list of unhandled cases.	2021-03-04 23:23:58 -06:00
Balazs Benics	38b185832e	[analyzer][CTU] API for CTU macro expansions Removes `CrossTranslationUnitContext::getImportedFromSourceLocation` Removes the corresponding unit-test segment. Introduces the `CrossTranslationUnitContext::getMacroExpansionContextForSourceLocation` which will return the macro expansion context for an imported TU. Also adds a few implementation FIXME notes where applicable, since this feature is not implemented yet. This fact is also noted as Doxygen comments. Uplifts a few CTU LIT test to match the current incomplete behavior. It is a regression to some extent since now we don't expand any macros in imported TUs. At least we don't crash anymore. Note that the introduced function is already covered by LIT tests. Eg.: Analysis/plist-macros-with-expansion-ctu.c Reviewed By: balazske, Szelethus Differential Revision: https://reviews.llvm.org/D94673	2021-02-22 11:12:22 +01:00
Balazs Benics	170c67d5b8	[analyzer] Use the MacroExpansionContext for macro expansions in plists Removes the obsolete ad-hoc macro expansions during bugreport constructions. It will skip the macro expansion if the expansion happened in an imported TU. Also removes the expected plist file, while expanding matching context for the tests. Adds a previously crashing `plist-macros-with-expansion.c` testfile. Temporarily marks `plist-macros-with-expansion-ctu.c ` to `XFAIL`. Reviewed By: xazax.hun, Szelethus Differential Revision: https://reviews.llvm.org/D93224	2021-02-22 11:12:18 +01:00
Balazs Benics	7c58fb6ba0	[analyzer] Create MacroExpansionContext member in AnalysisConsumer Adds a `MacroExpansionContext` member to the `AnalysisConsumer` class. Tracks macro expansions only if the `ShouldDisplayMacroExpansions` is set. Passes a reference down the pipeline letting AnalysisConsumers query macro expansions during bugreport construction. Reviewed By: martong, Szelethus Differential Revision: https://reviews.llvm.org/D93223	2021-02-22 11:12:14 +01:00
Michael Kruse	6c05005238	[OpenMP] Implement '#pragma omp tile', by Michael Kruse (@Meinersbur). The tile directive is in OpenMP's Technical Report 8 and foreseeably will be part of the upcoming OpenMP 5.1 standard. This implementation is based on an AST transformation providing a de-sugared loop nest. This makes it simple to forward the de-sugared transformation to loop associated directives taking the tiled loops. In contrast to other loop associated directives, the OMPTileDirective does not use CapturedStmts. Letting loop associated directives consume loops from different capture context would be difficult. A significant amount of code generation logic is taking place in the Sema class. Eventually, I would prefer if these would move into the CodeGen component such that we could make use of the OpenMPIRBuilder, together with flang. Only expressions converting between the language's iteration variable and the logical iteration space need to take place in the semantic analyzer: Getting the of iterations (e.g. the overload resolution of `std::distance`) and converting the logical iteration number to the iteration variable (e.g. overload resolution of `iteration + .omp.iv`). In clang, only CXXForRangeStmt is also represented by its de-sugared components. However, OpenMP loop are not defined as syntatic sugar. Starting with an AST-based approach allows us to gradually move generated AST statements into CodeGen, instead all at once. I would also like to refactor `checkOpenMPLoop` into its functionalities in a follow-up. In this patch it is used twice. Once for checking proper nesting and emitting diagnostics, and additionally for deriving the logical iteration space per-loop (instead of for the loop nest). Differential Revision: https://reviews.llvm.org/D76342	2021-02-16 09:45:07 -08:00
Kazu Hirata	1a323c8a96	[analyzer] Fix a warning This patch fixes a warning from -Wcovered-switch-default. The switch statement in question handles all the enum values.	2021-02-16 09:12:07 -08:00
Denys Petrov	13f4448ae7	[analyzer] Rework SValBuilder::evalCast function into maintainable and clear way Summary: Refactor SValBuilder::evalCast function. Make the function clear and get rid of redundant and repetitive code. Unite SValBuilder::evalCast, SimpleSValBuilder::dispatchCast, SimpleSValBuilder::evalCastFromNonLoc and SimpleSValBuilder::evalCastFromLoc functions into single SValBuilder::evalCast. This patch shall not change any previous behavior. Differential Revision: https://reviews.llvm.org/D90157	2021-02-16 14:30:51 +02:00
Deep Majumder	21daada950	[analyzer] Fix static_cast on pointer-to-member handling This commit fixes bug #48739. The bug was caused by the way static_casts on pointer-to-member caused the CXXBaseSpecifier list of a MemberToPointer to grow instead of shrink. The list is now grown by implicit casts and corresponding entries are removed by static_casts. No-op static_casts cause no effect. Reviewed By: vsavchenko Differential Revision: https://reviews.llvm.org/D95877	2021-02-15 11:44:37 +03:00
Daniel Hwang	2407eb08a5	[analyzer] Update static analyzer to be support sarif-html Updates static analyzer to be able to generate both sarif and html output in a single run similar to plist-html. Differential Revision: https://reviews.llvm.org/D96389	2021-02-10 18:34:53 -08:00
Artem Dergachev	3e206a5922	[analyzer] NFC: Introduce reusable bug category for "C++ move semantics". Currently only used by MoveChecker but ideally all checkers should have reusable categories.	2021-01-27 03:39:18 -08:00
Simon Pilgrim	879c12d95a	Fix null dereference static analysis warning. NFCI. Replace cast_or_null<> with cast<> as we immediately dereference the pointer afterward so we're not expecting a null pointer.	2021-01-26 16:19:18 +00:00
Alexander Belyaev	9c4b2225b2	Revert "Revert "Revert "Revert "Revert "[analyzer] NFC: Move path diagnostic consumer implementations to libAnalysis.""""" This reverts commit `6b0ee02747`. Circular dependency again.	2021-01-08 14:17:18 +01:00
Artem Dergachev	6b0ee02747	Revert "Revert "Revert "Revert "[analyzer] NFC: Move path diagnostic consumer implementations to libAnalysis."""" This reverts commit `b12f26733a`. Fix dead include that looked like another missed circular dependency.	2021-01-07 20:22:22 -08:00
David Blaikie	b12f26733a	Revert "Revert "Revert "[analyzer] NFC: Move path diagnostic consumer implementations to libAnalysis.""" This reverts commit `d2ddc694ff`. This still contains a circular dependency between Analysis and CrossTU: $ grep -r include.Analysis clang/include/clang/CrossTU clang/include/clang/CrossTU/CrossTranslationUnit.h: #include "clang/Analysis/CrossTUAnalysisHelper.h" $ grep -r include.CrossTU clang/lib/Analysis clang/lib/Analysis/PlistHTMLPathDiagnosticConsumer.cpp: #include "clang/CrossTU/CrossTranslationUnit.h" clang/lib/Analysis/PlistPathDiagnosticConsumer.cpp: #include "clang/Analysis/CrossTUAnalysisHelper.h"	2021-01-07 18:18:23 -08:00
Artem Dergachev	d2ddc694ff	Revert "Revert "[analyzer] NFC: Move path diagnostic consumer implementations to libAnalysis."" This reverts commit `5663bf201f`. The cyclic dependency problem is addressed now. This is the ~fifth attempt to land this change.	2021-01-07 00:28:22 -08:00
Daniel Hwang	8deaec122e	[analyzer] Update Fuchsia checker to catch releasing unowned handles. Certain Fuchsia functions may return handles that are not owned by the current closure. This adds a check in order to determine when these handles are released. Differential Revision: https://reviews.llvm.org/D93868	2021-01-06 16:23:49 -08:00
Barry Revzin	92310454bf	Make LLVM build in C++20 mode Part of the <=> changes in C++20 make certain patterns of writing equality operators ambiguous with themselves (sorry!). This patch goes through and adjusts all the comparison operators such that they should work in both C++17 and C++20 modes. It also makes two other small C++20-specific changes (adding a constructor to a type that cases to be an aggregate, and adding casts from u8 literals which no longer have type const char*). There were four categories of errors that this review fixes. Here are canonical examples of them, ordered from most to least common: // 1) Missing const namespace missing_const { struct A { #ifndef FIXED bool operator==(A const&); #else bool operator==(A const&) const; #endif }; bool a = A{} == A{}; // error } // 2) Type mismatch on CRTP namespace crtp_mismatch { template <typename Derived> struct Base { #ifndef FIXED bool operator==(Derived const&) const; #else // in one case changed to taking Base const& friend bool operator==(Derived const&, Derived const&); #endif }; struct D : Base<D> { }; bool b = D{} == D{}; // error } // 3) iterator/const_iterator with only mixed comparison namespace iter_const_iter { template <bool Const> struct iterator { using const_iterator = iterator<true>; iterator(); template <bool B, std::enable_if_t<(Const && !B), int> = 0> iterator(iterator<B> const&); #ifndef FIXED bool operator==(const_iterator const&) const; #else friend bool operator==(iterator const&, iterator const&); #endif }; bool c = iterator<false>{} == iterator<false>{} // error \|\| iterator<false>{} == iterator<true>{} \|\| iterator<true>{} == iterator<false>{} \|\| iterator<true>{} == iterator<true>{}; } // 4) Same-type comparison but only have mixed-type operator namespace ambiguous_choice { enum Color { Red }; struct C { C(); C(Color); operator Color() const; bool operator==(Color) const; friend bool operator==(C, C); }; bool c = C{} == C{}; // error bool d = C{} == Red; } Differential revision: https://reviews.llvm.org/D78938	2020-12-17 10:44:10 +00:00
Bardia Mahjour	6eff12788e	[DDG] Data Dependence Graph - DOT printer - recommit This is being recommitted to try and address the MSVC complaint. This patch implements a DDG printer pass that generates a graph in the DOT description language, providing a more visually appealing representation of the DDG. Similar to the CFG DOT printer, this functionality is provided under an option called -dot-ddg and can be generated in a less verbose mode under -dot-ddg-only option. Reviewed By: Meinersbur Differential Revision: https://reviews.llvm.org/D90159	2020-12-16 12:37:36 -05:00
Bardia Mahjour	a29ecca781	Revert "[DDG] Data Dependence Graph - DOT printer" This reverts commit `fd4a10732c`, to investigate the failure on windows: http://lab.llvm.org:8011/#/builders/127/builds/3274	2020-12-14 16:54:20 -05:00
Bardia Mahjour	fd4a10732c	[DDG] Data Dependence Graph - DOT printer This patch implements a DDG printer pass that generates a graph in the DOT description language, providing a more visually appealing representation of the DDG. Similar to the CFG DOT printer, this functionality is provided under an option called -dot-ddg and can be generated in a less verbose mode under -dot-ddg-only option. Differential Revision: https://reviews.llvm.org/D90159	2020-12-14 16:41:14 -05:00
Haojian Wu	5663bf201f	Revert "[analyzer] NFC: Move path diagnostic consumer implementations to libAnalysis." The patch introduced a cycle dependency: clangAnalysis -> clangFrontend -> clangSema -> clangAnalysis This reverts commit `00ffea77ad`. This reverts commit `ea6641085d`.	2020-12-11 10:16:13 +01:00
Artem Dergachev	8c5ca7c6e6	[analyzer] OSObjectCStyleCast: Improve warning message. Suggest OSRequiredCast as a closer alternative to C-style cast. Explain how to decide.	2020-12-10 19:46:33 -08:00
Artem Dergachev	ea6641085d	Revert "Revert "Revert "Revert "[analyzer] NFC: Move path diagnostic consumer implementations to libAnalysis."""" This reverts commit `6a89cb8136`.	2020-12-10 11:02:54 -08:00
Gabor Marton	febe75032f	[analyzer][StdLibraryFunctionsChecker] Add more return value contraints This time, we add contraints to functions that either return with [0, -1] or with a file descriptor. Differential Revision: https://reviews.llvm.org/D92771	2020-12-08 17:04:29 +01:00
Gabor Marton	d14c631673	[analyzer][StdLibraryFunctionsChecker] Make close and mmap to accept -1 as fd close: It is quite often that users chose to call close even if the fd is negative. Theoretically, it would be nicer to close only valid fds, but in practice the implementations of close just returns with EBADF in case of a non-valid fd param. So, we can eliminate many false positives if we let close to take -1 as an fd. Other negative values are very unlikely, because open and other fd factories return with -1 in case of failure. mmap: In the case of MAP_ANONYMOUS flag (which is supported e.g. in Linux) the mapping is not backed by any file; its contents are initialized to zero. The fd argument is ignored; however, some implementations require fd to be -1 if MAP_ANONYMOUS (or MAP_ANON) is specified, and portable applications should ensure this. Consequently, we must allow -1 as the 4th arg. Differential Revision: https://reviews.llvm.org/D92764	2020-12-08 16:58:30 +01:00
Yu Shan	3ce78f54ed	[analyzer] Ignore annotations if func is inlined. When we annotating a function header so that it could be used by other TU, we also need to make sure the function is parsed correctly within the same TU. So if we can find the function's implementation, ignore the annotations, otherwise, false positive would occur. Move the escape by value case to post call and do not escape the handle if the function is inlined and we have analyzed the handle. Differential Revision: https://reviews.llvm.org/D91902	2020-12-07 11:28:11 -08:00
Gabor Marton	b40b3196b3	[analyzer][StdLibraryFunctionsChecker] Add return value constraint to functions with BufferSize Differential Revision: https://reviews.llvm.org/D92474	2020-12-02 17:54:48 +01:00
Balazs Benics	ee073c7985	[analyzer][StdLibraryFunctionsChecker] Fix typos in summaries of mmap and mmap64 The fd parameter of ``` void mmap(void addr, size_t length, int prot, int flags, int fd, off_t offset) ``` should be constrained to the range [0, IntMax] as that is of type int. Constraining to the range [0, Off_tMax] would result in a crash as that is of a signed type with the value of 0xff..f (-1). The crash would happen when we try to apply the arg constraints. At line 583: assert(Min <= Max), as 0 <= -1 is not satisfied The mmap64 is fixed for the same reason. Reviewed By: martong, vsavchenko Differential Revision: https://reviews.llvm.org/D92307	2020-11-30 18:06:28 +01:00
Richard Smith	3fb0879867	Refactor and simplify class scope name lookup. This is partly in preparation for an upcoming change that can change the order in which DeclContext lookup results are presented. In passing, fix some obvious errors where name lookup's notion of a "static member function" missed static member function templates, and where its notion of "same set of declarations" was confused by the same declarations appearing in a different order.	2020-11-25 16:25:33 -08:00
Haowei Wu	914f6c4ff8	[StaticAnalyzer] Support struct annotations in FuchsiaHandleChecker Support adding handle annotations to sturucture that contains handles. All the handles referenced by the structure (direct value or ptr) would be treated as containing the release/use/acquire annotations directly. Patch by Yu Shan Differential Revision: https://reviews.llvm.org/D91223	2020-11-21 19:59:51 -08:00
Artem Dergachev	6a89cb8136	Revert "Revert "Revert "[analyzer] NFC: Move path diagnostic consumer implementations to libAnalysis.""" This reverts commit `41bcc05e2a`.	2020-11-17 18:59:21 -08:00
Artem Dergachev	41bcc05e2a	Revert "Revert "[analyzer] NFC: Move path diagnostic consumer implementations to libAnalysis."" This reverts commit `77bb3ebebb`.	2020-11-17 18:45:09 -08:00
Artem Dergachev	f8f6d6455f	Revert "Revert "[analyzer] NFC: Move IssueHash to libAnalysis."" This reverts commit `662ed9e67a`.	2020-11-17 16:01:49 -08:00
Nathan James	d44edfc109	[clang][NFC] Use SmallString instead of SmallVector<char Simplifies code in some places and is more explicit about what is being used. No additional includes were added here so no impact on compile time.	2020-11-17 13:02:58 +00:00
Artem Dergachev	499bce3aba	Revert "Revert "[analyzer] NFC: Separate PathDiagnosticConsumer options from AnalyzerOptions."" This reverts commit `10f1ca99b4`. (cherry picked from commit c599fc738a70e482976c6cc0ea31bef561641279)	2020-11-16 14:37:50 -08:00
Kirstóf Umann	22e7182002	[analyzer][ReturnPtrRangeChecker] Fix a false positive on end() iterator ReturnPtrRange checker emits a report if a function returns a pointer which points out of the buffer. However, end() iterator of containers is always such a pointer, so this always results a false positive report. This false positive case is now eliminated. This patch resolves these tickets: https://bugs.llvm.org/show_bug.cgi?id=20929 https://bugs.llvm.org/show_bug.cgi?id=25226 https://bugs.llvm.org/show_bug.cgi?id=27701 Patch by Tibor Brunner! Differential Revision: https://reviews.llvm.org/D83678	2020-11-02 16:41:17 +01:00
Simon Pilgrim	2bc2e2e9fe	[MallocChecker] Remove duplicate QCoreApplication::postEvent check. NFCI. This appears to have been in the original patch in D14170. Reported as "Snippet 11" in https://www.viva64.com/en/b/0771/	2020-10-27 13:14:54 +00:00
Mikhail Maltsev	443ab4d2e0	[clang][Basic] Integrate SourceLocation with FoldingSet, NFCI This patch removes the necessity to access the SourceLocation internal representation in several places that use FoldingSet objects. Reviewed By: dexonsmith Differential Revision: https://reviews.llvm.org/D69844	2020-10-27 10:43:39 +00:00
Denys Petrov	32efb81ea6	[analyzer] [NFC] Simplify SVal::getAsLocSymbol function using existing functions Summary: Method of obtaining MemRegion from LocAsInteger/MemRegionVal already exists in SVal::getAsRegion function. Replace repetitive conditions in SVal::getAsLocSymbol with SVal::getAsRegion function. Differential Revision: https://reviews.llvm.org/D89982	2020-10-26 17:00:29 +02:00
Duncan P. N. Exon Smith	af4fb416bd	clang/StaticAnalyzer: Stop using SourceManager::getBuffer Update clang/lib/StaticAnalyzer to stop relying on a `MemoryBuffer*`, using the `MemoryBufferRef` from `getBufferOrNone` or the `Optional<MemoryBufferRef>` from `getBufferOrFake`, depending on whether there's logic for checking validity of the buffer. The change to clang/lib/StaticAnalyzer/Core/IssueHash.cpp is potentially a functionality change, since the logic was wrong (it checked for `nullptr`, which was never returned by the old API), but if that was reachable the new behaviour should be better. Differential Revision: https://reviews.llvm.org/D89414	2020-10-15 00:34:24 -04:00
Artem Dergachev	10f1ca99b4	Revert "[analyzer] NFC: Separate PathDiagnosticConsumer options from AnalyzerOptions." This reverts commit `fd4b3f123d`.	2020-10-13 12:07:36 -07:00
Artem Dergachev	662ed9e67a	Revert "[analyzer] NFC: Move IssueHash to libAnalysis." This reverts commit `b76dc111dd`.	2020-10-13 12:07:28 -07:00
Artem Dergachev	77bb3ebebb	Revert "[analyzer] NFC: Move path diagnostic consumer implementations to libAnalysis." This reverts commit `44b7cf2983`.	2020-10-13 12:03:04 -07:00
Artem Dergachev	44b7cf2983	[analyzer] NFC: Move path diagnostic consumer implementations to libAnalysis. With this change, we're more or less ready to allow users outside of the Static Analyzer to take advantage of path diagnostic consumers for emitting their warnings in different formats. Differential Revision: https://reviews.llvm.org/D67422	2020-10-13 10:53:10 -07:00
Artem Dergachev	b76dc111dd	[analyzer] NFC: Move IssueHash to libAnalysis. IssueHash is an attempt to introduce stable warning identifiers that won't change when code around them gets moved around. Path diagnostic consumers print issue hashes for the emitted diagnostics. This move will allow us to ultimately move path diagnostic consumers to libAnalysis. Differential Revision: https://reviews.llvm.org/D67421	2020-10-13 10:53:10 -07:00
Artem Dergachev	fd4b3f123d	[analyzer] NFC: Separate PathDiagnosticConsumer options from AnalyzerOptions. The AnalyzerOptions object contains too much information that's entirely specific to the Analyzer. It is also being referenced by path diagnostic consumers to tweak their behavior. In order for path diagnostic consumers to function separately from the analyzer, make a smaller options object that only contains relevant options. Differential Revision: https://reviews.llvm.org/D67420	2020-10-13 10:53:10 -07:00
Bevin Hansson	101309fe04	[AST] Change return type of getTypeInfoInChars to a proper struct instead of std::pair. Followup to D85191. This changes getTypeInfoInChars to return a TypeInfoChars struct instead of a std::pair of CharUnits. This lets the interface match getTypeInfo more closely. Reviewed By: efriedma Differential Revision: https://reviews.llvm.org/D86447	2020-10-13 13:26:56 +02:00
Bevin Hansson	9fa7f48459	[Fixed Point] Add fixed-point to floating point cast types and consteval. Reviewed By: leonardchan Differential Revision: https://reviews.llvm.org/D86631	2020-10-13 13:26:56 +02:00
Nithin Vadukkumchery Rajendrakumar	0b4fe8086f	[Analyzer] Fix for dereferece of smart pointer after branching on unknown inner pointer Summary: Enabling warning after dereferece of smart pointer after branching on unknown inner pointer. Reviewers: NoQ, Szelethus, vsavchenko, xazax.hun Reviewed By: NoQ Subscribers: martong, cfe-commits Tags: #clang Differential Revision: https://reviews.llvm.org/D87043	2020-10-09 13:42:25 +02:00
Benjamin Kramer	6a1bca8798	[Analyzer] Fix unused variable warning in Release builds clang/lib/StaticAnalyzer/Core/ExprEngineCXX.cpp:377:19: warning: unused variable 'Init'	2020-09-25 14:09:43 +02:00
Adam Balogh	facad21b29	[Analyzer] Fix for `ExprEngine::computeObjectUnderConstruction()` for base and delegating consturctor initializers For /C++/ constructor initializers `ExprEngine:computeUnderConstruction()` asserts that they are all member initializers. This is not neccessarily true when this function is used to get the return value for the construction context thus attempts to fetch return values of base and delegating constructor initializers result in assertions. This small patch fixes this issue. Differential Revision: https://reviews.llvm.org/D85351	2020-09-25 13:28:22 +02:00

... 6 7 8 9 10 ...

4997 Commits