infer_clone

Commit Graph

Author	SHA1	Message	Date
Jules Villard	d79bd90b81	[pdesc] new pre-analysis to diverge after "noreturn" function calls Summary: A plugin update allows infer to know when a function doesn't return according to its attributes. This propagates this info all the way to the attributes of each function, and then uses this information in a new pre-analysis that cuts the links to successor nodes of each `Call` instruction to a function that does not return. NOTE: The "no_return" `CallFlag.t` was dead code; following diffs deal with that (by removing it). Reviewed By: dulmarod Differential Revision: D18573922 fbshipit-source-id: 85ec64eca	5 years ago
Jules Villard	78a33acb77	[cfg] run pre-analysis lazily in ondemand Summary: This also prints the CFGs after pre-analysis for individual procedures in infer-out/captured/<filename>/<proc>.dot. One can also look up the CFGs before pre-analysis in infer-out/captured/proc_cfgs_frontend.dot. Context: I want to add a pre-analysis that needs to look at proc attributes inter-procedurally. For this to make sense it has to happen after all of capture, and before analysis. Thus, this diff brings back the lazy running of the pre-analysis like in D15803492, except that we still make sure to run the pre-analyses systematically regardless of the checkers being run by running the pre-analysis from ondemand.ml. Also we don't need to re-introduce the "did_preanalysis" proc attribute for the same reason that the pre-analysis is now run once and for all by ondemand.ml (instead of each individual checker back in the days). This has the benefit of running the pre-analysis only when needed, and the drawback that several concurrent processes analysing the same proc descs will duplicate work. Since pre-analyses are supposed to be very fast I assume that neither is a big deal. If they become more expensive then the benefit gets bigger and the drawback is just the same as with regular analyses. Reviewed By: skcho Differential Revision: D18573920 fbshipit-source-id: de350eaef	5 years ago
Jules Villard	b03ca78bf3	[pdesc][refactor] ability to set normal and exceptional succs independently Summary: - more flexible API - less error-prone thanks to named parameters - also takes care of adjusting predecessors of the previous successors! This fixes some (probably harmless) bugs in the frontends. Reviewed By: dulmarod Differential Revision: D18573923 fbshipit-source-id: ad97b3607	5 years ago
Jules Villard	f81c9d56e3	[pulse] arithmetic operations Summary: Model +/- when we know the concrete interval for a value. Reviewed By: skcho Differential Revision: D18528535 fbshipit-source-id: 7c67a7a54	5 years ago
Jules Villard	6ecf4066e8	[pulse] model std::integral_constant Summary: cpp_initialization Reviewed By: skcho Differential Revision: D18528537 fbshipit-source-id: ab5f8038a	5 years ago
Jules Villard	6df4fb6a9b	[pulse] report dereference of NULL and constants Summary: Note: Disabled by default. Having some support for values, we can report when a null or constant value is being dereferenced. The particularity here is that we don't report when 0 is a possible value for the address, or even if we know that the value of the address can only be 0 in that branch! Instead, we allow ourselves to report only when the address has been set to NULL (or any constant). This is in line with how pulse deals with other issues: only report when 1. we see an address become invalid, and 2. we see the same address be used later on. Reviewed By: skcho Differential Revision: D17665468 fbshipit-source-id: f1ccf94cf	5 years ago
Jules Villard	2e4fbb7fe5	[pulse] intervals! Summary: This adds a more interesting value domain to pulse: concrete intervals. There are still two main limitations: 1. arithmetic operations are all over-approximated: any assignment involving arithmetic operations is replaced by non-determinism 2. abstract values that are discovered to be equal are not merged into one Reviewed By: skcho Differential Revision: D18058972 fbshipit-source-id: 0492a590f	5 years ago
Jules Villard	b20c22a5ee	[pulse] abduce arithmetic facts Summary: This does several things because it was hard to split it more: 1. Split most of the arithmetic reasoning to PulseArithmetic.ml. This doesn't need to be reviewed thoroughly because an upcoming diff changes the domain from just `EqualTo of Const.t` to an interval domain! 2. When going through a prune node intra-procedurally, abduce arithmetic facts to the pre (instead of just propagating them). This is the "assume as assert" trick used by biabduction 1.0 too and allows to propagate arithmetic constraints to callers. 3. Use 2 when applying summaries by pruning specs whose preconditions have un-satisfiable arithmetic constraints. This changes one of the tests! Pulse now does a bit more work to find the false positive, as can be seen in the longer trace. Reviewed By: skcho Differential Revision: D18117160 fbshipit-source-id: af3b2c8c0	5 years ago
Ezgi Çiçek	6781ba36d3	[impurity] Start checking equivalence at materialized addresses in pre Summary: Previously, we considered a function which modifies its parameters to be impure even though it might not be modifying the underlying value. This resulted in FPs like the following program in Java: ``` void fresh_pure(int[] a) { a = new int[1]; } ``` Similarly, in C++, we considered the following program as impure because it was writing to `s`: ``` Simple* reassign_pure(Simple* s) { s = new Simple{2}; return s; } ``` This diff fixes that issue by starting the check for address equivalence in pre-post not directly from the addresses of the stack variables, but from the addresses pointed to by these stack variables. That means, we only consider things to be impure if the actual values pointed to by the parameters change. Reviewed By: skcho Differential Revision: D18113846 fbshipit-source-id: 3d7c712f3	5 years ago
Jules Villard	16c88e282d	[pulse] some tests about values Summary: In preparation for improvements to the arithmetic reasoning. Reviewed By: dulmarod Differential Revision: D17977207 fbshipit-source-id: ee98e0772	5 years ago
Jules Villard	6a738045fd	[pulse] interprocedural histories and traces Summary: bigmacro_bender There are 3 ways pulse tracks history. This is at least one too many. So far, we have: 1. "histories": a humble list of "events" like "assigned here", "returned from call", ... 2. "interproc actions": a structured nesting of calls with a final "action", eg "f calls g calls h which does blah" 3. "traces", which combine one history with one interproc action This diff gets rid of interproc actions and makes histories include "nested" callee histories too. This allows pulse to track and display how a value got assigned across function calls. Traces are now more powerful and interleave histories and interproc actions. This allows pulse to track how a value is fed into an action, for instance performed in callee, which itself creates some more (potentially now interprocedural) history before going to the next step of the action (either another call or the action itself). This gives much better traces, and some examples are added to showcase this. There are a lot of changes when applying summaries to keep track of histories more accurately than was done before, but also a few simplifications that give additional evidence that this is the right concept. Reviewed By: skcho Differential Revision: D17908942 fbshipit-source-id: 3b62eaf78	5 years ago
Jules Villard	669383d315	[pulse] more details about variable declaration events Summary: - add the variable being declared so we can report it back in the trace in addition to its location - distinguish between local vars and formals Reviewed By: skcho Differential Revision: D17930348 fbshipit-source-id: a5b863e64	5 years ago
Jules Villard	8182514f35	[impurity] clarify string parameter of `ImpurityDomain.add_to_errlog` Summary: Instead of a string argument named `~str` pass `Formal \| Global` and let `add_to_errlog` figure out how to print it. Reviewed By: ezgicicek Differential Revision: D17907657 fbshipit-source-id: ed09aab72	5 years ago
Jules Villard	96c96a8dc6	[pulse] remember equalities found in branches Summary: When we make the decision to go into a branch "v = N" where some abstract value is compared to a constant, remember the corresponding equality. This allows to prune simple infeasible paths intra-procedurally. Further work is needed to make this useful interprocedurally, for instance either or both of these ideas could be explored: - abduce v=N in the precondition and do not apply summaries when the equalities in the pre are not satisfied - prune post-conditions that lead to unsat states where a value has to be equal to several different constants Reviewed By: skcho Differential Revision: D17906166 fbshipit-source-id: 5cc84abc2	5 years ago
Jules Villard	3ac8e27062	[pulse] use constant equality to prune unfeasible paths Summary: When we know "x = 3" and we have a condition "x != 3" we know we can prune the corresponding path. Reviewed By: skcho Differential Revision: D17665472 fbshipit-source-id: 988958ea6	5 years ago
Ezgi Çiçek	557e2bfa3f	[impurity] Consider functions with no pulse summary as impure Summary: If we have no pulse summary (most likely caused by pulse finding a legit issue with the code), let's consider the function as impure. Reviewed By: jvillard Differential Revision: D17906016 fbshipit-source-id: 671d3e0ba	5 years ago
Nikos Gorogiannis	f57bb9be0a	[starvation] make deduplication depend on filtering config var Summary: Previously deduplication was always on which is not great for testing. Also split tests so that we can still test deduplication separately. Reviewed By: mityal Differential Revision: D17686877 fbshipit-source-id: 280d91473	5 years ago
Jules Villard	362e9cc622	[pulse] do not print `()` after functions Summary: Unfortunately it is very hard to predict when `Typ.Procname.describe` will add `()` after the function name, so we cannot make sure it is always there. Right now we report clowny stuff like "error while calling `foo()()`", which this change fixes. Reviewed By: ezgicicek Differential Revision: D17665470 fbshipit-source-id: ef290d9c0	5 years ago
Ezgi Çiçek	c5ca4db8d0	[pulse][impurity] Use pulse for detecting impurity Summary: Introduce a new experimental checker (`--impurity`) that detects impurity information, tracking which parameters and global variables of a function are modified. The checker relies on Pulse to detect how the state changes: it traverses the pre and post pairs starting from the parameter/global variable and finds where the pre and post heaps diverge. At diversion points, we expect to see WrittenTo/Invalid attributes containing a trace of how the address was modified. We use these to construct the trace of impurity. This checker is a complement to the purity checker that exists mainly for Java (and used for cost and loop-hoisting analyses). The aim of this new experimental checker is to rely on Pulse's precise memory treatment and come up with a more precise im(purity) analysis. To distinguish the two checkers, we introduce a new issue type `IMPURE_FUNCTION` that reports when a function is impure, rather than when it is pure (as in the purity checker). TODO: - improve the analysis to rely on impurity information of external library calls. Currently, all library calls are assumed to be nops, hence pure. - de-entangle Pulse reporting from analysis. Reviewed By: skcho Differential Revision: D17051567 fbshipit-source-id: 5e10afb4f	5 years ago
Dulma Churchill	27ea5d041b	[biabduction] Rename use_after_free to avoid name clash with Pulse Summary: Use_after_free was used both for biabduction and pulse, and the biabduction version is blacklisted by default. As a result, the Pulse version was also disabled unintentionally. This changes the name of the old use_after_free so that now we can get use_after_free bugs whenever pulse is enabled. Reviewed By: skcho Differential Revision: D17182687 fbshipit-source-id: 539ca69de	5 years ago
Dulma Churchill	d04e098eb1	[AL] Add a is_static predicate Summary: With this predicate we are able to check for static global variables in AL. Reviewed By: ddino Differential Revision: D17164848 fbshipit-source-id: a3d10598c	5 years ago
Sungkeun Cho	59f06568cf	[inferbo] Use std::vector model for std::string Summary: This diff uses the models of vector for modelling string in Cpp. Depends on D16963153 Reviewed By: ezgicicek Differential Revision: D16963166 fbshipit-source-id: 5effe2d72	5 years ago
Jules Villard	9e5115a9e0	[annotreach] support for new `"symbol_regexps"` matcher Summary: This is more powerful than `"symbols"` for more advanced use-cases. Keep `"symbols"` unchanged to make migrating easier. Differential Revision: D16985756 fbshipit-source-id: dfbb09393	5 years ago
Dulma Churchill	d0bfb856ed	[AL] Add new predicate is_extern Summary: Adding new predicate for checking whether a variable is defined as extern. May be useful in AL rules. Reviewed By: jvillard Differential Revision: D16961690 fbshipit-source-id: 0677077dc	5 years ago
Nikos Gorogiannis	86a1bbf1a7	[racerd] output access expressions language-sensitively Summary: Use whatever information we can to decide whether to use C or Java syntax when outputting an access expression, now that we store them as such. Also, make cluster callbacks explicitly set the language, as this was not done before and led to some confusion (Clang being set when analysing a Java file). Reviewed By: skcho Differential Revision: D16884160 fbshipit-source-id: 40adf9f35	5 years ago
Jules Villard	0af754f3d7	[annot reachability] apply sanitizers in more cases Summary: Change the logic of the annotation reachability checker in the following ways: 1. Sanitizers take priority over sinks, i.e. a procedure that is both a sink and a sanitizer is not a sink. This changes the existing tests that seemed to assume the opposite. However I think that way is more useful and goes better with the fact that sanitizers are specified as "overrides". 2. When applying a summary, check again that we are not in a sanitizer for the corresponding sink. Without (2) there was a subtle bug when several rules were specified. For example, if `sink_wrapper()` wraps `sink()` for a rule `R` then the summary of `sink_wrapper()` will be: `R-sink : call to sink()`. Then, suppose `sanitizer()` calls `sink_wrapper()` and `sanitizer()` is a sanitizer for `R` but not for another rule `R'`. The previous code would add the call to `sink()` to the summary of `sanitizer()` because it's not a sanitizer for `R'`, even though `sink()` is not a sink for `R'`! The current code will re-apply the rules correctly so that sinks are matched only against the right sanitizers. Reviewed By: skcho Differential Revision: D16895577 fbshipit-source-id: 266cc4940	5 years ago
Jules Villard	00cbc9c1e4	[annot reachability] add debug logging and light refactor Summary: - run the tests! they weren't hooked up to the main Makefile :/ - add some html debug messages - formatting Reviewed By: skcho Differential Revision: D16895578 fbshipit-source-id: e96d737cc	5 years ago
Sungkeun Cho	ddd4d98636	[inferbo] Add vector model: data Summary: It adds a vector model of `data` method. Depends on D16687280 Reviewed By: ezgicicek Differential Revision: D16689400 fbshipit-source-id: 156016b3c	5 years ago
Sungkeun Cho	58b403c8ff	[inferbo] Add vector model: empty Summary: It adds a model of vector::empty. Depends on D16687269 Reviewed By: ezgicicek Differential Revision: D16687280 fbshipit-source-id: 997a5faeb	5 years ago
Sungkeun Cho	c05062556f	[inferbo] Add vector model: push_back Summary: It adds a model of vector::push_back Depends on D16687225 Reviewed By: ezgicicek Differential Revision: D16687269 fbshipit-source-id: 9d2a73fca	5 years ago
Sungkeun Cho	f6b4f75e7c	[inferbo] Pruning by vector::size Summary: It enables pruning of vector's size when the return value of the function call of `vector::size` is pruned. Depends on D16687167 Reviewed By: ezgicicek Differential Revision: D16687225 fbshipit-source-id: 793a21b3a	5 years ago
Sungkeun Cho	e9cf5d33b3	[inferbo] Add models of vector constructors Summary: It adds models of vector constructors. Depends on D16645624 Reviewed By: ezgicicek Differential Revision: D16687167 fbshipit-source-id: eac49df6d	5 years ago
Sungkeun Cho	8c4be65754	[inferbo] Ondemand value generation of vector as function parameter Summary: It generates vector value ondemand when it is given as a parameter. Depends on D16645589 Reviewed By: ezgicicek Differential Revision: D16645624 fbshipit-source-id: 7498c8ab2	5 years ago
Sungkeun Cho	f066776b17	[inferbo] Add model: vector size Summary: It adds vector model of the size function Reviewed By: ezgicicek Differential Revision: D16645589 fbshipit-source-id: 6518fa228	5 years ago
Sungkeun Cho	7a8e7d13e9	[inferbo] Add model: vector constructor Summary: It adds vector model of constructor. Reviewed By: ezgicicek Differential Revision: D16645564 fbshipit-source-id: 92241a068	5 years ago
Jules Villard	13d54990bd	[models] get rid of include-based C++ models Summary: These have proved to be too fragile to maintain as they would often break compilation of user code. They have been off by default for more than a year now (D7350715). Removing the include models shows a more accurate picture of what infer results look like in production. As such, lots of tests have changed, mostly biabduction but also in inferbo. SIOF was using include-based models too but now libc++ is better and iostreams are implemented in a way that SIOF understands (instead of being magical creatures) so nothing changed there. Reviewed By: skcho Differential Revision: D16602171 fbshipit-source-id: ce38f045b	5 years ago
Sungkeun Cho	b3f52284ed	[inferbo] Ignore the top of latest prune of callees Summary: This diff prevents that the latest prune value is overwritten as top from callees. Reviewed By: jvillard Differential Revision: D16540391 fbshipit-source-id: bdd5b42ed	5 years ago
Ezgi Çiçek	127902222d	[pulse] Filter AddressOfStackVariable from read only heuristic check Reviewed By: skcho Differential Revision: D16518259 fbshipit-source-id: 92a631a82	5 years ago
Sungkeun Cho	84a6561dc9	[inferbo] Precise mod semantics on unsigned integer Summary: This diff improves the precision of the mod operator. For example, result of x % c (when x>=0 and c>0) is (before) [0, c-1] (after) [0, min(c-1,x)] Reviewed By: ezgicicek Differential Revision: D16518578 fbshipit-source-id: a68660ee7	5 years ago
Ezgi Çiçek	09ab685c7e	[pulse] Handle stack refs escaping their scope via pointer Summary: Pulse didn't treat local variables going out of scope as invalidating the corresponding address in memory. This diff fixes that by - marking all local variables that exit the scope with the attribute `AddressOfStackVariable` - before we write the summary for the proc, we make sure to invalidate all such addresses local to the procedure as `Invalid`. If such an address is read, then we would raise a use-after-lifetime issue. Reviewed By: jvillard Differential Revision: D16458355 fbshipit-source-id: 3686524cb	5 years ago
Sungkeun Cho	124ab9fed7	[inferbo] Downgrade issues of void pointer Summary: It downgrades issues of void pointer to L5, because of its impreciseness. This is not ideal but Inferbo cannot analyze arrays pointed by void pointers precisely at the moment. Reviewed By: jvillard Differential Revision: D16379911 fbshipit-source-id: f2c016aba	5 years ago
Jules Villard	a504a67ec2	[pulse] model some of `std::basic_string` Summary: A common gotcha is the new test. Model the minimum amount of `std::basic_string` to catch it. Reviewed By: mbouaziz, ngorogiannis Differential Revision: D16121090 fbshipit-source-id: 66f06cb43	5 years ago
Jules Villard	14b9975cf3	[pulse] support modelling destructors Summary: We want to detect that variables and C++ temporaries go out of scope even when their destructor happens to be modelled. We lost a test to that because `std::function::~function` was poorly modeled as deleting the lambda itself which would now cause a double invalidation. This has to be modelled better now as something that invalidates something inside the lambda, and also model `operator()` as something that accesses that something, to recover that test. It's not a vital test though, so Do It Later©. Reviewed By: ngorogiannis Differential Revision: D16121091 fbshipit-source-id: 6b777ca18	5 years ago
Jules Villard	d9aadf5df2	[pulse] allow models in invalidation traces Summary: Be more flexible in what type of function calls are allowed in `ViaCall ...` actions to be able to include models. Also get rid of `here here` in traces /o\ As a side-effect, get more precise (=qualified) procedure names in traces (but not in messages so as not to be too verbose). Reviewed By: mbouaziz, ngorogiannis Differential Revision: D16121092 fbshipit-source-id: fb51b02f8	5 years ago
Jules Villard	ef26e8bb28	[clang] NamespaceAliasDecl is just a no-op Summary: Fixes #1123. Reviewed By: mbouaziz, ngorogiannis Differential Revision: D16163589 fbshipit-source-id: 10d2d8010	5 years ago
Jules Villard	c89a8d3e63	delete ownership checker Summary: Replaced by pulse. `--ownership` is now a deprecated form of `--pulse`. The ownership checker is starting to give wrong answers due to changes in the clang frontend, so it's better to remove it in favour of pulse. there_goes_my_hero Reviewed By: ngorogiannis Differential Revision: D16107650 fbshipit-source-id: bb2446a19	5 years ago
Jules Villard	e803a30c2d	[clang] fix translation of `initListExpr` again Summary: So it turns out we need to translate even more cases. Pulse had a FP before that this fixes. Reviewed By: ezgicicek Differential Revision: D16073629 fbshipit-source-id: c03460b5a	5 years ago
Jules Villard	14ce445f81	[pulse] run tests against C++17 Summary: This is needed to test some functionality in the next diff. Only one test changes (no longer a FN), which is now documented. Also, stop including the "header models" meant for biabduction! Maybe one day we'll need to have several test modes for different C++ versions. Seems overkill for now, so let's wait until we see some actual issues (eg FPs) that manifest in one version but not the other. Reviewed By: mbouaziz Differential Revision: D16073630 fbshipit-source-id: 1cfdfc933	5 years ago
Jules Villard	86decb83f6	[pulse] record attributes of address not edge-reachable in the post Summary: Sometimes the post of a function call has attributes on addresses that were mentioned in the pre but are no longer reachable in the post. We don't want to forget these, see added test. Reviewed By: mbouaziz Differential Revision: D16050050 fbshipit-source-id: 1ce522b97	6 years ago
Jules Villard	58b1df6bb9	[clang] fix destructor placement for temporaries in conditionals Summary: The previous code would call the destructor for the C++ temporary before the prune nodes, which then try to dereference it. Wrong. Quick fix: don't destroy temporaries in conditionals. Reviewed By: mbouaziz Differential Revision: D16030735 fbshipit-source-id: e11abad58	6 years ago
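
The interval refinement described in the inferbo commit 84a6561dc9 above (for `x % c` with `x >= 0` and `c > 0`, the result goes from `[0, c-1]` to `[0, min(c-1, x)]`) can be sketched as follows. This is only an illustration in Python, not Inferbo's OCaml implementation: the function names `mod_interval` and `is_sound` are hypothetical, and `x` is assumed to be represented by a known upper bound `x_hi`.

```python
from typing import Tuple


def mod_interval(x_hi: int, c: int) -> Tuple[int, int]:
    """Bounds for x % c, assuming 0 <= x <= x_hi and a constant c > 0.

    Hypothetical helper: the refined upper bound is min(c - 1, x_hi),
    because x % c can exceed neither c - 1 nor x itself.
    """
    if x_hi < 0 or c <= 0:
        raise ValueError("sketch only covers x >= 0 and c > 0")
    return (0, min(c - 1, x_hi))


def is_sound(x_hi: int, c: int) -> bool:
    """Brute-force check that every concrete x % c lies within the bounds."""
    lo, hi = mod_interval(x_hi, c)
    return all(lo <= x % c <= hi for x in range(x_hi + 1))
```

For instance, `mod_interval(3, 10)` gives `(0, 3)` where the older rule would have given `(0, 9)`; when `x_hi` is at least `c - 1` the two rules agree.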


839 Commits (624d7d7930d022eb802b9959d51bd686278fcfd9)