infer_clone

Commit Graph

Author	SHA1	Message	Date
Jules Villard	0b7e2fb7c7	[pulse] more type-safety around summary and latent issues creation Summary: This makes sure we call `AbductiveDomain.summary_of_post` exactly once per post-condition. Notice in particular in the diff: - in Pulse.ml we remove a now-certified-useless "is_unsat_expensive" call - in PulseOperations.ml we add a previously-missing call to `summary_of_post` (it's needed to remove local variables from the symbolic state + normalize) The price to pay is ugly type annotations and down-casting peppered in a few places, in reasonable number. Reviewed By: da319 Differential Revision: D24078564 fbshipit-source-id: 3102cacf0	4 years ago
Jules Villard	7fdb33b710	[pulse] report errors only when the PRUNE nodes along the path are true Summary: Take another page from the Incorrectness Logic book and refrain from reporting issues on paths unless we know for sure that this path will be taken. Previously, we would report on paths that are merely not impossible. This goes very far in the other direction, so it's possible we'll want to go back to some sort of middle ground. Or maybe not. See the changes in the tests to get a sense of what we're missing. Reviewed By: ezgicicek Differential Revision: D24014719 fbshipit-source-id: d451faf02	4 years ago
Daiva Naudziuniene	d0cb245303	[frontend] Fix capture init for cpp lambdas Summary: We were missing assignment to captured variables with initializers. Consider the following example: ``` S* update_inside_lambda_capture_and_init(S* s) { S* object = nullptr; auto f = [& o = object](S* s) { o = s; }; f(s); return object; } ``` which was translated to ``` VARIABLE_DECLARED(o:S&); &o:S&=&object &f =(_fun...lambda..._operator(),([by ref]&o &o:S&)) ``` However, we want to capture `o` (which is an address of `object`), rather `&o` in closure. After the diff ``` VARIABLE_DECLARED(o:S&); &o:S&=&object n$7=&o:S& &f =(_fun...lambda..._operator(),([by ref]n$7 &o:S&)) ``` Reviewed By: jvillard Differential Revision: D23567346 fbshipit-source-id: 20f77acc2	4 years ago
Sungkeun Cho	d13ae13a71	[pulse] Fix linting issue Reviewed By: jvillard Differential Revision: D23758187 fbshipit-source-id: 4d9f47e66	4 years ago
Jules Villard	931cf2d72b	[pulse][minor] resolve mysterious 'a in .mli Summary: Even though it's unused in the implementation this argument is supposed to be `Typ.t` so spell it out. Reviewed By: skcho Differential Revision: D23729433 fbshipit-source-id: d05548f42	4 years ago
Jules Villard	03bc3f31c8	[pulse] add option to skip functions/classes Summary: This can be useful to make pulse forget about tricky parts of the code. Treat "skipped" procedures as unknown so heuristics for mutating the return value and parameters passed by reference are applied. Reviewed By: ezgicicek Differential Revision: D23729410 fbshipit-source-id: d7a4924a8	4 years ago
Jules Villard	b62c3f55b9	[pulse] fix fuel debug message Summary: This would previously print that we ran out of fuel even if we didn't and we simply reached a normal form. Reviewed By: ezgicicek Differential Revision: D23575571 fbshipit-source-id: 37d02ca8d	5 years ago
Nikos Gorogiannis	30fcdc7d29	[attributes] request attributes only when procdesc isn't required Summary: Most of the time, when the procdesc of a callee is requested, all that is really required is the procedure attributes. However, requesting the procdesc may return `None` when the procedure is undefined (in Java, and soon for Clang too). So, change all callsites to using attributes instead, where possible. Reviewed By: jvillard Differential Revision: D23539422 fbshipit-source-id: 3b1a52d48	5 years ago
Daiva Naudziuniene	4401701578	[pulse] Model for std::function copy constructor Summary: Added a model for copy constructor for `std::function`. In most cases, the SIL instruction `std::function::function(&dest, &src)` gives us pointers to `dest` and `src`, hence, we model the copy constructor as a shallow copy. However, in some cases, e.g. `std::function f = lambda_literal`, SIL instruction contains the closure itself `std::function::function(&dest, (operator(), captured_vars)`, hence, we need to make sure we copy the right value. Reviewed By: ezgicicek Differential Revision: D23396568 fbshipit-source-id: 0acb8f6bc	5 years ago
Daiva Naudziuniene	0a4af7754d	[pulse] Fix std::function::operator() Summary: There was a mismatch between formals and actuals in `std::function::operator()` because we were not passing the first argument corresponding to the closure. Reviewed By: ezgicicek Differential Revision: D23372104 fbshipit-source-id: d0f9b27d6	5 years ago
Daiva Naudziuniene	29fd9e13d1	[pulse] Understand captured variables in cpp lambdas Summary: When we evaluate lambdas in pulse, we create a closure object with `fake` fields to store captured variables. However, during the function call we were not linking the captured values from the closure object. We address this missing part here. Reviewed By: jvillard Differential Revision: D23316750 fbshipit-source-id: 14751aa58	5 years ago
Radu Grigore	9591276541	[topl] Cheap port to Pulse. Summary: Report errors found by running Topl on top of Pulse, when using --topl-pulse. Topl tests now run on top of Pulse. Reviewed By: jvillard Differential Revision: D23030771 fbshipit-source-id: 8770c2902	5 years ago
Jules Villard	5cceead7ae	[pulse] normalize again when we discover new linear eqs Summary: When normalizing discovers new linear arithmetic facts in `normalize_linear_eqs` we go around once more. Do the same when atoms become linear equalities. Reviewed By: skcho Differential Revision: D23264425 fbshipit-source-id: b355875f3	5 years ago
Jules Villard	50b94dbbd6	[pulse] cleanup arithmetic Summary: Mostly cosmetic except for a change in [solve_eq] to try harder at normalization (improves unit tests!). Add more comments and do minor renamings. Reviewed By: skcho Differential Revision: D23243629 fbshipit-source-id: 55bdaf8a8	5 years ago
Jules Villard	8b23fee8f8	[pulse] refactor Atom.eval_atom Summary: This function had become a bit hard to read and the part about embedded atoms was not very clear and also a bit incomplete (need to handle "= 1" and "≠ 1" too). Reviewed By: skcho Differential Revision: D23242216 fbshipit-source-id: 239fade97	5 years ago
Jules Villard	ecdb153579	[pulse] streamline atom normalization Summary: This does a bunch of things at once (sorry): - Refactor atom/term normalisation so that terms that are really just atoms become atoms. - Use this to not bother adding special cases in the functions exported in the .mli: `and_less_than`, `and_equal_binop`, `prune_binop`, etc. all had special cases to avoid introducing terms that could be atoms. That's not great because the same smarts wasn't applied to terms that would only become atom-like after some normalisation, and led to weird and duplicated code. Now it's much cleaner: just add the most straighforward fact and normalise! - Fix a bug: adding a new equality `x = linear` should not be done using `Normalizer.merge_var_linarith` as this is an internal function that assumes that `x` is the right representative in `x - linear`. Instead, for abitrary equations of that form, `solve_eq` should be used. - When `normalize_linear_eqs` discovers new linear equalities, normalize again. Add fuel there too to avoid spending too much time doing that. It could be that we don't need/want fuel there but then we'd need to think very hard about why there's no infinite recursion possible and that seems harder. Reviewed By: skcho Differential Revision: D23241282 fbshipit-source-id: e5b8c4759	5 years ago
Jules Villard	7df30b0c4e	[pulse] preserve physical equality on var subst in LinArith Summary: This is used for variable substitution and will often be a no-op when normalising terms over and over again (after the first normalisation, the expression should stay the same). The equivalent function for terms was already being careful about not re-allocating identical terms so extend that care to linear expression. Reviewed By: skcho Differential Revision: D23241601 fbshipit-source-id: b365eb87a	5 years ago
Jules Villard	eb37d2ced5	[pulse] substitute entire linear expressions Summary: This allows further normalisation now that terms contain linear expressions in normal form. Reviewed By: skcho Differential Revision: D23241499 fbshipit-source-id: f8e4e759c	5 years ago
Jules Villard	36af901d79	[pulse] normalize any linear atom Summary: Linear arithmetic is able to simplify more atoms, eg `x+y <= x+y` becomes `True` by normalising to "lhs - rhs <= 0". This does the first step of normalisation, but to get True in this example we also need to substitute inside atoms according to the linear equalities, which is the next diff (for now we only substitute variables inside atoms for other variables or for constants). Reviewed By: skcho Differential Revision: D23241457 fbshipit-source-id: 0da0b545c	5 years ago
Jules Villard	69995cebb6	[pulse] add a Linear variant to terms Summary: More scaffolding, nothing creates `Linear _` terms yet. Some changes to variables substitution to allow substituting variables for linear terms (as well as constants and other variables). Reviewed By: skcho Differential Revision: D23241461 fbshipit-source-id: fc870255e	5 years ago
Jules Villard	45894a7dd9	[pulse] move LinArith before Term Summary: This is needed for the rest of the stack that introduces a `Linear of LinArith.t` variant in `Term.t` to enable more normalisation inside of terms. Reviewed By: skcho Differential Revision: D23241353 fbshipit-source-id: ad765cd13	5 years ago
Jules Villard	1d56705cd4	[pulse] evaluate all constant expressions Summary: Make term simplification a bit more structured and separate the "simplification" phase from the "evaluating constant expressions" phase. Also implement the latter for all possible terms. Reviewed By: skcho Differential Revision: D23241334 fbshipit-source-id: 2964aa477	5 years ago
Jules Villard	bcba7c8475	[pulse][minor] moving some arithmetic stuff around Summary: Not much to see here, extracted to make further changes more readable. Reviewed By: da319 Differential Revision: D23241335 fbshipit-source-id: 81181f23a	5 years ago
Jules Villard	5278cb7374	[pulse] `delete nullptr` is a no-op Summary: `delete` works exactly like `free` so merge both models together. Also move the `free(0)` test to nullptr.cpp as it seems more appropriate. Reviewed By: da319 Differential Revision: D23241297 fbshipit-source-id: 20a32ac54	5 years ago
Jules Villard	af64d5dafe	[pulse] detect when atoms become linear arithmetic Summary: Since this is where almost all of the reasoning is concentrated, let's make sure we use it at every opportunity! Reviewed By: skcho Differential Revision: D23194224 fbshipit-source-id: fedb2811e	5 years ago
Ezgi Çiçek	c449f6e727	[pulse] Tweak error message for Memory Leak Reviewed By: dulmarod Differential Revision: D23238533 fbshipit-source-id: e5e4e280d	5 years ago
Jules Villard	3e7bf4343b	[pulse] make unit tests more robust to adding more tests Summary: Reset the state before each test so that adding tests doesn't affect other tests by shifting the ids of their anonymous variables. Reviewed By: skcho Differential Revision: D23194171 fbshipit-source-id: 7b717f160	5 years ago
Jules Villard	6fae5f641e	[pulse] change constants to be rationals Summary: These are the only ones we need, it turns out the other types (string, proc names, ...) were dead code. The changes the integer constants to rational constants, to match the domain of the linear arithmetic engine. Reviewed By: skcho Differential Revision: D23164136 fbshipit-source-id: 755c3f526	5 years ago
Jules Villard	0433e9592e	[pulse] new new arithmetic Summary: Instead of alternating between a normal form and a tree structure, always keep a normal form. Except the normal form is not always fully normalized. Overall, it's a bit faster than the previous iteration, while being more precise! In particular, linear arithmetic aims at being much more complete. Reviewed By: skcho Differential Revision: D23134209 fbshipit-source-id: 5f9ec6ece	5 years ago
Daiva Naudziuniene	69e0dce0ed	[pulse] fix end() iterator false positive Summary: Before we were modelling `vector.end()` as returning a fresh pointer every time is was called. It is common to check if an iterator is not the `end()` iterator and proceed to dereference the iterator in that case. In such code pattern `vector.end()` is called twice and returns different fresh values which causes false positives. To fix this, we add a special internal field `__infer_model_backing_array_pointer_to_last_element` to a vector to denote its end. Now, every time we call `vector.end()` we return the value of this field. We introduce a new attribute `EndOfCollection` to mark `end` iterator as the existing `EndIterator` invalidation is not suitable when we need to read the same value multiple times. Reviewed By: jvillard Differential Revision: D23101185 fbshipit-source-id: fa8a33b58	5 years ago
Jules Villard	7b743ceb1a	[pulse][formula] forget dead facts Summary: At the end of analysing a procedure we call `simplify ~keep:vars_live_in_pre_post`. Any variable not in `vars_live_in_pre_post` is not mentioned anywhere else in the state and therefore is not going to contribute constraints in callers of the procedure (in other words: they're dead). We want to also forget arithmetic facts about these variables as this is a good opportunity to make the path condition smaller, sometimes by a lot! The main issue is that dead variables may be useful intermediate terms in the formula, eg trying to keep only facts about `x` in `y = x + 1 && y = 0` is going to lose a lot of precision. But, if a variable not in `keep` is only mentioned in a simple atom `z = 42` atom, for example, it's safe to forget about it, eg it's safe to remember only `x=0` in `x=0 && z=42` (if only `x` is live). In other words, we can get rid of all atoms containing variables not transitively involved in other atoms that eventually involve live variables. A graph problem! This is guaranteed not to forget anything important and can still trim a lot of atoms in certain situations. Reviewed By: skcho Differential Revision: D22921313 fbshipit-source-id: 6d5db7cbe	5 years ago
Jules Villard	bf40a9119e	[pulse][formula] print readable variable names in unit tests Summary: Perhaps a bit overkill to introduce all this extra complexity but it makes the unit tests much more readable. In fact, this uncovered a bug in the dead variable elimination! Reviewed By: dulmarod Differential Revision: D22925548 fbshipit-source-id: d1f411683	5 years ago
Jules Villard	6f5b125aa0	[pulse][formula] improve printing Summary: Do not always add parens around sub-terms, and add more parens around terms in atoms and normal forms when they can be confused with the atom or normal form structure. Reviewed By: skcho Differential Revision: D22925549 fbshipit-source-id: 8646e96a5	5 years ago
Jules Villard	934a13a134	[pulse][minor] name ~callee argument for readability Summary: To make sure we don't mix up the order of arguments. Reviewed By: dulmarod Differential Revision: D22921348 fbshipit-source-id: 9b5333bf3	5 years ago
Jules Villard	62e84185b1	[pulse] a few more unit tests Summary: These will change to more interesting outputs in the next diff. Reviewed By: dulmarod Differential Revision: D22921349 fbshipit-source-id: c58c6240a	5 years ago
Jules Villard	97fcc3b0ad	[pulse] apply equality relation to terms to be added to the equality relation Summary: Extra normalization gives extra precision. This doesn't seem to negatively impact perf. Reviewed By: skcho Differential Revision: D22867109 fbshipit-source-id: 5b82ec377	5 years ago
Jules Villard	2eb6eb3655	[pulse] skeleton for unit testing pulse Summary: Add unit tests to pulse in order to write tests for the arithmetic solver, because it is a pain to write programs to do that end to end. Reviewed By: ezgicicek Differential Revision: D22864607 fbshipit-source-id: 0a20a3593	5 years ago
Jules Villard	7ccec3fd99	[build] make dune format files when testing Summary: This is needed to make dune auto-updating of unit tests introduced in the next diff cohabit peacefully with our tests to make sure code stays correctly formatted wrt ocamlformat. Also, more auto-formatting = better. Reviewed By: da319 Differential Revision: D22865004 fbshipit-source-id: 91c47ab08	5 years ago
Jules Villard	a64f311ea8	[formula] remember results of normalization Summary: Normalization is potentially expensive and its result should be remembered if the formula keeps being used. In the future we might use this to make normalization more incremental. Also rename PathCondition.satisfiable -> is_unsat to match PulseFormula.is_unsat. Reviewed By: skcho Differential Revision: D22728264 fbshipit-source-id: 7759b33ac	5 years ago
Jules Villard	cfa81d168d	[pulse] check formula unsat more often Summary: Now that this is a cheap operation, use it whenever we are checking the satisfiability of the path condition. Reviewed By: skcho Differential Revision: D22724373 fbshipit-source-id: df31c6010	5 years ago
Jules Villard	f1e9e28f73	[pudge] delete Summary: Pausing the experiment in favour of new PulseFormula. Can be resurrected later. Reviewed By: skcho Differential Revision: D22576274 fbshipit-source-id: 76529d767	5 years ago
Jules Villard	5a39c158c5	[pulse] arithmetic domain: take 4! Summary: This time it's personal. Roll out pulse's own arithmetic domain to be fast and be able to add precision as needed. Formulas are precise representations of the path condition to allow for good inter-procedural precision. Reasoning on these is somewhat ad-hoc (except for equalities, but even these aren't quite properly saturated in general), so expect lots of holes. Skipping dead code in the interest of readability as this (at least temporarily) doesn't use pudge anymore. This may make a come-back as pudge has/will have better precision: the proposed implementation of `PulseFormula` is very cheap so can be used any time we could want to prune paths (see following commits), but this comes at the price of some precision. Calling into pudge at reporting time still sounds like a good idea to reduce false positives due to infeasible paths. #skipdeadcode Reviewed By: skcho Differential Revision: D22576004 fbshipit-source-id: c91793256	5 years ago
Jules Villard	c7305245c5	[istd][minor] no need to name ~fold in fold_of_pervasives_map_fold Summary: It's typically used inside another ~fold argument and it gets too verbose. Reviewed By: da319 Differential Revision: D22846501 fbshipit-source-id: 2fdd4271f	5 years ago
Ezgi Çiçek	577d4679da	[absint][pulse] Remove NeverJoin Summary: There used to be `JoinAfter n` mode where we would try to join `n` states instead of always making disjunctions. It got deleted in D14258485 and Pulse's underlying (pre-disjuncts) domain doesn't even have a join operation. `NeverJoin` mode is not useful in Pulse anymore: pulse will diverge or OOM if we don't limit the number of disjuncts. It is also not used by any other analyzer. Let's remove it. Reviewed By: jvillard Differential Revision: D22817425 fbshipit-source-id: 1e658f11d	5 years ago
Ezgi Çiçek	feefda3e59	Wrap Java's PatternMatch into its own module Summary: This diff refactors Java specific `PatternMatch` functions into its own module. When `PatternMatch.ml` was originally created, it was mainly for Java but now it also supports ObjC. Let's refactor it to reflect the Java/ObjC separation: move all functions that operate on Java procnames into Java submodule. Reviewed By: jvillard Differential Revision: D22816504 fbshipit-source-id: ff6b64b29	5 years ago
Daiva Naudziuniene	221d0b62ab	[pulse] Model builtin __new as returning non-null Summary: We model internal builtin `__new` function to return a non-null value. This fixes nullptr_dereference false positives where we explicitly check the result of a function call for nullptr when the function returns a newly created object. Reviewed By: jvillard Differential Revision: D22772217 fbshipit-source-id: 37d209697	5 years ago
Jules Villard	660eceb20f	[pulse] log summary creation Summary: This step does extra normalization so it's useful to see what's going on when debugging. Log stuff in the html debug of the exit node. Reviewed By: da319 Differential Revision: D22596248 fbshipit-source-id: cde3bbb6c	5 years ago
Jules Villard	9578ec74c9	[pulse] model operator== and operator!= for iterators Summary: Pulse has models for iterators that make them use a fake field to remember the element of the collection they point to. But, not all methods are modelled, and some of them look at the real field, eg `operator==`. Since we don't update the real field in the model, this causes imprecision. The imprecision was visible in pudge. Reviewed By: skcho Differential Revision: D22576003 fbshipit-source-id: 2af6be646	5 years ago
Jules Villard	ae57f217d2	[pulse] don't always mistake equality for aliasing Summary: When applying function summaries, we are careful not to violate the summary's assumptions about non-aliasing. For example, the summary we generate for `foo(x,y) { x = y; }` will have `x` and `y` be allocated to two different `AbstractValue.t` in the heap, representing disjointness. However, the current logic is too coarse and also rejects passing the same pure value to functions that made no assumption about them being equal or different, eg `goo(int x,int y) { int z = x + y; }`. This is because the corresponding `AbstractValue.t` are different in the callee's summary, but are represented by only one same value in callers such as `goo(i,i)`. This diff restricts the "don't violate aliasing" condition to only consider heap-allocated values. This is consistent with separation logic by the way: we use the implication `x\|->- * y\|->- \|- x≠y`, which is valid only when both `x` and `y` are both allocated in the heap as in the left-hand-side of `\|-`. Reviewed By: skcho Differential Revision: D22574297 fbshipit-source-id: 206a18499	5 years ago
Daiva Naudziuniene	50d659b750	Update type of procdesc and closure expression to contain information about capture variable mode Summary: We update the type of captured variables to include information about capture mode (`ByReference` or `ByValue`) both for procdesc attributes and the closure expression. For lambda: closure expression now contains correct capture mode for capture variables. Procdesc still does not contain information about captured variables which we will address in the next diff. For objc blocks: at the moment all captured variables have mode `ByReference`. Added TODOs to fix this. Reviewed By: jvillard Differential Revision: D22572054 fbshipit-source-id: 4c88678ee	5 years ago

1 2 3 4 5 ...

288 Commits (b3c74c4152357dda224a4e5c110aaf953d8ba3fa)