infer_clone

Commit Graph

Author	SHA1	Message	Date
Jules Villard	3e7bf4343b	[pulse] make unit tests more robust to adding more tests Summary: Reset the state before each test so that adding tests doesn't affect other tests by shifting the ids of their anonymous variables. Reviewed By: skcho Differential Revision: D23194171 fbshipit-source-id: 7b717f160	5 years ago
Jules Villard	6fae5f641e	[pulse] change constants to be rationals Summary: These are the only ones we need, it turns out the other types (string, proc names, ...) were dead code. The changes the integer constants to rational constants, to match the domain of the linear arithmetic engine. Reviewed By: skcho Differential Revision: D23164136 fbshipit-source-id: 755c3f526	5 years ago
Jules Villard	0433e9592e	[pulse] new new arithmetic Summary: Instead of alternating between a normal form and a tree structure, always keep a normal form. Except the normal form is not always fully normalized. Overall, it's a bit faster than the previous iteration, while being more precise! In particular, linear arithmetic aims at being much more complete. Reviewed By: skcho Differential Revision: D23134209 fbshipit-source-id: 5f9ec6ece	5 years ago
Daiva Naudziuniene	69e0dce0ed	[pulse] fix end() iterator false positive Summary: Before we were modelling `vector.end()` as returning a fresh pointer every time is was called. It is common to check if an iterator is not the `end()` iterator and proceed to dereference the iterator in that case. In such code pattern `vector.end()` is called twice and returns different fresh values which causes false positives. To fix this, we add a special internal field `__infer_model_backing_array_pointer_to_last_element` to a vector to denote its end. Now, every time we call `vector.end()` we return the value of this field. We introduce a new attribute `EndOfCollection` to mark `end` iterator as the existing `EndIterator` invalidation is not suitable when we need to read the same value multiple times. Reviewed By: jvillard Differential Revision: D23101185 fbshipit-source-id: fa8a33b58	5 years ago
Jules Villard	7b743ceb1a	[pulse][formula] forget dead facts Summary: At the end of analysing a procedure we call `simplify ~keep:vars_live_in_pre_post`. Any variable not in `vars_live_in_pre_post` is not mentioned anywhere else in the state and therefore is not going to contribute constraints in callers of the procedure (in other words: they're dead). We want to also forget arithmetic facts about these variables as this is a good opportunity to make the path condition smaller, sometimes by a lot! The main issue is that dead variables may be useful intermediate terms in the formula, eg trying to keep only facts about `x` in `y = x + 1 && y = 0` is going to lose a lot of precision. But, if a variable not in `keep` is only mentioned in a simple atom `z = 42` atom, for example, it's safe to forget about it, eg it's safe to remember only `x=0` in `x=0 && z=42` (if only `x` is live). In other words, we can get rid of all atoms containing variables not transitively involved in other atoms that eventually involve live variables. A graph problem! This is guaranteed not to forget anything important and can still trim a lot of atoms in certain situations. Reviewed By: skcho Differential Revision: D22921313 fbshipit-source-id: 6d5db7cbe	5 years ago
Jules Villard	bf40a9119e	[pulse][formula] print readable variable names in unit tests Summary: Perhaps a bit overkill to introduce all this extra complexity but it makes the unit tests much more readable. In fact, this uncovered a bug in the dead variable elimination! Reviewed By: dulmarod Differential Revision: D22925548 fbshipit-source-id: d1f411683	5 years ago
Jules Villard	6f5b125aa0	[pulse][formula] improve printing Summary: Do not always add parens around sub-terms, and add more parens around terms in atoms and normal forms when they can be confused with the atom or normal form structure. Reviewed By: skcho Differential Revision: D22925549 fbshipit-source-id: 8646e96a5	5 years ago
Jules Villard	934a13a134	[pulse][minor] name ~callee argument for readability Summary: To make sure we don't mix up the order of arguments. Reviewed By: dulmarod Differential Revision: D22921348 fbshipit-source-id: 9b5333bf3	5 years ago
Jules Villard	62e84185b1	[pulse] a few more unit tests Summary: These will change to more interesting outputs in the next diff. Reviewed By: dulmarod Differential Revision: D22921349 fbshipit-source-id: c58c6240a	5 years ago
Jules Villard	97fcc3b0ad	[pulse] apply equality relation to terms to be added to the equality relation Summary: Extra normalization gives extra precision. This doesn't seem to negatively impact perf. Reviewed By: skcho Differential Revision: D22867109 fbshipit-source-id: 5b82ec377	5 years ago
Jules Villard	2eb6eb3655	[pulse] skeleton for unit testing pulse Summary: Add unit tests to pulse in order to write tests for the arithmetic solver, because it is a pain to write programs to do that end to end. Reviewed By: ezgicicek Differential Revision: D22864607 fbshipit-source-id: 0a20a3593	5 years ago
Jules Villard	7ccec3fd99	[build] make dune format files when testing Summary: This is needed to make dune auto-updating of unit tests introduced in the next diff cohabit peacefully with our tests to make sure code stays correctly formatted wrt ocamlformat. Also, more auto-formatting = better. Reviewed By: da319 Differential Revision: D22865004 fbshipit-source-id: 91c47ab08	5 years ago
Jules Villard	a64f311ea8	[formula] remember results of normalization Summary: Normalization is potentially expensive and its result should be remembered if the formula keeps being used. In the future we might use this to make normalization more incremental. Also rename PathCondition.satisfiable -> is_unsat to match PulseFormula.is_unsat. Reviewed By: skcho Differential Revision: D22728264 fbshipit-source-id: 7759b33ac	5 years ago
Jules Villard	cfa81d168d	[pulse] check formula unsat more often Summary: Now that this is a cheap operation, use it whenever we are checking the satisfiability of the path condition. Reviewed By: skcho Differential Revision: D22724373 fbshipit-source-id: df31c6010	5 years ago
Jules Villard	f1e9e28f73	[pudge] delete Summary: Pausing the experiment in favour of new PulseFormula. Can be resurrected later. Reviewed By: skcho Differential Revision: D22576274 fbshipit-source-id: 76529d767	5 years ago
Jules Villard	5a39c158c5	[pulse] arithmetic domain: take 4! Summary: This time it's personal. Roll out pulse's own arithmetic domain to be fast and be able to add precision as needed. Formulas are precise representations of the path condition to allow for good inter-procedural precision. Reasoning on these is somewhat ad-hoc (except for equalities, but even these aren't quite properly saturated in general), so expect lots of holes. Skipping dead code in the interest of readability as this (at least temporarily) doesn't use pudge anymore. This may make a come-back as pudge has/will have better precision: the proposed implementation of `PulseFormula` is very cheap so can be used any time we could want to prune paths (see following commits), but this comes at the price of some precision. Calling into pudge at reporting time still sounds like a good idea to reduce false positives due to infeasible paths. #skipdeadcode Reviewed By: skcho Differential Revision: D22576004 fbshipit-source-id: c91793256	5 years ago
Jules Villard	c7305245c5	[istd][minor] no need to name ~fold in fold_of_pervasives_map_fold Summary: It's typically used inside another ~fold argument and it gets too verbose. Reviewed By: da319 Differential Revision: D22846501 fbshipit-source-id: 2fdd4271f	5 years ago
Ezgi Çiçek	577d4679da	[absint][pulse] Remove NeverJoin Summary: There used to be `JoinAfter n` mode where we would try to join `n` states instead of always making disjunctions. It got deleted in D14258485 and Pulse's underlying (pre-disjuncts) domain doesn't even have a join operation. `NeverJoin` mode is not useful in Pulse anymore: pulse will diverge or OOM if we don't limit the number of disjuncts. It is also not used by any other analyzer. Let's remove it. Reviewed By: jvillard Differential Revision: D22817425 fbshipit-source-id: 1e658f11d	5 years ago
Ezgi Çiçek	feefda3e59	Wrap Java's PatternMatch into its own module Summary: This diff refactors Java specific `PatternMatch` functions into its own module. When `PatternMatch.ml` was originally created, it was mainly for Java but now it also supports ObjC. Let's refactor it to reflect the Java/ObjC separation: move all functions that operate on Java procnames into Java submodule. Reviewed By: jvillard Differential Revision: D22816504 fbshipit-source-id: ff6b64b29	5 years ago
Daiva Naudziuniene	221d0b62ab	[pulse] Model builtin __new as returning non-null Summary: We model internal builtin `__new` function to return a non-null value. This fixes nullptr_dereference false positives where we explicitly check the result of a function call for nullptr when the function returns a newly created object. Reviewed By: jvillard Differential Revision: D22772217 fbshipit-source-id: 37d209697	5 years ago
Jules Villard	660eceb20f	[pulse] log summary creation Summary: This step does extra normalization so it's useful to see what's going on when debugging. Log stuff in the html debug of the exit node. Reviewed By: da319 Differential Revision: D22596248 fbshipit-source-id: cde3bbb6c	5 years ago
Jules Villard	9578ec74c9	[pulse] model operator== and operator!= for iterators Summary: Pulse has models for iterators that make them use a fake field to remember the element of the collection they point to. But, not all methods are modelled, and some of them look at the real field, eg `operator==`. Since we don't update the real field in the model, this causes imprecision. The imprecision was visible in pudge. Reviewed By: skcho Differential Revision: D22576003 fbshipit-source-id: 2af6be646	5 years ago
Jules Villard	ae57f217d2	[pulse] don't always mistake equality for aliasing Summary: When applying function summaries, we are careful not to violate the summary's assumptions about non-aliasing. For example, the summary we generate for `foo(x,y) { x = y; }` will have `x` and `y` be allocated to two different `AbstractValue.t` in the heap, representing disjointness. However, the current logic is too coarse and also rejects passing the same pure value to functions that made no assumption about them being equal or different, eg `goo(int x,int y) { int z = x + y; }`. This is because the corresponding `AbstractValue.t` are different in the callee's summary, but are represented by only one same value in callers such as `goo(i,i)`. This diff restricts the "don't violate aliasing" condition to only consider heap-allocated values. This is consistent with separation logic by the way: we use the implication `x\|->- * y\|->- \|- x≠y`, which is valid only when both `x` and `y` are both allocated in the heap as in the left-hand-side of `\|-`. Reviewed By: skcho Differential Revision: D22574297 fbshipit-source-id: 206a18499	5 years ago
Daiva Naudziuniene	50d659b750	Update type of procdesc and closure expression to contain information about capture variable mode Summary: We update the type of captured variables to include information about capture mode (`ByReference` or `ByValue`) both for procdesc attributes and the closure expression. For lambda: closure expression now contains correct capture mode for capture variables. Procdesc still does not contain information about captured variables which we will address in the next diff. For objc blocks: at the moment all captured variables have mode `ByReference`. Added TODOs to fix this. Reviewed By: jvillard Differential Revision: D22572054 fbshipit-source-id: 4c88678ee	5 years ago
Josh Berdine	7e77bad4d2	[sledge] Change: Implement Fol using a solver-independent intermediate type Summary: In order to allow implementations of the single Fol interface using multiple backend first-order logic solvers, add explicit definitions of terms and formulas in the Fol module, and implement Context in terms of them. The Fol interface supports freely mixing Terms and Formulas, in particular there is `Term.ite : cnd:Formula.t -> thn:Term.t -> els:Term.t -> Term.t` which allows Formulas to appear in Terms. The Fol implementation performs enough normalization to enable using an internal representation of terms that is strictly partitioned into "theory terms" and "formulas", which are stratified below "conditional terms" and then below "general terms". This partitioning and stratification enables using backend solvers that do not support mixing formulas in terms. Reviewed By: jvillard Differential Revision: D22170506 fbshipit-source-id: a014ee7d7	5 years ago
Josh Berdine	eca73cf39b	[sledge] Build: Move sledge equality solver to separate lib Reviewed By: ngorogiannis Differential Revision: D22170508 fbshipit-source-id: 1e9cf4a79	5 years ago
Daiva Naudziuniene	35011757dc	[pulse] Add a flag to pass functions that we want to model as returning non-null Summary: To avoid NULLPTR_DEREFERENCE false positives we want to model some functions as returning non-null. A new flag --pulse-model-return-nonnull allows us to provide a list of such functions. Reviewed By: ezgicicek Differential Revision: D22431564 fbshipit-source-id: 9944c7382	5 years ago
Jules Villard	a89d3db364	[pulse] change recency maps to be backed by lists Summary: This one is observed to be more memory efficient. Intuitively, maps need to be re-allocated more often than lists for balancing. In pulse, we'll often only ever add new values, in increasing order (when they are fresh variables created as we symbolically execute the program), which pushes maps into their worst-case allocation pattern. At least I suspect that's what happens. With lists, this case is handled much better as lists are not re-allocated when adding elements. This is somewhat confirmed by benchmarking and observing GC stats. Reviewed By: skcho Differential Revision: D22140908 fbshipit-source-id: 29815112f	5 years ago
Daiva Naudziuniene	0ab3689f1f	[infer] NULLPTR_DEREFERENCE false positive caused by thread_local variable Summary: Keyword `thread_local` in cpp allows us to create a variable with thread storage duration, meaning that the object's lifetime begins when the thread begins and ends when the thread ends. We get `NULLPTR_DEREFERENCE` false positive for `thread_local` variable since we reallocate it in the `VariableLifetimeBegins` metadata instruction and we do not see further updates to the variable. To solve the issue we special case `VariableLifetimeBegins` instruction for global variables. Reviewed By: jvillard Differential Revision: D22284135 fbshipit-source-id: 13c14ef90	5 years ago
Dulma Churchill	85ee958bf9	[pulse] Add model for NSObject.init Summary: This model is very important in the analysis of ObjC classes because the pattern ``` - (instancetype)init { if (self = [super init]) { ... } return self; } ``` is very common, so we need to know that if the super class is `NSObject`, the implementation of `init` is returning `self`, otherwise it's a skip function and we don't get the correct spec for the function. We fix some memory leak FP with this model, see test. Reviewed By: ezgicicek Differential Revision: D22259281 fbshipit-source-id: 3ee48c827	5 years ago
Daiva Naudziuniene	2c48e61031	[pulse] A new issue type OPTIONAL_EMPTY_ACCESS for trying to access folly::Optional when it is folly::none Summary: We need to check if `folly::Optional` is not `folly::none` if we want to retrieve the value, otherwise a runtime exception is thrown: ``` folly::Optional<int> foo{folly::none}; return foo.value(); // bad ``` ``` folly::Optional<int> foo{folly::none}; if (foo) { return foo.value(); // ok } ``` This diff adds a new issue type that reports if we try to access `folly::Optional` value when it is known to be `folly::none`. Reviewed By: ezgicicek Differential Revision: D22053352 fbshipit-source-id: 32cb00a99	5 years ago
Dulma Churchill	2d4b3c9acd	[builtins] Change the name of __free_cf to the more appropriate _objc_bridge_transfer and delete the biabduction implementation Summary: This continues on the previous diff by removing the model for `__bridge_transfer` in biabduction. This also had the name __free_cf which we kept for compatibility with biabduction until now but that we can now change. Reviewed By: ezgicicek Differential Revision: D22207396 fbshipit-source-id: 7a175eca6	5 years ago
Daiva Naudziuniene	412d2777eb	[pulse] Add a flag to pass functions that we want to model as abort Summary: To avoid NULLPTR_DEREFERENCE false positives we want to treat some functions as `abort`. A new flag `--pulse-model-abort` allows us to provide a list of such functions. Reviewed By: ezgicicek Differential Revision: D21962555 fbshipit-source-id: d46b93c99	5 years ago
Ezgi Çiçek	c23e0044fc	[infer] Remove ppx_compare workaround for nonrec types (2) Summary: The past issue with ppx_compare on nonrec types has (at some point) been fixed. Greped for `let compare = compare` and removed the workaround for `nonrec`. Reviewed By: jberdine Differential Revision: D21973087 fbshipit-source-id: 5e2043e20	5 years ago
Josh Berdine	9c8f2e4a5c	[sledge] Build: Move Timer to Nonstdlib Summary: It has no dependencies on the rest of the sledge codebase and might be more generally useful. Reviewed By: jvillard Differential Revision: D21720980 fbshipit-source-id: b4f061e73	5 years ago
Jules Villard	8a1c10f8a1	remove dynamic severity: Reporting.log_{error,warning} -> log_issue Summary: See previous diff: issues are always reported with the same severity so recognise that and just use their default severity in "modern" checkers. Reviewed By: ngorogiannis Differential Revision: D21904591 fbshipit-source-id: fb5387e35	5 years ago
Dulma Churchill	aa6fe7963c	[pulse] Add dealloc calls for ObjC objects that are about to become unreachable Summary: This diff implements part of the memory management for Objective-C classes in ARC, namely that `dealloc` is called when the objects become unreachable. In reality the semantics of ARC says that this happens when their reference count becomes 0, but we are not modelling this yet in Pulse. However, we could in the future. This fixes false positives memory leaks when the memory is freed in dealloc. `dealloc` is often implicit in Objective-C, it also calls the dealloc of instance variables and superclass. None of this is implemented yet, and will be done in a future diff. This will be added in the frontend probably, similarly to how it's done for C++ destructors. This is an important part of modelling Objective-C semantics in Infer, I looked at whether this should be a preanalysis to be used by all analyses but this needs Pulse. So the idea is that any analysis that needs to understand Objective-C memory model well, should have Pulse as a preanalysis. Reviewed By: jvillard Differential Revision: D21762292 fbshipit-source-id: ced014324	5 years ago
Dulma Churchill	f638e741ae	[pulse] Add DynamicType attribute and use it in the model of ObjC alloc Summary: Adding a new attribute for dynamic type. It is set in the models of constructors, currently only in `alloc` in Objective-C. We use it in the following diff to figure out which `dealloc` method to call. However it could be useful for other things, such as dynamic dispatch. #skipdeadcode Reviewed By: jvillard Differential Revision: D21739928 fbshipit-source-id: 9276c0a4d	5 years ago
Ezgi Çiçek	964388f56c	[pulse] Brush up Collection/List add and remove models Summary: The models were too naive before since they invalidated the underlying array completely (copying C++'s push_back model), causing spurious vector invalidation issues in Java. This diff adds more reasonable models. Reviewed By: skcho Differential Revision: D21787543 fbshipit-source-id: a5a59ff69	5 years ago
Daiva Naudziuniene	98092481d4	[pulse] Special case for std::function:operator=( nullptr ) Summary: Assigning `nullptr` to `std::function` was causing `NULLPTR_DEREFERENCE` as our model was expecting to get an object in the right hand side of the assignment (`std::function::operator=`) and was dereferencing that object. Assigning `nullptr` to `std::function` removes callable object from it. We model this special case by creating a fresh value. Reviewed By: skcho Differential Revision: D21685318 fbshipit-source-id: 2d4af1933	5 years ago
Jules Villard	eab7e9aeb7	minor readability improvement in IssueType.ml Summary: - avoid creating issues just to look up their `unique_id` in the set - avoid `let _ =` since it can hide partial applications - delete outdated comment Reviewed By: skcho Differential Revision: D21663959 fbshipit-source-id: e50d02447	5 years ago
Sungkeun Cho	719b72cb4f	[pulse] Avoid partitioning abstract values Summary: `partition` always constructs two new maps, which is expensive when there are a lot of entries. Let's avoid it if possible. Reviewed By: jvillard Differential Revision: D21684298 fbshipit-source-id: a8674d358	5 years ago
Jules Villard	4e28980c8e	[errlog] reporting asserts checker matches issue-type Summary: Add an extra argument everywhere we report about the identity of the checker doing the reporting. This isn't type safe in any way, i.e. a checker can masquerade as another. But, hopefully it's enough to ensure checker writers (and diff reviewers) have a chance to reflect on what issue type they are reporting. Reviewed By: ngorogiannis Differential Revision: D21638823 fbshipit-source-id: b4a4b0c0a	5 years ago
Josh Berdine	61566caddf	[ocamlformat] Set break-sequences = true Summary: Add `break-sequences = true` to .ocamlformat and reformat. Reviewed By: jvillard Differential Revision: D21583901 fbshipit-source-id: eb4ec836c	5 years ago
Josh Berdine	65f369cf35	[ocamlformat] Reformat repo with new version Reviewed By: jvillard Differential Revision: D21583046 fbshipit-source-id: ee4793880	5 years ago
Dulma Churchill	ef7bc324e3	[pulse] Add a flag to model methods for memory ownership transfer Summary: Just like `CFBridgingRelease` we want to be able to model functions that are specific to a given codebase that make a transfer of memory ownership so that developers don't need to worry about releasing that memory anymore, and hence, we don't want to report leaks on that memory. Things get a little more complicated, because some of the functions we want to model are in a specific namespace, so with this flag we take both cases into account, when we are dealing with namespaces or not. Reviewed By: jvillard Differential Revision: D21404409 fbshipit-source-id: c36bd7afc	5 years ago
Daiva Naudziuniene	ca2ec281c7	[pulse] Model for iterator operator-- Summary: Currently we get false positive if we apply `operator--` to the `end()` iterator. To solve this, we model iterator `operator--` not to raise an error for the `EndIterator` invalidation, but to create a fresh element in the underlying array. Reviewed By: ezgicicek Differential Revision: D21476353 fbshipit-source-id: 5c722372e	5 years ago
Daiva Naudziuniene	eaf95951f5	[pulse] Modeling std::vector::end() Summary: It is undefined behavior to dereference end iterator. To catch end iterator dereferencing issues we change iterator model: instead of having `internal pointer` storing the current index, we model it as a pointer to a current index. This allows us to model `end()` iterator as having an invalid pointer and there is no need to create an invalidated element in the vector itself. Reviewed By: ezgicicek Differential Revision: D21178441 fbshipit-source-id: fd6a94b0b	5 years ago
Ezgi Çiçek	faceece120	[pulse] Brush up List.set() model Summary: We mistakenly invalidated the set element which causes spurious vector invalidation errors. Instead, we should modify it without any invalidation. Reviewed By: jvillard Differential Revision: D21521943 fbshipit-source-id: 67963967e	5 years ago
Ezgi Çiçek	5ff6fc93a0	[pulse] Brush up Java iterator models Summary: Java's iterator models were wrong. This causes `VECTOR_INVALIDATION` errors in fbandroid projects. This diff aims to fix it by modeling Java iterators with a current pointer and an underlying collection array. Reviewed By: skcho Differential Revision: D21448322 fbshipit-source-id: 7d44354b5	5 years ago

1 2 3 4 5 ...

262 Commits (3e7bf4343bbd66c1fab4f2fb069211a45d806b7c)