infer_clone

Commit Graph

Author	SHA1	Message	Date
Jules Villard	55871dd285	[pulse][2/2] generate latent issues when null is allocated Summary: See updated tests and code comments: this changes many arithmetic operations to detect when a contradiction "p\|->- * p=0" is about to be detected, and generate a latent issue instead. It's hacky but it does what we want. Many APIs change because of this so there's some code churn but the overall end result is not much worse thanks to monadic operators. Reviewed By: skcho Differential Revision: D26918553 fbshipit-source-id: da2abc652	4 years ago
Martin Trojer	18f28395e8	[clang] migrate to llvm/clang11 Summary: Update Infer to LLVM (clang) 11.1.0. Infer/clang now uses the LLVM 'monorepo' release, simplifying the download script. Some changes done to how/when ASTExporter mangles names, this to avoid the plugin hitting asserts in the clang code when mangling names. Reviewed By: jvillard Differential Revision: D27006986 fbshipit-source-id: 4d4b6ba05	4 years ago
Gabriela Cunha Sampaio	cba144b779	[pulse] Adapting error messages Summary: Adapting error messages in Pulse so that they become more intuitive for developers. Reviewed By: jvillard Differential Revision: D26887140 fbshipit-source-id: 896970ba2	4 years ago
Jules Villard	4c357e434b	[pulse] apply discovered variable equalities eagerly Summary: This resolves a few instances of false negatives; typically: ``` if (x == y) { // HERE x = 10; y = 44; // THERE } ``` We used to get ``` HERE: &x->v * &y ->v' * v == v' THERE: &x->v * &y ->v' * v == v' * v \|-> 10 * v' \|-> 44 ``` The state at THERE was thus inconsistent and detected as such (v` and `v'` are allocated separately in the heap hence cannot be equal). Now we normalize the state more eagerly and so we get: ``` HERE: &x->v * &y->v THERE: &x->v * &y->v * v \|-> 44 ``` Reviewed By: skcho Differential Revision: D26488377 fbshipit-source-id: 568e685f0	4 years ago
Jules Villard	84d1fd3b52	[pulse] add tests Summary: These tests showcase weaknesses of Pulse w.r.t. detecting issues in situations with 1) pointer aliasing, and 2) pointers null-tests Reviewed By: ezgicicek Differential Revision: D26488145 fbshipit-source-id: 3de230bd2	4 years ago
Jules Villard	a1db290c2e	[pulse] models for folly::Optional::operator{*,->}() Summary: These were present for `std::optional` but not `folly::Optional` for some reason. Reviewed By: da319 Differential Revision: D26450400 fbshipit-source-id: 45051e828	4 years ago
Gabriela Cunha Sampaio	bc49f1deb1	[pulse] Adapting --pulse-model-return-nonnull for Java Summary: The `--pulse-model-return-nonnull` config option currently works for C++. Now we will be using it also for Java. Changing type from string list to regexp to make it more general. Reviewed By: ezgicicek Differential Revision: D26367888 fbshipit-source-id: 9a06b9b32	4 years ago
Sungkeun Cho	27ab8bd253	[pulse] Uninitialized check for struct fields Reviewed By: jvillard Differential Revision: D25371929 fbshipit-source-id: 966f333e3	4 years ago
Jules Villard	f5936689a4	[pulse] case split in model of free(3) Summary: Having different behaviours inter-procedurally and intra-procedurally sounds like a bad design in retrospect. The model of free() should not depend on whether we currently know the value is not null as that means some specs are missing from the summary. Reviewed By: skcho Differential Revision: D26019712 fbshipit-source-id: 1ac4316a5	4 years ago
Sungkeun Cho	89c8e25deb	[frontend] Add tests of using single field struct Summary: When a single field struct is initialized with "type x{v}" form, the translated result is not straightforward. For example, ``` struct t { int val_; }; void foo(t x) { t y{x}; } ``` calls the copy constructor with `x`. This is good. ``` void foo(int n) { t y{n}; } ``` assigns the integer `n` to `y.val_`. This is good. ``` t get_v(); void foo() { t y{get_v()}; } ``` assigns return value of `get_v` to `y.val_`, rather than calling the copy constructor. This is not good, but doesn't matter for actual running; `&y.val_` is the same to `&y` and `t` value is the same to `int` value. Reviewed By: jvillard Differential Revision: D26146578 fbshipit-source-id: 8a81bb1db	4 years ago
Sungkeun Cho	8ed44df7f6	[frontend] Fix incorrect order of statements (negation) Summary: This diff fixes incorrect order of statements on `*p = !b;`. Reviewed By: jvillard Differential Revision: D26125069 fbshipit-source-id: 9dcefbd34	4 years ago
Sungkeun Cho	051473394b	[frontend] Fix incorrect order of statements Summary: This diff fixes incorrect order of statements on assignments. In the translation of `LHS=RHS;`, if `RHS` is a complicated expression that introduced new nodes, eg a conditional expression, some load statements for `LHS` came after its usage. To avoid the issue, this diff forces it to introduce new nodes for `LHS`. Reviewed By: jvillard Differential Revision: D26099782 fbshipit-source-id: 27417cd99	4 years ago
Daiva Naudziuniene	16718384b3	[pulse] Optional Empty Access false positives we want to address Reviewed By: skcho Differential Revision: D25846520 fbshipit-source-id: ae60a8c51	4 years ago
Daiva Naudziuniene	0c6eedc835	[pulse] Model std::__optional_storage_base::has_value Summary: Model ` std::__optional_storage_base::has_value` as this is what we see in clang AST when translating `std::optional::has_value` for libc++. For libstdc++, we get `std::optional::has_value` as expected. Reviewed By: skcho, jvillard Differential Revision: D25585543 fbshipit-source-id: b8d9d2902	4 years ago
Daiva Naudziuniene	b5df1be318	[pulse] Model std::vector:empty() Summary: Skipping the analysis of `std::vector::empty()` caused false positives: in the case where `std::vector::empty()` was called several times ("returning" different values each time), we were not able to prune infeasible paths. Model `std::vector::empty()` as returning the same value every time it is called. Reviewed By: ezgicicek Differential Revision: D23904704 fbshipit-source-id: 52e8a2451	4 years ago
Sungkeun Cho	0cbe2f9b08	[pulse] Uninitialized value check in pulse Summary: This diff adds uninitialized value check in pulse. For now, it supports only simple cases, - declared variables with a type of integer, float, void, and pointer - malloced pointer variables that points to integer, float, void, and pointer TODOs: I will add more cases in the following diffs. - declared/malloced array - declared/malloced struct - inter-procedural checking Reviewed By: jvillard Differential Revision: D25269073 fbshipit-source-id: 317df9a85	4 years ago
Daiva Naudziuniene	0343f5c7d9	[pulse] Remove duplicate `by` from a trace Reviewed By: ezgicicek Differential Revision: D25196418 fbshipit-source-id: c1e504099	4 years ago
Daiva Naudziuniene	4e658903ae	[pulse] Check the validity of the addresses captured by lambda only for captures by reference Summary: To look for captured variable address escape we should only check the validity of the addresses captured by reference. Checking the validity of the address captured by value can cause nullptr dereference false positives. Reviewed By: jvillard Differential Revision: D25219347 fbshipit-source-id: faf6f2b00	4 years ago
Jules Villard	29f3941600	[clang] deal with conditionally-destroyed temporaries Summary: This was left as a TODO before: where to place calls to destructors for C++ temporaries that are only conditionally creating when evaluating an expression. This can happen inside the branches of a conditional operation `b?e:f` or in potentially-short-circuited conditions on the righ-hand side of `&&` and `\|\|` operators. Following the compilation scheme of clang (observed by looking at the generated LLVM bitcode), we instrument the program with "marker" variables, so that for instance `X x = true?X():y;` becomes (following the execution on the true branch): ``` marker1 = 0; // initialize all markers to 0 PRUNE(true) // entering true branch X::X(&temporary); // create temporary... marker1 = 1; // ...triggers setting its marker to 1 X::X(&x, &temporary); // finish expression if (marker1) { X::~X(&temporary); // conditionally destroy the temporary } ``` In this diff, you'll find code for: - associating markers to temporaries that need them - code to initialize markers to 0 before full-expressions - code to conditionally destroy temporaries based on the values of the markers once the full-expression has finished evaluating Reviewed By: da319 Differential Revision: D24954070 fbshipit-source-id: cf15df7f7	4 years ago
Daiva Naudziuniene	019adf7e78	[pulse] Model for folly::Optional::get_pointer Summary: Model `folly::Optional::get_pointer` which returns an address to a value if exists or `nullptr` if empty. Reviewed By: jvillard Differential Revision: D24935677 fbshipit-source-id: 9d990fe07	4 years ago
Jules Villard	f411c7d131	[pulse] do not stop at the first error in function calls Summary: We deliberately stopped as soon as an error was detected when applying a function call. This is not good as other pre/posts of the function may apply cleanly, which would allow us to cover more behaviours of the code. Went on a bit of a refactoring tangeant while fixing this, to clarify the `Ok None`/`Ok Some _`/`Error _` datatype returned by PulseInterproc. Now we report errors as soon as we find them during function calls but continue accumulating specs afterwards. Reviewed By: da319 Differential Revision: D24888768 fbshipit-source-id: d5f2c29d7	4 years ago
Jules Villard	578583f2ab	[pulse] check that new arithmetic facts are consistent with the heap Summary: Communicate new facts from the arithmetic domain to the memory domain to detect contradictions between the two. Reviewed By: jberdine Differential Revision: D24832079 fbshipit-source-id: 2caf8e9af	4 years ago
Jules Villard	e32f6ca360	[clang] fix bad interaction between ConditionalOperator and initializers Summary: This is several inter-connected changes together to keep the tests happy. The ConditionalOperator `b?t:e` is translated by first creating a placeholder variable to temporarily store the result of the evaluation in each branch, then the real thing we want to assign to reads that variable. But, there are situations where that changes the semantics of the expression, namely when the value created is a struct on the stack (eg, a C++ temporary). This is because in SIL we cannot assign the address of a program variable, only its contents, so by the time we're out of the conditional operator we cannot set the struct value correctly anymore: we can only set its content, which we did, but that results in a "shifted" struct value that is one dereference away from where it should be. So a batch of changes concern `conditionalOperator_trans`: - instead of systematically creating a temporary for the conditional, use the `trans_state.var_exp_typ` provided from above if available when translating `ConditionalOperator` - don't even set anything if that variable was already initialized by merely translating the branch expression, eg when it's a constructor - fix long-standing TODO to propagate these initialization facts accurately for ConditionalOperator (used by `init_expr_trans` to also figure out if it should insert a store to the variable being initialised or not) The rest of the changes adapt some relevant other constructs to deal with conditionalOperator properly now that it can set the current variable itself, instead of storing stuff inside a temp variable. This change was a problem because some constructs, eg a variable declaration, will insert nodes that set up the variable before calling its initialization, and now the initialization happens before that setup, in the translation of the inner conditional operator, which naturally creates nodes above the current one. - add a generic helper to force a sequential order between two translation results, forcing node creation if necessary - use that in `init_expr_trans` and `cxxNewExpr_trans` - adjust many places where `var_exp_typ` was incorrectly not reset when translating sub-expressions The sequentiality business creates more nodes when used, and the conditionalOperator business uses fewer temporary variables, so the frontend results change quite a bit. Note that biabduction tests were invaluable in debugging this. There could be other constructs to adjust similarly to cxxNewExpr that were not covered by the tests though. Added tests in pulse that exercises the previous bug. Reviewed By: da319 Differential Revision: D24796282 fbshipit-source-id: 0790c8d17	4 years ago
Daiva Naudziuniene	58f1fd8b32	[pulse] Optional Empty Access for std::optional Reviewed By: jvillard Differential Revision: D24760820 fbshipit-source-id: bedf6aee3	4 years ago
Daiva Naudziuniene	eb4684f6d5	[pulse] Less precise model for constructing optional from value Summary: We recently introduced a more precise model for constructing an optional from a value by making a shallow copy. However, this introduced Use After Delete false positives. For now, we go back to a less precise model by creating a fresh value. A proper model would be to either make a deep copy or call the copy constructor for a value. We will address this in the following diff. Reviewed By: jvillard Differential Revision: D24826749 fbshipit-source-id: 3e5e4edeb	4 years ago
Daiva Naudziuniene	a4241eeb43	[pulse] Refactor Optional models Summary: Refactor `folly::Optional` models to make them easier to reuse for `std::optional` Reviewed By: jvillard Differential Revision: D24760053 fbshipit-source-id: f665e84c8	4 years ago
Daiva Naudziuniene	3d74f39102	[pulse] Improve trace for Optional Empty Access Summary: `folly::Optional::value()` returns a reference, hence an error was shown when the actual value was being accessed. Since `value()` throws an exception in case of `folly::none`, we want to show the error message at the call site of `value()`. We do this by dereferencing the result of `value()` in the model. Reviewed By: jvillard Differential Revision: D24702875 fbshipit-source-id: ca9f30349	4 years ago
Daiva Naudziuniene	b17861b1c8	[pulse] More precise model for constructing folly::Optional<Value> from Value Summary: Before we were creating a fresh internal value when we were constructing `folly::Optional`. This diff models `folly::Optional` constructor more precisely by copying the given value. There was also a missing dereference in the model of `value_or` Reviewed By: jvillard Differential Revision: D24621016 fbshipit-source-id: c86d3c157	4 years ago
Daiva Naudziuniene	059c0f24a2	[pulse] Model Optional value_or Summary: Model `folly::Optional::value_or(default)` to return value if not-empty and `default` if empty. Reviewed By: jvillard Differential Revision: D24539456 fbshipit-source-id: cc9e176cc	4 years ago
Jules Villard	7fdb33b710	[pulse] report errors only when the PRUNE nodes along the path are true Summary: Take another page from the Incorrectness Logic book and refrain from reporting issues on paths unless we know for sure that this path will be taken. Previously, we would report on paths that are merely not impossible. This goes very far in the other direction, so it's possible we'll want to go back to some sort of middle ground. Or maybe not. See the changes in the tests to get a sense of what we're missing. Reviewed By: ezgicicek Differential Revision: D24014719 fbshipit-source-id: d451faf02	4 years ago
Daiva Naudziuniene	22d317c940	[pulse] Move pulse model flags to .inferconfig for pulse tests Summary: The title Reviewed By: skcho Differential Revision: D23960402 fbshipit-source-id: edc3bc2d0	4 years ago
Daiva Naudziuniene	91a33f6edc	[frontend] Captured struct variables in cpp lambdas Summary: Structs captured both by reference or by value should have reference in their type. Struct captured by value should first call copy constructor. In this diff we fix the type of the captured variable to include reference. Copy constructor injection is left for the future. Reviewed By: jvillard Differential Revision: D23688713 fbshipit-source-id: d13748b5d	4 years ago
Daiva Naudziuniene	857daf63c9	[frontend] Capture reference variables Summary: Variables captured without initialization do not have correct type inside lambda's body. This diff sets the correct type of captured reference variables inside procdesc and makes sure the translation of captured variables is correct. The translation of lambda's body will then take into account the type of captured var from procdesc. Reviewed By: jvillard Differential Revision: D23678371 fbshipit-source-id: ed16dc978	4 years ago
Daiva Naudziuniene	42abe5b277	[frontend] Fix type of captured vars in lambda's body Summary: Add missing reference to the type of variable captured by reference without initialization. Reviewed By: jvillard Differential Revision: D23567685 fbshipit-source-id: b4e2ac0b6	4 years ago
Daiva Naudziuniene	d0cb245303	[frontend] Fix capture init for cpp lambdas Summary: We were missing assignment to captured variables with initializers. Consider the following example: ``` S* update_inside_lambda_capture_and_init(S* s) { S* object = nullptr; auto f = [& o = object](S* s) { o = s; }; f(s); return object; } ``` which was translated to ``` VARIABLE_DECLARED(o:S&); &o:S&=&object &f =(_fun...lambda..._operator(),([by ref]&o &o:S&)) ``` However, we want to capture `o` (which is an address of `object`), rather `&o` in closure. After the diff ``` VARIABLE_DECLARED(o:S&); &o:S&=&object n$7=&o:S& &f =(_fun...lambda..._operator(),([by ref]n$7 &o:S&)) ``` Reviewed By: jvillard Differential Revision: D23567346 fbshipit-source-id: 20f77acc2	4 years ago
Jules Villard	03bc3f31c8	[pulse] add option to skip functions/classes Summary: This can be useful to make pulse forget about tricky parts of the code. Treat "skipped" procedures as unknown so heuristics for mutating the return value and parameters passed by reference are applied. Reviewed By: ezgicicek Differential Revision: D23729410 fbshipit-source-id: d7a4924a8	4 years ago
Daiva Naudziuniene	4401701578	[pulse] Model for std::function copy constructor Summary: Added a model for copy constructor for `std::function`. In most cases, the SIL instruction `std::function::function(&dest, &src)` gives us pointers to `dest` and `src`, hence, we model the copy constructor as a shallow copy. However, in some cases, e.g. `std::function f = lambda_literal`, SIL instruction contains the closure itself `std::function::function(&dest, (operator(), captured_vars)`, hence, we need to make sure we copy the right value. Reviewed By: ezgicicek Differential Revision: D23396568 fbshipit-source-id: 0acb8f6bc	4 years ago
Daiva Naudziuniene	0a4af7754d	[pulse] Fix std::function::operator() Summary: There was a mismatch between formals and actuals in `std::function::operator()` because we were not passing the first argument corresponding to the closure. Reviewed By: ezgicicek Differential Revision: D23372104 fbshipit-source-id: d0f9b27d6	4 years ago
Daiva Naudziuniene	29fd9e13d1	[pulse] Understand captured variables in cpp lambdas Summary: When we evaluate lambdas in pulse, we create a closure object with `fake` fields to store captured variables. However, during the function call we were not linking the captured values from the closure object. We address this missing part here. Reviewed By: jvillard Differential Revision: D23316750 fbshipit-source-id: 14751aa58	5 years ago
Jules Villard	5278cb7374	[pulse] `delete nullptr` is a no-op Summary: `delete` works exactly like `free` so merge both models together. Also move the `free(0)` test to nullptr.cpp as it seems more appropriate. Reviewed By: da319 Differential Revision: D23241297 fbshipit-source-id: 20a32ac54	5 years ago
Daiva Naudziuniene	69e0dce0ed	[pulse] fix end() iterator false positive Summary: Before we were modelling `vector.end()` as returning a fresh pointer every time is was called. It is common to check if an iterator is not the `end()` iterator and proceed to dereference the iterator in that case. In such code pattern `vector.end()` is called twice and returns different fresh values which causes false positives. To fix this, we add a special internal field `__infer_model_backing_array_pointer_to_last_element` to a vector to denote its end. Now, every time we call `vector.end()` we return the value of this field. We introduce a new attribute `EndOfCollection` to mark `end` iterator as the existing `EndIterator` invalidation is not suitable when we need to read the same value multiple times. Reviewed By: jvillard Differential Revision: D23101185 fbshipit-source-id: fa8a33b58	5 years ago
Jules Villard	97fcc3b0ad	[pulse] apply equality relation to terms to be added to the equality relation Summary: Extra normalization gives extra precision. This doesn't seem to negatively impact perf. Reviewed By: skcho Differential Revision: D22867109 fbshipit-source-id: 5b82ec377	5 years ago
Jules Villard	5a39c158c5	[pulse] arithmetic domain: take 4! Summary: This time it's personal. Roll out pulse's own arithmetic domain to be fast and be able to add precision as needed. Formulas are precise representations of the path condition to allow for good inter-procedural precision. Reasoning on these is somewhat ad-hoc (except for equalities, but even these aren't quite properly saturated in general), so expect lots of holes. Skipping dead code in the interest of readability as this (at least temporarily) doesn't use pudge anymore. This may make a come-back as pudge has/will have better precision: the proposed implementation of `PulseFormula` is very cheap so can be used any time we could want to prune paths (see following commits), but this comes at the price of some precision. Calling into pudge at reporting time still sounds like a good idea to reduce false positives due to infeasible paths. #skipdeadcode Reviewed By: skcho Differential Revision: D22576004 fbshipit-source-id: c91793256	5 years ago
Daiva Naudziuniene	221d0b62ab	[pulse] Model builtin __new as returning non-null Summary: We model internal builtin `__new` function to return a non-null value. This fixes nullptr_dereference false positives where we explicitly check the result of a function call for nullptr when the function returns a newly created object. Reviewed By: jvillard Differential Revision: D22772217 fbshipit-source-id: 37d209697	5 years ago
Jules Villard	9690dba871	[pulse] a slow example for pudge Summary: Add a test to the repo to try and detect perf regressions in pulse. Currently analyzed in ~0.1s. With `--pudge`, takes ~10s. Sledge does eager normalization and canonicalization when incorporating new facts into formula contexts and the algorithm is polynomial in the number of equalities. This example generates one equality per location in the array => boom. This bypasses the recency model of arrays because the formula needs to be constructed before it can be simplified to get rid of dead variables. The new arithmetic is not as complete as sledge's algorithm but linear in time. We could use it to simplify the formula before passing it to sledge. In fact, that was the original motivation. Reviewed By: skcho Differential Revision: D22574366 fbshipit-source-id: e9044ae09	5 years ago
Jules Villard	ae57f217d2	[pulse] don't always mistake equality for aliasing Summary: When applying function summaries, we are careful not to violate the summary's assumptions about non-aliasing. For example, the summary we generate for `foo(x,y) { x = y; }` will have `x` and `y` be allocated to two different `AbstractValue.t` in the heap, representing disjointness. However, the current logic is too coarse and also rejects passing the same pure value to functions that made no assumption about them being equal or different, eg `goo(int x,int y) { int z = x + y; }`. This is because the corresponding `AbstractValue.t` are different in the callee's summary, but are represented by only one same value in callers such as `goo(i,i)`. This diff restricts the "don't violate aliasing" condition to only consider heap-allocated values. This is consistent with separation logic by the way: we use the implication `x\|->- * y\|->- \|- x≠y`, which is valid only when both `x` and `y` are both allocated in the heap as in the left-hand-side of `\|-`. Reviewed By: skcho Differential Revision: D22574297 fbshipit-source-id: 206a18499	5 years ago
Daiva Naudziuniene	35011757dc	[pulse] Add a flag to pass functions that we want to model as returning non-null Summary: To avoid NULLPTR_DEREFERENCE false positives we want to model some functions as returning non-null. A new flag --pulse-model-return-nonnull allows us to provide a list of such functions. Reviewed By: ezgicicek Differential Revision: D22431564 fbshipit-source-id: 9944c7382	5 years ago
Daiva Naudziuniene	0ab3689f1f	[infer] NULLPTR_DEREFERENCE false positive caused by thread_local variable Summary: Keyword `thread_local` in cpp allows us to create a variable with thread storage duration, meaning that the object's lifetime begins when the thread begins and ends when the thread ends. We get `NULLPTR_DEREFERENCE` false positive for `thread_local` variable since we reallocate it in the `VariableLifetimeBegins` metadata instruction and we do not see further updates to the variable. To solve the issue we special case `VariableLifetimeBegins` instruction for global variables. Reviewed By: jvillard Differential Revision: D22284135 fbshipit-source-id: 13c14ef90	5 years ago
Daiva Naudziuniene	2c48e61031	[pulse] A new issue type OPTIONAL_EMPTY_ACCESS for trying to access folly::Optional when it is folly::none Summary: We need to check if `folly::Optional` is not `folly::none` if we want to retrieve the value, otherwise a runtime exception is thrown: ``` folly::Optional<int> foo{folly::none}; return foo.value(); // bad ``` ``` folly::Optional<int> foo{folly::none}; if (foo) { return foo.value(); // ok } ``` This diff adds a new issue type that reports if we try to access `folly::Optional` value when it is known to be `folly::none`. Reviewed By: ezgicicek Differential Revision: D22053352 fbshipit-source-id: 32cb00a99	5 years ago
Daiva Naudziuniene	412d2777eb	[pulse] Add a flag to pass functions that we want to model as abort Summary: To avoid NULLPTR_DEREFERENCE false positives we want to treat some functions as `abort`. A new flag `--pulse-model-abort` allows us to provide a list of such functions. Reviewed By: ezgicicek Differential Revision: D21962555 fbshipit-source-id: d46b93c99	5 years ago

1 2 3 4

190 Commits (fd1731c34bbbc4d1888d314516ae74051f0ff675)