infer_clone

Commit Graph

Author	SHA1	Message	Date
Sungkeun Cho	051473394b	[frontend] Fix incorrect order of statements Summary: This diff fixes incorrect order of statements on assignments. In the translation of `LHS=RHS;`, if `RHS` is a complicated expression that introduced new nodes, eg a conditional expression, some load statements for `LHS` came after its usage. To avoid the issue, this diff forces it to introduce new nodes for `LHS`. Reviewed By: jvillard Differential Revision: D26099782 fbshipit-source-id: 27417cd99	4 years ago
Ezgi Çiçek	840a10afaa	[pre-analysis] Remove throw/catch handling from preanalysis Summary: D20769039 (`cec8cbeff2`) added a preanalysis step that creates edges from throw nodes to all reachable catch nodes. It intended to fix some deadstore FPs however it caused more damage than the fix itself. In particular, throws were connected irrespective of - the type of the exception - whether the try was surrounded by a catch This in turn caused weird CFGs with dangling and impossible to understand nodes:( This diff reverts this change for now. Instead, the fix should probably be done in the frontend where we have more information about try/catch blocks. Reviewed By: da319 Differential Revision: D25997475 fbshipit-source-id: bbeabfbef	4 years ago
Nikos Gorogiannis	7b8145b8bc	[racerd][C++] do not report on lambdas at top level Summary: Lambdas are essentially private (but are not marked as such in Infer), so we should only report on their non-private callers. Meanwhile, add a test to document that access propagation to those callers is currently broken. Reviewed By: da319 Differential Revision: D25944811 fbshipit-source-id: ef8ca6d9c	4 years ago
Daiva Naudziuniene	16718384b3	[pulse] Optional Empty Access false positives we want to address Reviewed By: skcho Differential Revision: D25846520 fbshipit-source-id: ae60a8c51	4 years ago
Jules Villard	ad45bbe28d	[clang] fix order of translation for [this] Summary: We need to make sure a node is created to avoid instructions appearing in the wrong order in the final CFG. Reviewed By: da319 Differential Revision: D25784405 fbshipit-source-id: 3ef27d712	4 years ago
Daiva Naudziuniene	0c6eedc835	[pulse] Model std::__optional_storage_base::has_value Summary: Model ` std::__optional_storage_base::has_value` as this is what we see in clang AST when translating `std::optional::has_value` for libc++. For libstdc++, we get `std::optional::has_value` as expected. Reviewed By: skcho, jvillard Differential Revision: D25585543 fbshipit-source-id: b8d9d2902	4 years ago
Ezgi Çiçek	731b632632	[cost] Procedures with empty body get 0 cost Reviewed By: jvillard Differential Revision: D25494222 fbshipit-source-id: 947993978	4 years ago
Daiva Naudziuniene	b5df1be318	[pulse] Model std::vector::empty() Summary: Skipping the analysis of `std::vector::empty()` caused false positives: in the case where `std::vector::empty()` was called several times ("returning" different values each time), we were not able to prune infeasible paths. Model `std::vector::empty()` as returning the same value every time it is called. Reviewed By: ezgicicek Differential Revision: D23904704 fbshipit-source-id: 52e8a2451	4 years ago
Sungkeun Cho	0cbe2f9b08	[pulse] Uninitialized value check in pulse Summary: This diff adds an uninitialized value check in pulse. For now, it supports only simple cases: - declared variables with a type of integer, float, void, and pointer - malloced pointer variables that point to integer, float, void, and pointer TODOs: I will add more cases in the following diffs. - declared/malloced array - declared/malloced struct - inter-procedural checking Reviewed By: jvillard Differential Revision: D25269073 fbshipit-source-id: 317df9a85	4 years ago
Daiva Naudziuniene	0343f5c7d9	[pulse] Remove duplicate `by` from a trace Reviewed By: ezgicicek Differential Revision: D25196418 fbshipit-source-id: c1e504099	4 years ago
Ezgi Çiçek	11141cb100	[impurity] Collect all accesses Summary: Previously, impurity analysis only collected one access for a single modification but not all other modifying accesses. This diff - changes the impurity domain to collect all modifying accesses - tracks and prints all the accesses seen to reach the modification, improving readability and debugging Recording all accesses is needed in the next diff to determine if a method modifies any immutable fields. To determine that, we need to know all modifications, not just a single one. Reviewed By: skcho Differential Revision: D25186516 fbshipit-source-id: 43ceb3cd8	4 years ago
Daiva Naudziuniene	4e658903ae	[pulse] Check the validity of the addresses captured by lambda only for captures by reference Summary: To look for captured variable address escape we should only check the validity of the addresses captured by reference. Checking the validity of the address captured by value can cause nullptr dereference false positives. Reviewed By: jvillard Differential Revision: D25219347 fbshipit-source-id: faf6f2b00	4 years ago
Joscha Benz	606a3c95d9	Add a config option to whitelist common initializer values for liveness analysis (#1340) Summary: Implements https://github.com/facebook/infer/issues/1083 Pull Request resolved: https://github.com/facebook/infer/pull/1340 Reviewed By: mityal Differential Revision: D24956747 Pulled By: jvillard fbshipit-source-id: 820eb538a	4 years ago
Martin Trojer	9fd1a87cd5	Upgrade Infer to clang10 Reviewed By: jvillard Differential Revision: D25048497 fbshipit-source-id: 7e839a076	4 years ago
Jules Villard	29f3941600	[clang] deal with conditionally-destroyed temporaries Summary: This was left as a TODO before: where to place calls to destructors for C++ temporaries that are only conditionally created when evaluating an expression. This can happen inside the branches of a conditional operation `b?e:f` or in potentially-short-circuited conditions on the right-hand side of `&&` and `||` operators. Following the compilation scheme of clang (observed by looking at the generated LLVM bitcode), we instrument the program with "marker" variables, so that for instance `X x = true?X():y;` becomes (following the execution on the true branch): ``` marker1 = 0; // initialize all markers to 0 PRUNE(true) // entering true branch X::X(&temporary); // create temporary... marker1 = 1; // ...triggers setting its marker to 1 X::X(&x, &temporary); // finish expression if (marker1) { X::~X(&temporary); // conditionally destroy the temporary } ``` In this diff, you'll find code for: - associating markers to temporaries that need them - code to initialize markers to 0 before full-expressions - code to conditionally destroy temporaries based on the values of the markers once the full-expression has finished evaluating Reviewed By: da319 Differential Revision: D24954070 fbshipit-source-id: cf15df7f7	4 years ago
Jules Villard	2efa22073c	[clang] force node creation in switch statements Summary: The translation of `switch` cases needs to insert nodes around the translation of each `case` sub-statement, so we need to force node creation in these sub-statements so the nodes around it can be connected to the translation of the sub-statements. Also added more logging I found useful when debugging that. Reviewed By: da319 Differential Revision: D24991455 fbshipit-source-id: d3a622142	4 years ago
Jules Villard	f2e3f67f40	[clang] change the way we wire up return statements Summary: Split the translation of return more aggressively between: 1. the instruction that has to happen before the translation of the sub-expr 2. the sub-expr 3. the instruction that has to happen after the sub-expr This is needed for the next diff which creates potentially large CFGs in (2). Reviewed By: da319 Differential Revision: D24954071 fbshipit-source-id: a7e7e2527	4 years ago
Daiva Naudziuniene	019adf7e78	[pulse] Model for folly::Optional::get_pointer Summary: Model `folly::Optional::get_pointer` which returns an address to a value if exists or `nullptr` if empty. Reviewed By: jvillard Differential Revision: D24935677 fbshipit-source-id: 9d990fe07	4 years ago
Jules Villard	f411c7d131	[pulse] do not stop at the first error in function calls Summary: We deliberately stopped as soon as an error was detected when applying a function call. This is not good as other pre/posts of the function may apply cleanly, which would allow us to cover more behaviours of the code. Went on a bit of a refactoring tangent while fixing this, to clarify the `Ok None`/`Ok Some _`/`Error _` datatype returned by PulseInterproc. Now we report errors as soon as we find them during function calls but continue accumulating specs afterwards. Reviewed By: da319 Differential Revision: D24888768 fbshipit-source-id: d5f2c29d7	4 years ago
Jules Villard	578583f2ab	[pulse] check that new arithmetic facts are consistent with the heap Summary: Communicate new facts from the arithmetic domain to the memory domain to detect contradictions between the two. Reviewed By: jberdine Differential Revision: D24832079 fbshipit-source-id: 2caf8e9af	4 years ago
Jules Villard	e32f6ca360	[clang] fix bad interaction between ConditionalOperator and initializers Summary: This is several inter-connected changes together to keep the tests happy. The ConditionalOperator `b?t:e` is translated by first creating a placeholder variable to temporarily store the result of the evaluation in each branch, then the real thing we want to assign to reads that variable. But, there are situations where that changes the semantics of the expression, namely when the value created is a struct on the stack (eg, a C++ temporary). This is because in SIL we cannot assign the address of a program variable, only its contents, so by the time we're out of the conditional operator we cannot set the struct value correctly anymore: we can only set its content, which we did, but that results in a "shifted" struct value that is one dereference away from where it should be. So a batch of changes concern `conditionalOperator_trans`: - instead of systematically creating a temporary for the conditional, use the `trans_state.var_exp_typ` provided from above if available when translating `ConditionalOperator` - don't even set anything if that variable was already initialized by merely translating the branch expression, eg when it's a constructor - fix long-standing TODO to propagate these initialization facts accurately for ConditionalOperator (used by `init_expr_trans` to also figure out if it should insert a store to the variable being initialised or not) The rest of the changes adapt some relevant other constructs to deal with conditionalOperator properly now that it can set the current variable itself, instead of storing stuff inside a temp variable. This change was a problem because some constructs, eg a variable declaration, will insert nodes that set up the variable before calling its initialization, and now the initialization happens before that setup, in the translation of the inner conditional operator, which naturally creates nodes above the current one. - add a generic helper to force a sequential order between two translation results, forcing node creation if necessary - use that in `init_expr_trans` and `cxxNewExpr_trans` - adjust many places where `var_exp_typ` was incorrectly not reset when translating sub-expressions The sequentiality business creates more nodes when used, and the conditionalOperator business uses fewer temporary variables, so the frontend results change quite a bit. Note that biabduction tests were invaluable in debugging this. There could be other constructs to adjust similarly to cxxNewExpr that were not covered by the tests though. Added tests in pulse that exercises the previous bug. Reviewed By: da319 Differential Revision: D24796282 fbshipit-source-id: 0790c8d17	4 years ago
Daiva Naudziuniene	58f1fd8b32	[pulse] Optional Empty Access for std::optional Reviewed By: jvillard Differential Revision: D24760820 fbshipit-source-id: bedf6aee3	4 years ago
Daiva Naudziuniene	eb4684f6d5	[pulse] Less precise model for constructing optional from value Summary: We recently introduced a more precise model for constructing an optional from a value by making a shallow copy. However, this introduced Use After Delete false positives. For now, we go back to a less precise model by creating a fresh value. A proper model would be to either make a deep copy or call the copy constructor for a value. We will address this in the following diff. Reviewed By: jvillard Differential Revision: D24826749 fbshipit-source-id: 3e5e4edeb	4 years ago
Daiva Naudziuniene	a4241eeb43	[pulse] Refactor Optional models Summary: Refactor `folly::Optional` models to make them easier to reuse for `std::optional` Reviewed By: jvillard Differential Revision: D24760053 fbshipit-source-id: f665e84c8	4 years ago
Jules Villard	b6460870dc	[biabd] rename a test to follow naming conventions Reviewed By: ngorogiannis Differential Revision: D24794526 fbshipit-source-id: 9b2392c35	4 years ago
Daiva Naudziuniene	3d74f39102	[pulse] Improve trace for Optional Empty Access Summary: `folly::Optional::value()` returns a reference, hence an error was shown when the actual value was being accessed. Since `value()` throws an exception in case of `folly::none`, we want to show the error message at the call site of `value()`. We do this by dereferencing the result of `value()` in the model. Reviewed By: jvillard Differential Revision: D24702875 fbshipit-source-id: ca9f30349	4 years ago
Daiva Naudziuniene	b17861b1c8	[pulse] More precise model for constructing folly::Optional<Value> from Value Summary: Before we were creating a fresh internal value when we were constructing `folly::Optional`. This diff models `folly::Optional` constructor more precisely by copying the given value. There was also a missing dereference in the model of `value_or` Reviewed By: jvillard Differential Revision: D24621016 fbshipit-source-id: c86d3c157	4 years ago
Daiva Naudziuniene	059c0f24a2	[pulse] Model Optional value_or Summary: Model `folly::Optional::value_or(default)` to return value if not-empty and `default` if empty. Reviewed By: jvillard Differential Revision: D24539456 fbshipit-source-id: cc9e176cc	4 years ago
Sungkeun Cho	198c700e87	[cost] Add an option printing/suppressing function pointers in cost Summary: This diff adds an option hiding function pointers in costs to users: `cost-suppress-func-ptr` is true by default. Reviewed By: ezgicicek Differential Revision: D24448212 fbshipit-source-id: 88f6b5ea1	4 years ago
Jules Villard	7fdb33b710	[pulse] report errors only when the PRUNE nodes along the path are true Summary: Take another page from the Incorrectness Logic book and refrain from reporting issues on paths unless we know for sure that this path will be taken. Previously, we would report on paths that are merely not impossible. This goes very far in the other direction, so it's possible we'll want to go back to some sort of middle ground. Or maybe not. See the changes in the tests to get a sense of what we're missing. Reviewed By: ezgicicek Differential Revision: D24014719 fbshipit-source-id: d451faf02	4 years ago
Daiva Naudziuniene	22d317c940	[pulse] Move pulse model flags to .inferconfig for pulse tests Summary: The title Reviewed By: skcho Differential Revision: D23960402 fbshipit-source-id: edc3bc2d0	4 years ago
Martin Trojer	26dcd63a25	Revert "upgrade to clang10" Reviewed By: jvillard Differential Revision: D23961256 fbshipit-source-id: d57760b27	4 years ago
Martin Trojer	15786e8f09	upgrade to clang10 Summary: Upgrade to latest clang release, needed for xcode12. clang-8/9 won't be able to read the Xcode 12 SDK since there's annotations that will fail compilation. Also removing unused (and hard to compile) binary `ast_exporter_bin` from facebook-clang-plugins/libtooling. Reviewed By: ngorogiannis Differential Revision: D23780089 fbshipit-source-id: 2314125a9	4 years ago
Jules Villard	6c8fc85e22	[preanalysis] do not nullify vars captured by ref Summary: Nullifying these leads to observable side-effects, like in the added test. Reviewed By: da319 Differential Revision: D23759756 fbshipit-source-id: 559a6486b	4 years ago
Daiva Naudziuniene	91a33f6edc	[frontend] Captured struct variables in cpp lambdas Summary: Structs captured both by reference or by value should have reference in their type. Struct captured by value should first call copy constructor. In this diff we fix the type of the captured variable to include reference. Copy constructor injection is left for the future. Reviewed By: jvillard Differential Revision: D23688713 fbshipit-source-id: d13748b5d	4 years ago
Daiva Naudziuniene	857daf63c9	[frontend] Capture reference variables Summary: Variables captured without initialization do not have correct type inside lambda's body. This diff sets the correct type of captured reference variables inside procdesc and makes sure the translation of captured variables is correct. The translation of lambda's body will then take into account the type of captured var from procdesc. Reviewed By: jvillard Differential Revision: D23678371 fbshipit-source-id: ed16dc978	4 years ago
Daiva Naudziuniene	42abe5b277	[frontend] Fix type of captured vars in lambda's body Summary: Add missing reference to the type of variable captured by reference without initialization. Reviewed By: jvillard Differential Revision: D23567685 fbshipit-source-id: b4e2ac0b6	4 years ago
Daiva Naudziuniene	d0cb245303	[frontend] Fix capture init for cpp lambdas Summary: We were missing assignment to captured variables with initializers. Consider the following example: ``` S* update_inside_lambda_capture_and_init(S* s) { S* object = nullptr; auto f = [& o = object](S* s) { o = s; }; f(s); return object; } ``` which was translated to ``` VARIABLE_DECLARED(o:S&); &o:S&=&object &f =(_fun...lambda..._operator(),([by ref]&o &o:S&)) ``` However, we want to capture `o` (which is an address of `object`), rather than `&o`, in the closure. After the diff ``` VARIABLE_DECLARED(o:S&); &o:S&=&object n$7=&o:S& &f =(_fun...lambda..._operator(),([by ref]n$7 &o:S&)) ``` Reviewed By: jvillard Differential Revision: D23567346 fbshipit-source-id: 20f77acc2	4 years ago
Jules Villard	03bc3f31c8	[pulse] add option to skip functions/classes Summary: This can be useful to make pulse forget about tricky parts of the code. Treat "skipped" procedures as unknown so heuristics for mutating the return value and parameters passed by reference are applied. Reviewed By: ezgicicek Differential Revision: D23729410 fbshipit-source-id: d7a4924a8	4 years ago
Daiva Naudziuniene	4401701578	[pulse] Model for std::function copy constructor Summary: Added a model for copy constructor for `std::function`. In most cases, the SIL instruction `std::function::function(&dest, &src)` gives us pointers to `dest` and `src`, hence, we model the copy constructor as a shallow copy. However, in some cases, e.g. `std::function f = lambda_literal`, SIL instruction contains the closure itself `std::function::function(&dest, (operator(), captured_vars)`, hence, we need to make sure we copy the right value. Reviewed By: ezgicicek Differential Revision: D23396568 fbshipit-source-id: 0acb8f6bc	5 years ago
Daiva Naudziuniene	0a4af7754d	[pulse] Fix std::function::operator() Summary: There was a mismatch between formals and actuals in `std::function::operator()` because we were not passing the first argument corresponding to the closure. Reviewed By: ezgicicek Differential Revision: D23372104 fbshipit-source-id: d0f9b27d6	5 years ago
Daiva Naudziuniene	29fd9e13d1	[pulse] Understand captured variables in cpp lambdas Summary: When we evaluate lambdas in pulse, we create a closure object with `fake` fields to store captured variables. However, during the function call we were not linking the captured values from the closure object. We address this missing part here. Reviewed By: jvillard Differential Revision: D23316750 fbshipit-source-id: 14751aa58	5 years ago
Jules Villard	5278cb7374	[pulse] `delete nullptr` is a no-op Summary: `delete` works exactly like `free` so merge both models together. Also move the `free(0)` test to nullptr.cpp as it seems more appropriate. Reviewed By: da319 Differential Revision: D23241297 fbshipit-source-id: 20a32ac54	5 years ago
Daiva Naudziuniene	69e0dce0ed	[pulse] fix end() iterator false positive Summary: Before we were modelling `vector.end()` as returning a fresh pointer every time it was called. It is common to check if an iterator is not the `end()` iterator and proceed to dereference the iterator in that case. In such a code pattern `vector.end()` is called twice and returns different fresh values, which causes false positives. To fix this, we add a special internal field `__infer_model_backing_array_pointer_to_last_element` to a vector to denote its end. Now, every time we call `vector.end()` we return the value of this field. We introduce a new attribute `EndOfCollection` to mark the `end` iterator as the existing `EndIterator` invalidation is not suitable when we need to read the same value multiple times. Reviewed By: jvillard Differential Revision: D23101185 fbshipit-source-id: fa8a33b58	5 years ago
Jules Villard	97fcc3b0ad	[pulse] apply equality relation to terms to be added to the equality relation Summary: Extra normalization gives extra precision. This doesn't seem to negatively impact perf. Reviewed By: skcho Differential Revision: D22867109 fbshipit-source-id: 5b82ec377	5 years ago
Jules Villard	5a39c158c5	[pulse] arithmetic domain: take 4! Summary: This time it's personal. Roll out pulse's own arithmetic domain to be fast and be able to add precision as needed. Formulas are precise representations of the path condition to allow for good inter-procedural precision. Reasoning on these is somewhat ad-hoc (except for equalities, but even these aren't quite properly saturated in general), so expect lots of holes. Skipping dead code in the interest of readability as this (at least temporarily) doesn't use pudge anymore. This may make a come-back as pudge has/will have better precision: the proposed implementation of `PulseFormula` is very cheap so can be used any time we could want to prune paths (see following commits), but this comes at the price of some precision. Calling into pudge at reporting time still sounds like a good idea to reduce false positives due to infeasible paths. #skipdeadcode Reviewed By: skcho Differential Revision: D22576004 fbshipit-source-id: c91793256	5 years ago
Daiva Naudziuniene	35de604422	[frontend] Record attribute unused in local variable data Summary: To avoid dead store false positives we skip initialization of a variable that has an `unused` attribute. However, this causes uninitialized value false positives when the variable is later used in macros. To fix this, instead of skipping initialization we record the information about `unused` attribute in local variable data that we can later use for filtering out dead store issues. Reviewed By: jvillard Differential Revision: D22868050 fbshipit-source-id: 4a2d0e680	5 years ago
Daiva Naudziuniene	c0152f2278	[frontend] Remove duplicate variable declaration Summary: We get duplicated variable declaration instruction for primitive type variable initialized using list initializer, e.g. ``` int* p{nullptr}; ``` This happens because we add variable declaration instruction when we translate both `DeclStmt` and `InitListExpr`. To fix this, we do not add the duplicated variable declaration when we translate `InitListExpr`. Reviewed By: jvillard Differential Revision: D22844726 fbshipit-source-id: 422806924	5 years ago
Daiva Naudziuniene	221d0b62ab	[pulse] Model builtin __new as returning non-null Summary: We model internal builtin `__new` function to return a non-null value. This fixes nullptr_dereference false positives where we explicitly check the result of a function call for nullptr when the function returns a newly created object. Reviewed By: jvillard Differential Revision: D22772217 fbshipit-source-id: 37d209697	5 years ago
Jules Villard	9690dba871	[pulse] a slow example for pudge Summary: Add a test to the repo to try and detect perf regressions in pulse. Currently analyzed in ~0.1s. With `--pudge`, takes ~10s. Sledge does eager normalization and canonicalization when incorporating new facts into formula contexts and the algorithm is polynomial in the number of equalities. This example generates one equality per location in the array => boom. This bypasses the recency model of arrays because the formula needs to be constructed before it can be simplified to get rid of dead variables. The new arithmetic is not as complete as sledge's algorithm but linear in time. We could use it to simplify the formula before passing it to sledge. In fact, that was the original motivation. Reviewed By: skcho Differential Revision: D22574366 fbshipit-source-id: e9044ae09	5 years ago
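
The marker-variable scheme described in commit 29f3941600 can be illustrated with a small hand-written C++ sketch. This is not Infer's frontend code or its SIL output: the type `X`, its call log, and the function `lowered_conditional_init` are hypothetical names introduced only to mimic, in plain C++, the instrumented form of `X x = cond ? X() : y;` given in that commit message.

```cpp
#include <cassert>
#include <new>
#include <string>
#include <vector>

// Hypothetical type that records its constructor/destructor calls.
struct X {
  static std::vector<std::string>& log() {
    static std::vector<std::string> entries;
    return entries;
  }
  X() { log().push_back("ctor"); }
  X(const X&) { log().push_back("copy"); }
  ~X() { log().push_back("dtor"); }
};

// Manual lowering of `X x = cond ? X() : y;` with one marker variable
// (an illustration of the idea, not the actual translation).
std::vector<std::string> lowered_conditional_init(bool cond) {
  X::log().clear();
  X y;  // logs "ctor"
  alignas(X) unsigned char temporary_storage[sizeof(X)];
  alignas(X) unsigned char x_storage[sizeof(X)];
  X* temporary = nullptr;
  X* x = nullptr;

  int marker1 = 0;  // initialize all markers to 0
  if (cond) {
    temporary = new (temporary_storage) X();  // create temporary...
    marker1 = 1;                              // ...and set its marker
    x = new (x_storage) X(*temporary);        // finish expression
  } else {
    x = new (x_storage) X(y);  // no temporary on this branch
  }
  if (marker1) {
    temporary->~X();  // conditionally destroy the temporary
  }

  x->~X();          // end of x's lifetime
  return X::log();  // copied before y is destroyed
}
```

Calling `lowered_conditional_init(true)` constructs and destroys the temporary, while `lowered_conditional_init(false)` never touches it, which is the property the marker is meant to guarantee.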


975 Commits (faaf0ecf180c3e0357c593a300e5c44e69a67183)