infer_clone

Commit Graph

Author	SHA1	Message	Date
Jules Villard	f2e3f67f40	[clang] change the way we wire up return statements Summary: Split the translation of return more aggressively between: 1. the instruction that has to happen before the translation of the sub-expr 2. the sub-expr 3. the instruction that has to happen after the sub-expr This is needed for the next diff which creates potentially large CFGs in (2). Reviewed By: da319 Differential Revision: D24954071 fbshipit-source-id: a7e7e2527	4 years ago
Jules Villard	e32f6ca360	[clang] fix bad interaction between ConditionalOperator and initializers Summary: This is several inter-connected changes together to keep the tests happy. The ConditionalOperator `b?t:e` is translated by first creating a placeholder variable to temporarily store the result of the evaluation in each branch, then the real thing we want to assign to reads that variable. But, there are situations where that changes the semantics of the expression, namely when the value created is a struct on the stack (eg, a C++ temporary). This is because in SIL we cannot assign the address of a program variable, only its contents, so by the time we're out of the conditional operator we cannot set the struct value correctly anymore: we can only set its content, which we did, but that results in a "shifted" struct value that is one dereference away from where it should be. So a batch of changes concern `conditionalOperator_trans`: - instead of systematically creating a temporary for the conditional, use the `trans_state.var_exp_typ` provided from above if available when translating `ConditionalOperator` - don't even set anything if that variable was already initialized by merely translating the branch expression, eg when it's a constructor - fix long-standing TODO to propagate these initialization facts accurately for ConditionalOperator (used by `init_expr_trans` to also figure out if it should insert a store to the variable being initialised or not) The rest of the changes adapt some relevant other constructs to deal with conditionalOperator properly now that it can set the current variable itself, instead of storing stuff inside a temp variable. This change was a problem because some constructs, eg a variable declaration, will insert nodes that set up the variable before calling its initialization, and now the initialization happens before that setup, in the translation of the inner conditional operator, which naturally creates nodes above the current one. - add a generic helper to force a sequential order between two translation results, forcing node creation if necessary - use that in `init_expr_trans` and `cxxNewExpr_trans` - adjust many places where `var_exp_typ` was incorrectly not reset when translating sub-expressions The sequentiality business creates more nodes when used, and the conditionalOperator business uses fewer temporary variables, so the frontend results change quite a bit. Note that biabduction tests were invaluable in debugging this. There could be other constructs to adjust similarly to cxxNewExpr that were not covered by the tests though. Added tests in pulse that exercises the previous bug. Reviewed By: da319 Differential Revision: D24796282 fbshipit-source-id: 0790c8d17	4 years ago
Sungkeun Cho	92e7aeeb3e	[infer] Fix clang frontend for switch statment Summary: This diff fixes the clang translation for switch statement. It assumed that `default:` comes always at last, which introduced some unreachable nodes inadvertently, e.g. when `default:` comes at first. Reviewed By: dulmarod Differential Revision: D19793138 fbshipit-source-id: 1e8b52c0d	5 years ago
Jules Villard	78a33acb77	[cfg] run pre-analysis lazily in ondemand Summary: This also prints the CFGs after pre-analysis for individual procedures in infer-out/captured/<filename>/<proc>.dot. One can also look up the CFGs before pre-analysis in infer-out/captured/proc_cfgs_frontend.dot. Context: I want to add a pre-analysis that needs to look at proc attributes inter-procedurally. For this to make sense it has to happen after all of capture, and before analysis. Thus, this diff brings back the lazy running of the pre-analysis like in D15803492, except that we still make sure to run the pre-analyses systematically regardless of the checkers being run by running the pre-analysis from ondemand.ml. Also we don't need to re-introduce the "did_preanalysis" proc attribute for the same reason that the pre-analysis is now run once and for all by ondemand.ml (instead of each individual checker back in the days). This has the benefit of running the pre-analysis only when needed, and the drawback that several concurrent processes analysing the same proc descs will duplicate work. Since pre-analyses are supposed to be very fast I assume that neither is a big deal. If they become more expensive then the benefit gets bigger and the drawback is just the same as with regular analyses. Reviewed By: skcho Differential Revision: D18573920 fbshipit-source-id: de350eaef	5 years ago
Jules Villard	1395d5581d	[clang] upgrade to 8.0.0 Summary: - take advantage more structured attributes in the exported AST - circumvent new format of `if` and `switch` - a few new features/nodes but nothing major there update-submodule: facebook-clang-plugins Reviewed By: mbouaziz, martintrojer Differential Revision: D15453572 fbshipit-source-id: c0c24345f	6 years ago
Jules Villard	686231ec6e	[SIL] change `variable_initialization()` builtin to a new auxiliary instruction Summary: Instead of emitting an ad-hoc builtin on variable declaration emit a new metadata instruction. This allows us to remove the code matching on that ad-hoc builtin that had to be inserted in several checkers. Inferbo & pulse used that information meaningfully and had to undergo some minor changes to cope with the new metada instruction. Reviewed By: ezgicicek Differential Revision: D14833100 fbshipit-source-id: 9b3009d22	6 years ago
Jules Villard	1c668c4d41	[SIL][preanalysis] add call flag for functions treating first formal as return Summary: This helps some checkers and the liveness preanalysis. Reviewed By: da319 Differential Revision: D13102954 fbshipit-source-id: b8d3c5fe2	6 years ago
Jules Villard	55586b581b	[preanalysis] do not delay killing variables taken by reference Summary: Before, the liveness pre-analysis would place extra instructions in the CFG for either: 1. marking an `Ident.t` as dead, or 2. marking a `Pvar.t` as `= 0` But we have no way of marking pvars dead without setting them to 0. This is bad because setting pvars to 0 is not possible everywhere they are dead. Indeed, we only do it when we haven't seen their address being taken anyway. This prevents the following situation, recorded in our tests: ``` int address_taken() { int** x; int* y; int i = 7; y = &i; x = &y; // if we don't reason about taken addresses while adding nullify instructions, // we'll add // `nullify(y)` here and report a false NPE on the next line return **x; } ``` So we want to mark pvars as dead without nullifying them. This diff extends the `Remove_temps` SIL instruction to accept pvars as well, and so renames it to `ExitScope`. Reviewed By: da319 Differential Revision: D13102953 fbshipit-source-id: aa7f03a52	6 years ago
Sungkeun Cho	1486a5f105	[infer] Translate casting expressions of integer pointers Summary: It enables the translation of casting expression. As of now, it translates only the castings of pointers to integer types, in order to avoid too much of change, which may mess the checkers up. Reviewed By: jvillard Differential Revision: D12920568 fbshipit-source-id: a5489df24	6 years ago
Jules Villard	646aa30797	[cfg] print dotty after pre-analysis Summary: Useful to understand the changes in the pre-analysis, or to inspect the CFG that checkers actually get. This means that the pre-analysis always runs when we output the dotty, but I don't really see a reason why not. In fact, we could probably always store the CFGs as pre-analysed. Reviewed By: mbouaziz Differential Revision: D13102952 fbshipit-source-id: 89f3102ec	6 years ago
Jules Villard	9aa5582caa	[clang] leave markers of variable initialization for pulse Summary: When initialising a variable via semi-exotic means, the frontend loses the information that the variable was initialised. For instance, it translates: ``` struct Foo { int i; }; ... Foo s = {42}; ``` as: ``` s.i := 42 ``` This can be confusing for backends that need to know that `s` actually got initialised, eg pulse. The solution implemented here is to insert of dummy call to `__variable_initiazition`: ``` __variable_initialization(&s); s.i := 42; ``` Then checkers can recognise that this builtin function does what its name says. Reviewed By: mbouaziz Differential Revision: D12887122 fbshipit-source-id: 6e7214438	6 years ago
Mehdi Bouaziz	ad986dffde	Get rid of Declare_locals Reviewed By: jeremydubreil Differential Revision: D9234331 fbshipit-source-id: 70443eabe	6 years ago
Jules Villard	8b882ac1df	Change license to MIT Summary: Change the license of the source code from BSD + PATENTS to MIT. Change `checkCopyright` to reflect the new license and learn some new file types. Generated with: ``` git grep BSD \| xargs -n 1 ./scripts/checkCopyright -i ``` Reviewed By: jeremydubreil, mbouaziz, jberdine Differential Revision: D8071249 fbshipit-source-id: 97ca23a	7 years ago
Jules Villard	8715c4f892	[clang] make switch statement translation more robust Summary: Labels inside switch statements were causing havoc (see test), and the translation of switch statements in general could be improved to handle more cases. It turns out that `case` (and `default`) statements are more or less fancy labels into the code. In other words, if you erase all the `case XXX:` and `default:` strings in the `switch` statement you get the real structure of the program, and `switch` just jumps straight to the first `case` directives (and to the second if the first one is not satisfied, etc. until all `case`/`default` have been considered). This suggests an alternative implementation: translate the body of the `switch` and simply record the list of switch cases inside that body, along with where they point to. Then post-process this list to construct the control flow of the `switch`, which points into the control-flow of the `body`. In order not to modify every function in `CTrans` to propagate the current list of cases, I created an ugly `ref` inside `SwitchCase` instead (but it cannot be directly accessed and it's guaranteed to be well-parenthesised wrt nested switches by the `SwitchCase` API so it's not too bad). [unrelated] Also make translation failures output more information about what exactly in the source code is causing the crash, and the ancestors in the AST that lead to the crash site. Reviewed By: martinoluca Differential Revision: D8011046 fbshipit-source-id: 8455090	7 years ago
Jules Villard	766a16cd90	[clang] enforce that `instruction` always returns one SIL expression Summary: Previously, the type of `trans_result` contained a list of SIL expressions. However, most of the time we expect to get exactly one, and getting a different number is a soft(!) error, usually returning `-1`. This splits `trans_result` into `control`, which contains the information needed for temporary computation (hence when we don't necessarily know the return value yet), and a new version of `trans_result` that includes `control`, the previous `exps` list but replaced by a single `return` expression instead, and a couple other values that made sense to move out of `control`. This allows some flexibility in the frontend compared to enforcing exactly one return expression always: if they are not known yet we stick to `control` instead (see eg `compute_controls_to_parent`). This creates more garbage temporary identifiers, however they do not show up in the final cfg. Instead, we see that temporary IDs are now often not consecutive... The most painful complication is in the treatment of `DeclRefExpr`, which was actually returning two expressions: the method name and the `this` object. Now the method name is a separate (optional) field in `trans_result`. Reviewed By: mbouaziz Differential Revision: D7881088 fbshipit-source-id: 41ad3b5	7 years ago
Jules Villard	73a47d594c	[debug] print procedures in alphabetical order in cfgs Summary: When looking at large CFGs, at least in `xdot`, it's often difficult to find the procedure you're looking for. Sorting the proc names puts them in alphabetical order, which makes searching one procedure easier. Reviewed By: mbouaziz Differential Revision: D7758521 fbshipit-source-id: 8e9997f	7 years ago
Ezgi Çiçek	523c2f539b	change clang translation to track if_kind (i.e. the type of prune node) Reviewed By: ddino Differential Revision: D7653684 fbshipit-source-id: d731ccf	7 years ago
Jules Villard	6b5390fe79	[cfg] rename iCFG to cfg in dotty files Summary: Not sure what an "iCFG" is but the dotty is only about CFGs anyway. Diff obtained by mass-`sed`. Reviewed By: sblackshear Differential Revision: D6324280 fbshipit-source-id: b7603bb	7 years ago
Jules Villard	94e7a7b141	[siof] one access per sink, better report deduplication Summary: The previous domain for SIOF was duplicating some work with the generic Trace domain, and basically was a bit confused and confusing. A sink was a set of global accesses, and a state contains a set of sinks. Then the checker has to needlessly jump through hoops to normalize this set of sets of accesses into a set of accesses. The new domain has one sink = one access, as suggested by sblackshear. This simplifies a few things, and makes the dedup logic much easier: just grab the first report of the list of reports for a function. We only report on the fake procedures generated to initialise a global, and the filtering means that we keep only one report per global. Reviewed By: sblackshear Differential Revision: D5932138 fbshipit-source-id: acb7285	7 years ago
Jeremy Dubreil	919b9268d4	[infer][clang] simplify the translation of the prune nodes Summary: The prune nodes where translated as `prune (expr = false)` and `prune ( expr != false)`. This case is a bit tricky to deconstruct in HIL. This diff translates the prune instructions as just `prune !expr` for the true branch and `prune expr` for the false branch. Reviewed By: dulmarod Differential Revision: D5832147 fbshipit-source-id: 2c3502d	7 years ago
Andrzej Kotulski	462220ce3e	[typ] Print type qualifiers in Typ.pp_full Summary: Title. The way types are printed is completely valid, but little weird for some C++ programmers: `int const` - same as `const int` `int * const` - pointer is `const`, value under it is not `int const ` - pointer is not `const`, but the value is `int const const` - both pointer and value are const Reviewed By: jberdine Differential Revision: D4962180 fbshipit-source-id: dcb02e3	8 years ago
Josh Berdine	0cf71c74ef	Sort nodes when printing cfg to dot file Summary: Currently cfg nodes are written into dot files in whatever order they appear in a hash table. This seems unnecessarily sensitive, so this diff sorts the nodes. Reviewed By: dulmarod Differential Revision: D4232377 fbshipit-source-id: a907cc6	8 years ago
Sam Blackshear	708c0bf1f8	[backend] eliminate phantom spaces in printing of types Summary: These are dangerous if you are trying to compare a type to a string, and they're also unsightly. Reviewed By: jvillard Differential Revision: D4189956 fbshipit-source-id: 14ce127	8 years ago
Cristiano Calcagno	a71902355f	[debug][dotty] Fix issue in dotty output where overloaded functions were conflated Reviewed By: jvillard Differential Revision: D4117078 fbshipit-source-id: fdd8e93	8 years ago
Cristiano Calcagno	3fb8801b6c	[IR] Change cfg representation so the node number is per-procedure and not per-cfg Reviewed By: jeremydubreil Differential Revision: D4088075 fbshipit-source-id: 6e517a7	8 years ago
Sam Blackshear	7b58c71475	centralize creation and detection of clang tmp vars, fix errdesc/bucketing Reviewed By: akotulski Differential Revision: D3529992 fbshipit-source-id: 939f47a	9 years ago
Sam Blackshear	fd8a864c15	doing preanalysis on-demand Reviewed By: jeremydubreil Differential Revision: D3352767 fbshipit-source-id: a9dcc0a	9 years ago
Andrzej Kotulski	617ffab0ac	Add @generated comment to icfg.dot files Reviewed By: jvillard Differential Revision: D3358243 fbshipit-source-id: a47cc01	9 years ago
Sam Blackshear	3f49f3a1d4	using liveness to add removetemps instructions Reviewed By: jberdine Differential Revision: D3245574 fbshipit-source-id: 02c1bcd	9 years ago
Sam Blackshear	20925df57c	removing unused deallocate param in nullify instr Reviewed By: jeremydubreil Differential Revision: D3263241 fbshipit-source-id: b0d2c0f	9 years ago
Sam Blackshear	4fd2f52fe8	new analysis for adding nullify's Reviewed By: jeremydubreil Differential Revision: D3241019 fbshipit-source-id: 8409b33	9 years ago
Sam Blackshear	6f6da12b2c	don't nullify params/locals at beginning of procedure Reviewed By: jeremydubreil Differential Revision: D3258615 fb-gh-sync-id: 73e4670 fbshipit-source-id: 73e4670	9 years ago
Andrzej Kotulski	05c218d84f	Declare local variable for conditional in procdesc Summary:Local variable created by conditional operator translation is now declared in scope of whole procedure. Semantically there is no difference, hopefuly backend will not complain about this change. Also, nullifying that variable is deferred to preanalysis instead of calling it manually Reviewed By: jvillard Differential Revision: D3155733 fb-gh-sync-id: 6cec8fc fbshipit-source-id: 6cec8fc	9 years ago
Andrzej Kotulski	4584f7f6fc	[clang-format] Reformat all c/cpp/objc sources with clang-format Reviewed By: jul Differential Revision: https://phabricator.fb.com/D2953843	9 years ago
Dulma Rodriguez	ba00f08f00	Remove variable resolution and use pointers to declarations instead Summary: @public This removes the old way of finding variable declarations to create sil variables and replaces it with a a new way based on the map from pointers to declarations. Basically, every variable dereference contains a pointer to the variable declaration, with that we can build the corresponding sil variable. Reviewed By: @akotulski Differential Revision: D2536000 fb-gh-sync-id: dd29cf9	9 years ago
Andrzej Kotulski	7ac5a5c308	Refactor C frontend tests Reviewed By: @dulmarod Differential Revision: D2521747 fb-gh-sync-id: 1be8d21	9 years ago

36 Commits (6a1999730394ffccee3b9317a2e8ecfd7dc0379f)