infer_clone

Commit Graph

Author	SHA1	Message	Date
Jules Villard	4c48b79f6c	[siof] detect constexpr for all procedures Summary: Previously we were only taking constexpr into account on constructors. Add this info to ProcAttributes.t instead by exporting it from the plugin for all functions. This allows SIOF to take constexpr into account in more cases as it's not always good at capturing which functions can be constexpr-evaluated, which caused false positives. Delete now-useless is_constexpr in constructor types. This generated the changes in frontend tests. Some minor renamings of variants of is_const_expr -> is_constexpr. Reviewed By: da319 Differential Revision: D27503433 fbshipit-source-id: 3d1972900	4 years ago
Martin Trojer	18f28395e8	[clang] migrate to llvm/clang11 Summary: Update Infer to LLVM (clang) 11.1.0. Infer/clang now uses the LLVM 'monorepo' release, simplifying the download script. Some changes done to how/when ASTExporter mangles names, this to avoid the plugin hitting asserts in the clang code when mangling names. Reviewed By: jvillard Differential Revision: D27006986 fbshipit-source-id: 4d4b6ba05	4 years ago
Jules Villard	29f3941600	[clang] deal with conditionally-destroyed temporaries Summary: This was left as a TODO before: where to place calls to destructors for C++ temporaries that are only conditionally creating when evaluating an expression. This can happen inside the branches of a conditional operation `b?e:f` or in potentially-short-circuited conditions on the righ-hand side of `&&` and `\|\|` operators. Following the compilation scheme of clang (observed by looking at the generated LLVM bitcode), we instrument the program with "marker" variables, so that for instance `X x = true?X():y;` becomes (following the execution on the true branch): ``` marker1 = 0; // initialize all markers to 0 PRUNE(true) // entering true branch X::X(&temporary); // create temporary... marker1 = 1; // ...triggers setting its marker to 1 X::X(&x, &temporary); // finish expression if (marker1) { X::~X(&temporary); // conditionally destroy the temporary } ``` In this diff, you'll find code for: - associating markers to temporaries that need them - code to initialize markers to 0 before full-expressions - code to conditionally destroy temporaries based on the values of the markers once the full-expression has finished evaluating Reviewed By: da319 Differential Revision: D24954070 fbshipit-source-id: cf15df7f7	5 years ago
Jules Villard	f2e3f67f40	[clang] change the way we wire up return statements Summary: Split the translation of return more aggressively between: 1. the instruction that has to happen before the translation of the sub-expr 2. the sub-expr 3. the instruction that has to happen after the sub-expr This is needed for the next diff which creates potentially large CFGs in (2). Reviewed By: da319 Differential Revision: D24954071 fbshipit-source-id: a7e7e2527	5 years ago
Jules Villard	e32f6ca360	[clang] fix bad interaction between ConditionalOperator and initializers Summary: This is several inter-connected changes together to keep the tests happy. The ConditionalOperator `b?t:e` is translated by first creating a placeholder variable to temporarily store the result of the evaluation in each branch, then the real thing we want to assign to reads that variable. But, there are situations where that changes the semantics of the expression, namely when the value created is a struct on the stack (eg, a C++ temporary). This is because in SIL we cannot assign the address of a program variable, only its contents, so by the time we're out of the conditional operator we cannot set the struct value correctly anymore: we can only set its content, which we did, but that results in a "shifted" struct value that is one dereference away from where it should be. So a batch of changes concern `conditionalOperator_trans`: - instead of systematically creating a temporary for the conditional, use the `trans_state.var_exp_typ` provided from above if available when translating `ConditionalOperator` - don't even set anything if that variable was already initialized by merely translating the branch expression, eg when it's a constructor - fix long-standing TODO to propagate these initialization facts accurately for ConditionalOperator (used by `init_expr_trans` to also figure out if it should insert a store to the variable being initialised or not) The rest of the changes adapt some relevant other constructs to deal with conditionalOperator properly now that it can set the current variable itself, instead of storing stuff inside a temp variable. This change was a problem because some constructs, eg a variable declaration, will insert nodes that set up the variable before calling its initialization, and now the initialization happens before that setup, in the translation of the inner conditional operator, which naturally creates nodes above the current one. - add a generic helper to force a sequential order between two translation results, forcing node creation if necessary - use that in `init_expr_trans` and `cxxNewExpr_trans` - adjust many places where `var_exp_typ` was incorrectly not reset when translating sub-expressions The sequentiality business creates more nodes when used, and the conditionalOperator business uses fewer temporary variables, so the frontend results change quite a bit. Note that biabduction tests were invaluable in debugging this. There could be other constructs to adjust similarly to cxxNewExpr that were not covered by the tests though. Added tests in pulse that exercises the previous bug. Reviewed By: da319 Differential Revision: D24796282 fbshipit-source-id: 0790c8d17	5 years ago
Jules Villard	b6460870dc	[biabd] rename a test to follow naming conventions Reviewed By: ngorogiannis Differential Revision: D24794526 fbshipit-source-id: 9b2392c35	5 years ago
Jules Villard	78a33acb77	[cfg] run pre-analysis lazily in ondemand Summary: This also prints the CFGs after pre-analysis for individual procedures in infer-out/captured/<filename>/<proc>.dot. One can also look up the CFGs before pre-analysis in infer-out/captured/proc_cfgs_frontend.dot. Context: I want to add a pre-analysis that needs to look at proc attributes inter-procedurally. For this to make sense it has to happen after all of capture, and before analysis. Thus, this diff brings back the lazy running of the pre-analysis like in D15803492, except that we still make sure to run the pre-analyses systematically regardless of the checkers being run by running the pre-analysis from ondemand.ml. Also we don't need to re-introduce the "did_preanalysis" proc attribute for the same reason that the pre-analysis is now run once and for all by ondemand.ml (instead of each individual checker back in the days). This has the benefit of running the pre-analysis only when needed, and the drawback that several concurrent processes analysing the same proc descs will duplicate work. Since pre-analyses are supposed to be very fast I assume that neither is a big deal. If they become more expensive then the benefit gets bigger and the drawback is just the same as with regular analyses. Reviewed By: skcho Differential Revision: D18573920 fbshipit-source-id: de350eaef	6 years ago
Jules Villard	04233ee49b	[clang] destroy C++ temporaries Summary: Inject destructor calls to destroy a temporary when its lifetime ends. Reviewed By: mbouaziz Differential Revision: D15674209 fbshipit-source-id: 0f783a906	6 years ago
Jules Villard	db800f138b	[clang] rewrite scope computations Summary: This started as an attempt to understand how to modify the frontend to inject destructors for C++ temporaries (see next diffs). This diff rewrites the existing logic for computing the list of variables that should be destroyed at the end of each statement, either because it's the end of their syntactic scope or because control flow branches outside of their syntactic scope. The frontend translates a function from the last instructions to the first, but scope computation needs to be done in the other direction, so it's done in a separate pass before the main translation happens. That first pass creates a map from statements in the AST to the list of variables that should be destroyed at the end of these statements. This is still the case now. Before, that map would be computed in a bit of a weird way: scopes are naturally a stack but instead of that the structure maintained was a flat list + a counter to know where the current scope ended in that list. In this diff, redo the computation maintaining a stack of scopes instead, which is a bit cleaner. Also treat more instructions as introducing a new scope, eg if, for, ... Reviewed By: mbouaziz Differential Revision: D15674208 fbshipit-source-id: c92429e82	6 years ago
Jules Villard	eaa5c32432	[clang] some more debug info Summary: Somewhat trivial: add a string to "Destruction" nodes to indicate why they were created. Rename the main `instruction_aux` function into `instruction_translate` (see next diff for why). Reviewed By: mbouaziz Differential Revision: D15674211 fbshipit-source-id: 8a7eda72c	6 years ago
Josh Berdine	cfc1c8be36	[copyright] Remove years Reviewed By: jvillard Differential Revision: D15771884 fbshipit-source-id: e2997e3a3	6 years ago
Jules Villard	686231ec6e	[SIL] change `variable_initialization()` builtin to a new auxiliary instruction Summary: Instead of emitting an ad-hoc builtin on variable declaration emit a new metadata instruction. This allows us to remove the code matching on that ad-hoc builtin that had to be inserted in several checkers. Inferbo & pulse used that information meaningfully and had to undergo some minor changes to cope with the new metada instruction. Reviewed By: ezgicicek Differential Revision: D14833100 fbshipit-source-id: 9b3009d22	6 years ago
David Lively	5d4a27ea54	RFC: stop using _ to separate ObjC/C++ class name from method in Typ.Procname.to_string Reviewed By: jvillard Differential Revision: D14736442 fbshipit-source-id: 500df354b	6 years ago
Jules Villard	c3cadace86	[SIL][3/3] add CallFlag for synthetised destructor calls Summary: This will be used in the future to determine what to do with destructors in pulse. Reviewed By: mbouaziz Differential Revision: D14324759 fbshipit-source-id: bc3c34471	6 years ago
Jules Villard	1c668c4d41	[SIL][preanalysis] add call flag for functions treating first formal as return Summary: This helps some checkers and the liveness preanalysis. Reviewed By: da319 Differential Revision: D13102954 fbshipit-source-id: b8d3c5fe2	7 years ago
Jules Villard	55586b581b	[preanalysis] do not delay killing variables taken by reference Summary: Before, the liveness pre-analysis would place extra instructions in the CFG for either: 1. marking an `Ident.t` as dead, or 2. marking a `Pvar.t` as `= 0` But we have no way of marking pvars dead without setting them to 0. This is bad because setting pvars to 0 is not possible everywhere they are dead. Indeed, we only do it when we haven't seen their address being taken anyway. This prevents the following situation, recorded in our tests: ``` int address_taken() { int** x; int* y; int i = 7; y = &i; x = &y; // if we don't reason about taken addresses while adding nullify instructions, // we'll add // `nullify(y)` here and report a false NPE on the next line return **x; } ``` So we want to mark pvars as dead without nullifying them. This diff extends the `Remove_temps` SIL instruction to accept pvars as well, and so renames it to `ExitScope`. Reviewed By: da319 Differential Revision: D13102953 fbshipit-source-id: aa7f03a52	7 years ago
Jules Villard	646aa30797	[cfg] print dotty after pre-analysis Summary: Useful to understand the changes in the pre-analysis, or to inspect the CFG that checkers actually get. This means that the pre-analysis always runs when we output the dotty, but I don't really see a reason why not. In fact, we could probably always store the CFGs as pre-analysed. Reviewed By: mbouaziz Differential Revision: D13102952 fbshipit-source-id: 89f3102ec	7 years ago
Jules Villard	9aa5582caa	[clang] leave markers of variable initialization for pulse Summary: When initialising a variable via semi-exotic means, the frontend loses the information that the variable was initialised. For instance, it translates: ``` struct Foo { int i; }; ... Foo s = {42}; ``` as: ``` s.i := 42 ``` This can be confusing for backends that need to know that `s` actually got initialised, eg pulse. The solution implemented here is to insert of dummy call to `__variable_initiazition`: ``` __variable_initialization(&s); s.i := 42; ``` Then checkers can recognise that this builtin function does what its name says. Reviewed By: mbouaziz Differential Revision: D12887122 fbshipit-source-id: 6e7214438	7 years ago
Jules Villard	116ec5ae55	[clang] changes to accomodate the new version of clang Summary: New clang in the plugin \o/ Changes that were needed: - (minor) Some extra AST nodes - defining a lambda and calling it in the same line (`[&x]() { x = 1; }()`) used to get translated as a call of the literal but now an intermediate variable gets created, which confuses uninit in one test. I added another test to showcase the limitation this is hitting: storing the lambda in a variable then calling it will not get caught by the checker. The controller you requested could not be found.: facebook-clang-plugins Reviewed By: jeremydubreil Differential Revision: D10128626 fbshipit-source-id: 8ffd19f3c	7 years ago
Mehdi Bouaziz	ad986dffde	Get rid of Declare_locals Reviewed By: jeremydubreil Differential Revision: D9234331 fbshipit-source-id: 70443eabe	7 years ago
Dulma Churchill	91e0a7d1a3	[IR] Take parameters into account in to_filename method Reviewed By: mbouaziz Differential Revision: D8395033 fbshipit-source-id: 2dd285e	7 years ago
Dulma Churchill	79a8f8716c	[clang] Adding parameters as part of the procname for C++/ObjC methods and ObjC blocks Reviewed By: mbouaziz Differential Revision: D8237112 fbshipit-source-id: c0ec4b4	7 years ago
Jules Villard	8b882ac1df	Change license to MIT Summary: Change the license of the source code from BSD + PATENTS to MIT. Change `checkCopyright` to reflect the new license and learn some new file types. Generated with: ``` git grep BSD \| xargs -n 1 ./scripts/checkCopyright -i ``` Reviewed By: jeremydubreil, mbouaziz, jberdine Differential Revision: D8071249 fbshipit-source-id: 97ca23a	7 years ago
Jules Villard	766a16cd90	[clang] enforce that `instruction` always returns one SIL expression Summary: Previously, the type of `trans_result` contained a list of SIL expressions. However, most of the time we expect to get exactly one, and getting a different number is a soft(!) error, usually returning `-1`. This splits `trans_result` into `control`, which contains the information needed for temporary computation (hence when we don't necessarily know the return value yet), and a new version of `trans_result` that includes `control`, the previous `exps` list but replaced by a single `return` expression instead, and a couple other values that made sense to move out of `control`. This allows some flexibility in the frontend compared to enforcing exactly one return expression always: if they are not known yet we stick to `control` instead (see eg `compute_controls_to_parent`). This creates more garbage temporary identifiers, however they do not show up in the final cfg. Instead, we see that temporary IDs are now often not consecutive... The most painful complication is in the treatment of `DeclRefExpr`, which was actually returning two expressions: the method name and the `this` object. Now the method name is a separate (optional) field in `trans_result`. Reviewed By: mbouaziz Differential Revision: D7881088 fbshipit-source-id: 41ad3b5	7 years ago
Jules Villard	902de9d6e3	[sil] make return value and type mandatory Summary: This simplifies the frontends and backends in most cases. Before this diff, returning `void` could be modelled either with a `None` return, or a dummy return variable with type `Tvoid`. Now it's always the latter. Reviewed By: sblackshear, dulmarod Differential Revision: D7832938 fbshipit-source-id: 0a403d1	7 years ago
Jules Villard	73a47d594c	[debug] print procedures in alphabetical order in cfgs Summary: When looking at large CFGs, at least in `xdot`, it's often difficult to find the procedure you're looking for. Sorting the proc names puts them in alphabetical order, which makes searching one procedure easier. Reviewed By: mbouaziz Differential Revision: D7758521 fbshipit-source-id: 8e9997f	7 years ago
Ezgi Çiçek	523c2f539b	change clang translation to track if_kind (i.e. the type of prune node) Reviewed By: ddino Differential Revision: D7653684 fbshipit-source-id: d731ccf	7 years ago
Jules Villard	098b0700c2	[clang] upgrade internal clang Summary: Switch to the current stable branch for clang. update-submodule: facebook-clang-plugins Reviewed By: mbouaziz Differential Revision: D7067890 fbshipit-source-id: aedff90	7 years ago
Daiva Naudziuniene	1401696119	[destructors] Inject destructor calls even if the destructor declaration is empty Summary: We do not inject a destructor call if the destructor declaration does not contain a body in AST. We miss all the cases where the destructor is declared in `.h` file and defined in `.cpp` file as other files include `.h` file and do not contain the body of the destructor when destructor calls are being injected based on AST information. After this diff we inject destructor calls even if we do not have body for the destructor in AST. Reviewed By: sblackshear Differential Revision: D6796567 fbshipit-source-id: 1c187ec	8 years ago
Jules Villard	6b5390fe79	[cfg] rename iCFG to cfg in dotty files Summary: Not sure what an "iCFG" is but the dotty is only about CFGs anyway. Diff obtained by mass-`sed`. Reviewed By: sblackshear Differential Revision: D6324280 fbshipit-source-id: b7603bb	8 years ago
Jules Villard	94e7a7b141	[siof] one access per sink, better report deduplication Summary: The previous domain for SIOF was duplicating some work with the generic Trace domain, and basically was a bit confused and confusing. A sink was a set of global accesses, and a state contains a set of sinks. Then the checker has to needlessly jump through hoops to normalize this set of sets of accesses into a set of accesses. The new domain has one sink = one access, as suggested by sblackshear. This simplifies a few things, and makes the dedup logic much easier: just grab the first report of the list of reports for a function. We only report on the fake procedures generated to initialise a global, and the filtering means that we keep only one report per global. Reviewed By: sblackshear Differential Revision: D5932138 fbshipit-source-id: acb7285	8 years ago
Jules Villard	abee644b91	[clang] update clang plugin to hash mangled names Summary: With this change and the previous facebook-clang-plugins change, infer no longer exhausts the biniou buffer when reading the serialized C++ AST. update-submodule: facebook-clang-plugins Reviewed By: mbouaziz Differential Revision: D5891081 fbshipit-source-id: cf48eac	8 years ago
Jeremy Dubreil	919b9268d4	[infer][clang] simplify the translation of the prune nodes Summary: The prune nodes where translated as `prune (expr = false)` and `prune ( expr != false)`. This case is a bit tricky to deconstruct in HIL. This diff translates the prune instructions as just `prune !expr` for the true branch and `prune expr` for the false branch. Reviewed By: dulmarod Differential Revision: D5832147 fbshipit-source-id: 2c3502d	8 years ago
Jia Chen	c0e20e0880	Propagate C++ noexcept annotation from frontend to backend Reviewed By: jeremydubreil Differential Revision: D5167568 fbshipit-source-id: b562d5e	8 years ago
Andrzej Kotulski	462220ce3e	[typ] Print type qualifiers in Typ.pp_full Summary: Title. The way types are printed is completely valid, but little weird for some C++ programmers: `int const` - same as `const int` `int * const` - pointer is `const`, value under it is not `int const ` - pointer is not `const`, but the value is `int const const` - both pointer and value are const Reviewed By: jberdine Differential Revision: D4962180 fbshipit-source-id: dcb02e3	8 years ago
Andrzej Kotulski	62d1d74d74	[Typ] Change Typ.pp_full to not include class keywords Reviewed By: jvillard Differential Revision: D4843009 fbshipit-source-id: 5d0aaa3	8 years ago
Andrzej Kotulski	4da4949049	[clang][AST] Fix wrong type in translation of NoOp cast and MaterializeExpr Summary: This issue was spotted in the wild. There may be more of those, unfortunately it's hard to predict More general problem is that types in infer frontend diverge from clang's types for DerivedToBase cast. Then, infer uses types from clang anyway and that confuses backend. Getting it always right is very hard Reviewed By: jvillard Differential Revision: D4754081 fbshipit-source-id: 5fb7069	8 years ago
Andrzej Kotulski	24b56de0e9	Populate mangled file only if it's not empty Summary: We were including hex of empty string if mangled name was not empty (so for all C++ functions). Instead, include hex of a source file only if it's not empty Reviewed By: mbouaziz Differential Revision: D4705388 fbshipit-source-id: 55b6587	8 years ago
Andrzej Kotulski	6a02568982	[clang] Change procname file naming scheme Summary: Procnames files are now reversed qualifier lists with `#` as separator (instead of `::` which needs to be escaped in bash). Because of the mechanism that is used to obtain qualifiers, it also affects naming for ObjC classes. Examples: ``` std::unique_ptr<int>::get -> get#unique_ptr<int>#std#__MANGLED,...__ // C++ method folly::split -> split#folly#__MANGLED,..._ // function within namespace NSNumber numberWithBool: -> numberWithBool:#NSNumber#class // ObjC method ``` Reviewed By: jvillard Differential Revision: D4689701 fbshipit-source-id: c3acfc6	8 years ago
Jules Villard	e5863f5420	[siof] handle constexpr constructors Summary: Globals that are constexpr-initializable do not participate in SIOF. Reviewed By: sblackshear Differential Revision: D4277216 fbshipit-source-id: fd601c8	9 years ago
Josh Berdine	0cf71c74ef	Sort nodes when printing cfg to dot file Summary: Currently cfg nodes are written into dot files in whatever order they appear in a hash table. This seems unnecessarily sensitive, so this diff sorts the nodes. Reviewed By: dulmarod Differential Revision: D4232377 fbshipit-source-id: a907cc6	9 years ago
Sam Blackshear	708c0bf1f8	[backend] eliminate phantom spaces in printing of types Summary: These are dangerous if you are trying to compare a type to a string, and they're also unsightly. Reviewed By: jvillard Differential Revision: D4189956 fbshipit-source-id: 14ce127	9 years ago
Andrzej Kotulski	28827b461a	[clang] Get translation unit language from AST dump Summary: New version of clang plugin exports `-x` arg information as a part of TranslationUnitDecl. Get it from there instead of reading it from clang argv Reviewed By: jvillard Differential Revision: D4112652 fbshipit-source-id: 5c3af1f	9 years ago
Cristiano Calcagno	a71902355f	[debug][dotty] Fix issue in dotty output where overloaded functions were conflated Reviewed By: jvillard Differential Revision: D4117078 fbshipit-source-id: fdd8e93	9 years ago
Cristiano Calcagno	3fb8801b6c	[IR] Change cfg representation so the node number is per-procedure and not per-cfg Reviewed By: jeremydubreil Differential Revision: D4088075 fbshipit-source-id: 6e517a7	9 years ago
Cristiano Calcagno	847c141912	[tests] Clean up test files shared between frontend and endtoend tests Reviewed By: jberdine Differential Revision: D3893900 fbshipit-source-id: 497effc	9 years ago

46 Commits (f5fef60a42f53e667dfeb3b9a87908f2e5310e14)