infer_clone

Commit Graph

Author	SHA1	Message	Date
Martin Trojer	18f28395e8	[clang] migrate to llvm/clang11 Summary: Update Infer to LLVM (clang) 11.1.0. Infer/clang now uses the LLVM 'monorepo' release, simplifying the download script. Some changes done to how/when ASTExporter mangles names, this to avoid the plugin hitting asserts in the clang code when mangling names. Reviewed By: jvillard Differential Revision: D27006986 fbshipit-source-id: 4d4b6ba05	4 years ago
Jules Villard	ad45bbe28d	[clang] fix order of translation for [this] Summary: We need to make sure a node is created to avoid instructions appearing in the wrong order in the final CFG. Reviewed By: da319 Differential Revision: D25784405 fbshipit-source-id: 3ef27d712	4 years ago
Jules Villard	29f3941600	[clang] deal with conditionally-destroyed temporaries Summary: This was left as a TODO before: where to place calls to destructors for C++ temporaries that are only conditionally created when evaluating an expression. This can happen inside the branches of a conditional operation `b?e:f` or in potentially-short-circuited conditions on the right-hand side of `&&` and `||` operators. Following the compilation scheme of clang (observed by looking at the generated LLVM bitcode), we instrument the program with "marker" variables, so that for instance `X x = true?X():y;` becomes (following the execution on the true branch): ``` marker1 = 0; // initialize all markers to 0 PRUNE(true) // entering true branch X::X(&temporary); // create temporary... marker1 = 1; // ...triggers setting its marker to 1 X::X(&x, &temporary); // finish expression if (marker1) { X::~X(&temporary); // conditionally destroy the temporary } ``` In this diff, you'll find code for: - associating markers to temporaries that need them - code to initialize markers to 0 before full-expressions - code to conditionally destroy temporaries based on the values of the markers once the full-expression has finished evaluating Reviewed By: da319 Differential Revision: D24954070 fbshipit-source-id: cf15df7f7	4 years ago
Jules Villard	f2e3f67f40	[clang] change the way we wire up return statements Summary: Split the translation of return more aggressively between: 1. the instruction that has to happen before the translation of the sub-expr 2. the sub-expr 3. the instruction that has to happen after the sub-expr This is needed for the next diff which creates potentially large CFGs in (2). Reviewed By: da319 Differential Revision: D24954071 fbshipit-source-id: a7e7e2527	4 years ago
Jules Villard	e32f6ca360	[clang] fix bad interaction between ConditionalOperator and initializers Summary: This is several inter-connected changes together to keep the tests happy. The ConditionalOperator `b?t:e` is translated by first creating a placeholder variable to temporarily store the result of the evaluation in each branch, then the real thing we want to assign to reads that variable. But, there are situations where that changes the semantics of the expression, namely when the value created is a struct on the stack (eg, a C++ temporary). This is because in SIL we cannot assign the address of a program variable, only its contents, so by the time we're out of the conditional operator we cannot set the struct value correctly anymore: we can only set its content, which we did, but that results in a "shifted" struct value that is one dereference away from where it should be. So a batch of changes concern `conditionalOperator_trans`: - instead of systematically creating a temporary for the conditional, use the `trans_state.var_exp_typ` provided from above if available when translating `ConditionalOperator` - don't even set anything if that variable was already initialized by merely translating the branch expression, eg when it's a constructor - fix long-standing TODO to propagate these initialization facts accurately for ConditionalOperator (used by `init_expr_trans` to also figure out if it should insert a store to the variable being initialised or not) The rest of the changes adapt some relevant other constructs to deal with conditionalOperator properly now that it can set the current variable itself, instead of storing stuff inside a temp variable. This change was a problem because some constructs, eg a variable declaration, will insert nodes that set up the variable before calling its initialization, and now the initialization happens before that setup, in the translation of the inner conditional operator, which naturally creates nodes above the current one. - add a generic helper to force a sequential order between two translation results, forcing node creation if necessary - use that in `init_expr_trans` and `cxxNewExpr_trans` - adjust many places where `var_exp_typ` was incorrectly not reset when translating sub-expressions The sequentiality business creates more nodes when used, and the conditionalOperator business uses fewer temporary variables, so the frontend results change quite a bit. Note that biabduction tests were invaluable in debugging this. There could be other constructs to adjust similarly to cxxNewExpr that were not covered by the tests though. Added tests in pulse that exercises the previous bug. Reviewed By: da319 Differential Revision: D24796282 fbshipit-source-id: 0790c8d17	4 years ago
Daiva Naudziuniene	91a33f6edc	[frontend] Captured struct variables in cpp lambdas Summary: Structs captured both by reference or by value should have reference in their type. Struct captured by value should first call copy constructor. In this diff we fix the type of the captured variable to include reference. Copy constructor injection is left for the future. Reviewed By: jvillard Differential Revision: D23688713 fbshipit-source-id: d13748b5d	4 years ago
Daiva Naudziuniene	857daf63c9	[frontend] Capture reference variables Summary: Variables captured without initialization do not have correct type inside lambda's body. This diff sets the correct type of captured reference variables inside procdesc and makes sure the translation of captured variables is correct. The translation of lambda's body will then take into account the type of captured var from procdesc. Reviewed By: jvillard Differential Revision: D23678371 fbshipit-source-id: ed16dc978	4 years ago
Daiva Naudziuniene	42abe5b277	[frontend] Fix type of captured vars in lambda's body Summary: Add missing reference to the type of variable captured by reference without initialization. Reviewed By: jvillard Differential Revision: D23567685 fbshipit-source-id: b4e2ac0b6	4 years ago
Daiva Naudziuniene	d0cb245303	[frontend] Fix capture init for cpp lambdas Summary: We were missing assignment to captured variables with initializers. Consider the following example: ``` S* update_inside_lambda_capture_and_init(S* s) { S* object = nullptr; auto f = [& o = object](S* s) { o = s; }; f(s); return object; } ``` which was translated to ``` VARIABLE_DECLARED(o:S&); &o:S&=&object &f =(_fun...lambda..._operator(),([by ref]&o &o:S&)) ``` However, we want to capture `o` (which is an address of `object`), rather than `&o` in closure. After the diff ``` VARIABLE_DECLARED(o:S&); &o:S&=&object n$7=&o:S& &f =(_fun...lambda..._operator(),([by ref]n$7 &o:S&)) ``` Reviewed By: jvillard Differential Revision: D23567346 fbshipit-source-id: 20f77acc2	4 years ago
Daiva Naudziuniene	1c5e47d91e	[frontend] Record lambda's captured variables in `operator()` procdesc Summary: Lambda is called using `operator()`. We need to know the information of captured variables when `operator()` procedure is being analysed. This diff records lambda captured variables in `operator()` procdesc. The complication arises from the fact that procdesc for `operator()` is created before translating lambda expression or during the translation of lambda expression where captured variables are translated. To solve this issue, we update existing `operator()` procdesc attributes with captured variable information when we translate lambda expression. Reviewed By: jvillard Differential Revision: D22374495 fbshipit-source-id: 44909adea	4 years ago
Daiva Naudziuniene	50d659b750	Update type of procdesc and closure expression to contain information about capture variable mode Summary: We update the type of captured variables to include information about capture mode (`ByReference` or `ByValue`) both for procdesc attributes and the closure expression. For lambda: closure expression now contains correct capture mode for capture variables. Procdesc still does not contain information about captured variables which we will address in the next diff. For objc blocks: at the moment all captured variables have mode `ByReference`. Added TODOs to fix this. Reviewed By: jvillard Differential Revision: D22572054 fbshipit-source-id: 4c88678ee	4 years ago
Jules Villard	78a33acb77	[cfg] run pre-analysis lazily in ondemand Summary: This also prints the CFGs after pre-analysis for individual procedures in infer-out/captured/<filename>/<proc>.dot. One can also look up the CFGs before pre-analysis in infer-out/captured/proc_cfgs_frontend.dot. Context: I want to add a pre-analysis that needs to look at proc attributes inter-procedurally. For this to make sense it has to happen after all of capture, and before analysis. Thus, this diff brings back the lazy running of the pre-analysis like in D15803492, except that we still make sure to run the pre-analyses systematically regardless of the checkers being run by running the pre-analysis from ondemand.ml. Also we don't need to re-introduce the "did_preanalysis" proc attribute for the same reason that the pre-analysis is now run once and for all by ondemand.ml (instead of each individual checker back in the days). This has the benefit of running the pre-analysis only when needed, and the drawback that several concurrent processes analysing the same proc descs will duplicate work. Since pre-analyses are supposed to be very fast I assume that neither is a big deal. If they become more expensive then the benefit gets bigger and the drawback is just the same as with regular analyses. Reviewed By: skcho Differential Revision: D18573920 fbshipit-source-id: de350eaef	5 years ago
Jules Villard	b03ca78bf3	[pdesc][refactor] ability to set normal and exceptional succs independently Summary: - more flexible API - less error-prone thanks to named parameters - also takes care of adjusting predecessors of the previous successors! This fixes some (probably harmless) bugs in the frontends. Reviewed By: dulmarod Differential Revision: D18573923 fbshipit-source-id: ad97b3607	5 years ago
Jules Villard	04233ee49b	[clang] destroy C++ temporaries Summary: Inject destructor calls to destroy a temporary when its lifetime ends. Reviewed By: mbouaziz Differential Revision: D15674209 fbshipit-source-id: 0f783a906	6 years ago
Jules Villard	db800f138b	[clang] rewrite scope computations Summary: This started as an attempt to understand how to modify the frontend to inject destructors for C++ temporaries (see next diffs). This diff rewrites the existing logic for computing the list of variables that should be destroyed at the end of each statement, either because it's the end of their syntactic scope or because control flow branches outside of their syntactic scope. The frontend translates a function from the last instructions to the first, but scope computation needs to be done in the other direction, so it's done in a separate pass before the main translation happens. That first pass creates a map from statements in the AST to the list of variables that should be destroyed at the end of these statements. This is still the case now. Before, that map would be computed in a bit of a weird way: scopes are naturally a stack but instead of that the structure maintained was a flat list + a counter to know where the current scope ended in that list. In this diff, redo the computation maintaining a stack of scopes instead, which is a bit cleaner. Also treat more instructions as introducing a new scope, eg if, for, ... Reviewed By: mbouaziz Differential Revision: D15674208 fbshipit-source-id: c92429e82	6 years ago
Jules Villard	eaa5c32432	[clang] some more debug info Summary: Somewhat trivial: add a string to "Destruction" nodes to indicate why they were created. Rename the main `instruction_aux` function into `instruction_translate` (see next diff for why). Reviewed By: mbouaziz Differential Revision: D15674211 fbshipit-source-id: 8a7eda72c	6 years ago
Jules Villard	686231ec6e	[SIL] change `variable_initialization()` builtin to a new auxiliary instruction Summary: Instead of emitting an ad-hoc builtin on variable declaration emit a new metadata instruction. This allows us to remove the code matching on that ad-hoc builtin that had to be inserted in several checkers. Inferbo & pulse used that information meaningfully and had to undergo some minor changes to cope with the new metadata instruction. Reviewed By: ezgicicek Differential Revision: D14833100 fbshipit-source-id: 9b3009d22	6 years ago
David Lively	5d4a27ea54	RFC: stop using _ to separate ObjC/C++ class name from method in Typ.Procname.to_string Reviewed By: jvillard Differential Revision: D14736442 fbshipit-source-id: 500df354b	6 years ago
Jules Villard	c3cadace86	[SIL][3/3] add CallFlag for synthetised destructor calls Summary: This will be used in the future to determine what to do with destructors in pulse. Reviewed By: mbouaziz Differential Revision: D14324759 fbshipit-source-id: bc3c34471	6 years ago
Jules Villard	1c668c4d41	[SIL][preanalysis] add call flag for functions treating first formal as return Summary: This helps some checkers and the liveness preanalysis. Reviewed By: da319 Differential Revision: D13102954 fbshipit-source-id: b8d3c5fe2	6 years ago
Jules Villard	55586b581b	[preanalysis] do not delay killing variables taken by reference Summary: Before, the liveness pre-analysis would place extra instructions in the CFG for either: 1. marking an `Ident.t` as dead, or 2. marking a `Pvar.t` as `= 0` But we have no way of marking pvars dead without setting them to 0. This is bad because setting pvars to 0 is not possible everywhere they are dead. Indeed, we only do it when we haven't seen their address being taken anyway. This prevents the following situation, recorded in our tests: ``` int address_taken() { int** x; int* y; int i = 7; y = &i; x = &y; // if we don't reason about taken addresses while adding nullify instructions, // we'll add // `nullify(y)` here and report a false NPE on the next line return **x; } ``` So we want to mark pvars as dead without nullifying them. This diff extends the `Remove_temps` SIL instruction to accept pvars as well, and so renames it to `ExitScope`. Reviewed By: da319 Differential Revision: D13102953 fbshipit-source-id: aa7f03a52	6 years ago
Jules Villard	646aa30797	[cfg] print dotty after pre-analysis Summary: Useful to understand the changes in the pre-analysis, or to inspect the CFG that checkers actually get. This means that the pre-analysis always runs when we output the dotty, but I don't really see a reason why not. In fact, we could probably always store the CFGs as pre-analysed. Reviewed By: mbouaziz Differential Revision: D13102952 fbshipit-source-id: 89f3102ec	6 years ago
Daiva Naudziuniene	2c06254800	[pulse] False positive caused by multiple variables captured by value in lambda Summary: Update clang plugin which now gives names to variables captured by lambdas that were empty before. update-submodule: facebook-clang-plugins Reviewed By: jvillard Differential Revision: D12979015 fbshipit-source-id: 0b092fb24	6 years ago
Jules Villard	9aa5582caa	[clang] leave markers of variable initialization for pulse Summary: When initialising a variable via semi-exotic means, the frontend loses the information that the variable was initialised. For instance, it translates: ``` struct Foo { int i; }; ... Foo s = {42}; ``` as: ``` s.i := 42 ``` This can be confusing for backends that need to know that `s` actually got initialised, eg pulse. The solution implemented here is to insert a dummy call to `__variable_initialization`: ``` __variable_initialization(&s); s.i := 42; ``` Then checkers can recognise that this builtin function does what its name says. Reviewed By: mbouaziz Differential Revision: D12887122 fbshipit-source-id: 6e7214438	6 years ago
Jules Villard	116ec5ae55	[clang] changes to accommodate the new version of clang Summary: New clang in the plugin \o/ Changes that were needed: - (minor) Some extra AST nodes - defining a lambda and calling it in the same line (`[&x]() { x = 1; }()`) used to get translated as a call of the literal but now an intermediate variable gets created, which confuses uninit in one test. I added another test to showcase the limitation this is hitting: storing the lambda in a variable then calling it will not get caught by the checker. The controller you requested could not be found.: facebook-clang-plugins Reviewed By: jeremydubreil Differential Revision: D10128626 fbshipit-source-id: 8ffd19f3c	6 years ago
Mehdi Bouaziz	ad986dffde	Get rid of Declare_locals Reviewed By: jeremydubreil Differential Revision: D9234331 fbshipit-source-id: 70443eabe	6 years ago
Dulma Churchill	79a8f8716c	[clang] Adding parameters as part of the procname for C++/ObjC methods and ObjC blocks Reviewed By: mbouaziz Differential Revision: D8237112 fbshipit-source-id: c0ec4b4	7 years ago
Jules Villard	8b882ac1df	Change license to MIT Summary: Change the license of the source code from BSD + PATENTS to MIT. Change `checkCopyright` to reflect the new license and learn some new file types. Generated with: ``` git grep BSD | xargs -n 1 ./scripts/checkCopyright -i ``` Reviewed By: jeremydubreil, mbouaziz, jberdine Differential Revision: D8071249 fbshipit-source-id: 97ca23a	7 years ago
Jules Villard	766a16cd90	[clang] enforce that `instruction` always returns one SIL expression Summary: Previously, the type of `trans_result` contained a list of SIL expressions. However, most of the time we expect to get exactly one, and getting a different number is a soft(!) error, usually returning `-1`. This splits `trans_result` into `control`, which contains the information needed for temporary computation (hence when we don't necessarily know the return value yet), and a new version of `trans_result` that includes `control`, the previous `exps` list but replaced by a single `return` expression instead, and a couple other values that made sense to move out of `control`. This allows some flexibility in the frontend compared to enforcing exactly one return expression always: if they are not known yet we stick to `control` instead (see eg `compute_controls_to_parent`). This creates more garbage temporary identifiers, however they do not show up in the final cfg. Instead, we see that temporary IDs are now often not consecutive... The most painful complication is in the treatment of `DeclRefExpr`, which was actually returning two expressions: the method name and the `this` object. Now the method name is a separate (optional) field in `trans_result`. Reviewed By: mbouaziz Differential Revision: D7881088 fbshipit-source-id: 41ad3b5	7 years ago
Jules Villard	902de9d6e3	[sil] make return value and type mandatory Summary: This simplifies the frontends and backends in most cases. Before this diff, returning `void` could be modelled either with a `None` return, or a dummy return variable with type `Tvoid`. Now it's always the latter. Reviewed By: sblackshear, dulmarod Differential Revision: D7832938 fbshipit-source-id: 0a403d1	7 years ago
Jules Villard	73a47d594c	[debug] print procedures in alphabetical order in cfgs Summary: When looking at large CFGs, at least in `xdot`, it's often difficult to find the procedure you're looking for. Sorting the proc names puts them in alphabetical order, which makes searching one procedure easier. Reviewed By: mbouaziz Differential Revision: D7758521 fbshipit-source-id: 8e9997f	7 years ago
Sam Blackshear	aca9d034a7	[clang] translate capture-by-reference correctly Summary: The Clang frontend previously didn't distinguish between capture-by-value and capture-by-reference. Reviewed By: dulmarod Differential Revision: D7100561 fbshipit-source-id: 3ef168e	7 years ago
Daiva Naudziuniene	1401696119	[destructors] Inject destructor calls even if the destructor declaration is empty Summary: We do not inject a destructor call if the destructor declaration does not contain a body in AST. We miss all the cases where the destructor is declared in `.h` file and defined in `.cpp` file as other files include `.h` file and do not contain the body of the destructor when destructor calls are being injected based on AST information. After this diff we inject destructor calls even if we do not have body for the destructor in AST. Reviewed By: sblackshear Differential Revision: D6796567 fbshipit-source-id: 1c187ec	7 years ago
Sam Blackshear	9366e8dbc8	[clang] add id -> pvar bindings to C++ lambda capture Summary: In Obj-C blocks, we explicitly insert reads of the captured vars. This does the same thing for C++. For example, `foo() { int x = 1; [x]() { return x; } }` would previously not contain a read of `x` in `foo`. Now, we'll create a temporary that reads from `x` and pass it to the closure value. Reviewed By: dulmarod Differential Revision: D6939997 fbshipit-source-id: f218afc	7 years ago
Sam Blackshear	3d170a82c4	[clang] translate lambdas that capture `this` Summary: Capturing this is implicit in the Clang AST. Reviewed By: da319 Differential Revision: D6933848 fbshipit-source-id: 7ab9ae9	7 years ago
Jules Villard	6b5390fe79	[cfg] rename iCFG to cfg in dotty files Summary: Not sure what an "iCFG" is but the dotty is only about CFGs anyway. Diff obtained by mass-`sed`. Reviewed By: sblackshear Differential Revision: D6324280 fbshipit-source-id: b7603bb	7 years ago
Jules Villard	94e7a7b141	[siof] one access per sink, better report deduplication Summary: The previous domain for SIOF was duplicating some work with the generic Trace domain, and basically was a bit confused and confusing. A sink was a set of global accesses, and a state contains a set of sinks. Then the checker has to needlessly jump through hoops to normalize this set of sets of accesses into a set of accesses. The new domain has one sink = one access, as suggested by sblackshear. This simplifies a few things, and makes the dedup logic much easier: just grab the first report of the list of reports for a function. We only report on the fake procedures generated to initialise a global, and the filtering means that we keep only one report per global. Reviewed By: sblackshear Differential Revision: D5932138 fbshipit-source-id: acb7285	7 years ago
Jules Villard	abee644b91	[clang] update clang plugin to hash mangled names Summary: With this change and the previous facebook-clang-plugins change, infer no longer exhausts the biniou buffer when reading the serialized C++ AST. update-submodule: facebook-clang-plugins Reviewed By: mbouaziz Differential Revision: D5891081 fbshipit-source-id: cf48eac	7 years ago
Sam Blackshear	14fa4aa7d9	[clang][dead stores] translate init-capture expressions Summary: Not translating these properly was causing false positives for the dead store analysis in cases like ``` int i = 0; return [j = i]() { return j; }(); ``` Reviewed By: da319 Differential Revision: D5731562 fbshipit-source-id: ae79ac8	7 years ago
Sam Blackshear	cb9c768c61	[clang] translate vars captured by lambda Reviewed By: jeremydubreil Differential Revision: D5482634 fbshipit-source-id: 86afb05	7 years ago
Mehdi Bouaziz	be0c53ddf3	[cpp] Fix failure with c++14 init-capture Summary: This used to fail update-submodule: facebook-clang-plugins Reviewed By: jvillard Differential Revision: D5406280 fbshipit-source-id: c7233b3	7 years ago
Andrzej Kotulski	62d1d74d74	[Typ] Change Typ.pp_full to not include class keywords Reviewed By: jvillard Differential Revision: D4843009 fbshipit-source-id: 5d0aaa3	8 years ago
Andrzej Kotulski	24b56de0e9	Populate mangled file only if it's not empty Summary: We were including hex of empty string if mangled name was not empty (so for all C++ functions). Instead, include hex of a source file only if it's not empty Reviewed By: mbouaziz Differential Revision: D4705388 fbshipit-source-id: 55b6587	8 years ago
Andrzej Kotulski	6a02568982	[clang] Change procname file naming scheme Summary: Procnames files are now reversed qualifier lists with `#` as separator (instead of `::` which needs to be escaped in bash). Because of the mechanism that is used to obtain qualifiers, it also affects naming for ObjC classes. Examples: ``` std::unique_ptr<int>::get -> get#unique_ptr<int>#std#__MANGLED,...__ // C++ method folly::split -> split#folly#__MANGLED,..._ // function within namespace NSNumber numberWithBool: -> numberWithBool:#NSNumber#class // ObjC method ``` Reviewed By: jvillard Differential Revision: D4689701 fbshipit-source-id: c3acfc6	8 years ago
Jules Villard	e5863f5420	[siof] handle constexpr constructors Summary: Globals that are constexpr-initializable do not participate in SIOF. Reviewed By: sblackshear Differential Revision: D4277216 fbshipit-source-id: fd601c8	8 years ago
Josh Berdine	0cf71c74ef	Sort nodes when printing cfg to dot file Summary: Currently cfg nodes are written into dot files in whatever order they appear in a hash table. This seems unnecessarily sensitive, so this diff sorts the nodes. Reviewed By: dulmarod Differential Revision: D4232377 fbshipit-source-id: a907cc6	8 years ago
Sam Blackshear	708c0bf1f8	[backend] eliminate phantom spaces in printing of types Summary: These are dangerous if you are trying to compare a type to a string, and they're also unsightly. Reviewed By: jvillard Differential Revision: D4189956 fbshipit-source-id: 14ce127	8 years ago
Cristiano Calcagno	a71902355f	[debug][dotty] Fix issue in dotty output where overloaded functions were conflated Reviewed By: jvillard Differential Revision: D4117078 fbshipit-source-id: fdd8e93	8 years ago
Cristiano Calcagno	3fb8801b6c	[IR] Change cfg representation so the node number is per-procedure and not per-cfg Reviewed By: jeremydubreil Differential Revision: D4088075 fbshipit-source-id: 6e517a7	8 years ago
Cristiano Calcagno	847c141912	[tests] Clean up test files shared between frontend and endtoend tests Reviewed By: jberdine Differential Revision: D3893900 fbshipit-source-id: 497effc	8 years ago

50 Commits (d198cb855df5df745a645075a23d233980d99dd8)