infer_clone

Commit Graph

Author	SHA1	Message	Date
Jules Villard	e32f6ca360	[clang] fix bad interaction between ConditionalOperator and initializers Summary: This is several inter-connected changes together to keep the tests happy. The ConditionalOperator `b?t:e` is translated by first creating a placeholder variable to temporarily store the result of the evaluation in each branch, then the real thing we want to assign to reads that variable. But, there are situations where that changes the semantics of the expression, namely when the value created is a struct on the stack (eg, a C++ temporary). This is because in SIL we cannot assign the address of a program variable, only its contents, so by the time we're out of the conditional operator we cannot set the struct value correctly anymore: we can only set its content, which we did, but that results in a "shifted" struct value that is one dereference away from where it should be. So a batch of changes concern `conditionalOperator_trans`: - instead of systematically creating a temporary for the conditional, use the `trans_state.var_exp_typ` provided from above if available when translating `ConditionalOperator` - don't even set anything if that variable was already initialized by merely translating the branch expression, eg when it's a constructor - fix long-standing TODO to propagate these initialization facts accurately for ConditionalOperator (used by `init_expr_trans` to also figure out if it should insert a store to the variable being initialised or not) The rest of the changes adapt some relevant other constructs to deal with conditionalOperator properly now that it can set the current variable itself, instead of storing stuff inside a temp variable. This change was a problem because some constructs, eg a variable declaration, will insert nodes that set up the variable before calling its initialization, and now the initialization happens before that setup, in the translation of the inner conditional operator, which naturally creates nodes above the current one. - add a generic helper to force a sequential order between two translation results, forcing node creation if necessary - use that in `init_expr_trans` and `cxxNewExpr_trans` - adjust many places where `var_exp_typ` was incorrectly not reset when translating sub-expressions The sequentiality business creates more nodes when used, and the conditionalOperator business uses fewer temporary variables, so the frontend results change quite a bit. Note that biabduction tests were invaluable in debugging this. There could be other constructs to adjust similarly to cxxNewExpr that were not covered by the tests though. Added tests in pulse that exercises the previous bug. Reviewed By: da319 Differential Revision: D24796282 fbshipit-source-id: 0790c8d17	4 years ago
Sungkeun Cho	198c700e87	[cost] Add an option printing/suppressing function pointers in cost Summary: This diff adds an option hiding function pointers in costs to users: `cost-suppress-func-ptr` is true by default. Reviewed By: ezgicicek Differential Revision: D24448212 fbshipit-source-id: 88f6b5ea1	4 years ago
Sungkeun Cho	0f44f39b41	[cost] Support closure symbols in operation/allocation cost kinds Summary: This diff adds closure symbols to operation/allocation costs, when function pointer is called. Reviewed By: ezgicicek Differential Revision: D24308550 fbshipit-source-id: 6c5889d41	4 years ago
Sungkeun Cho	4ef0a787db	[cost] Do not print loop line number in trace message Summary: This diff prevents printing line numbers of loop in the trace description, which helps to keep the same descriptions even when the line number of a function is changed in tests. Reviewed By: ezgicicek Differential Revision: D22375584 fbshipit-source-id: 676d1a7cc	4 years ago
Ezgi Çiçek	6d7df79573	[cost] Brush up tests (4) Summary: Fix misleading test names. Correct comments/FP/ok/bad markers. Remove reduntant tests. To be continued... Reviewed By: skcho Differential Revision: D21973583 fbshipit-source-id: bdf5241ab	5 years ago
Ezgi Çiçek	289d64260e	[cost] Record cost traces in cost-report.json Summary: We do not use an arbitrary threshold to test cost results anymore but instead rely on `cost-issues` which do not have any trace attached. This diff adds traces to `costs-report.json` so that we can test cost issues with traces. Reviewed By: skcho Differential Revision: D21858846 fbshipit-source-id: e73321a92	5 years ago
Ezgi Çiçek	2b40497e88	[cost] Remove cost threshold from tests Summary: Now that we have a way to write cost issues, let's not rely on some arbitrary threshold (and also get rid of `EXPENSIVE_EXECUTION_TIME` issues in tests). One consequence of this is that we will loose the cost traces in tests since `costs-report.json` doesn't have any traces. Next diff fixes that. Reviewed By: skcho Differential Revision: D21837574 fbshipit-source-id: 86b4d028d	5 years ago
Ezgi Çiçek	4858d29147	[cost] Add ability to test costs-report.json Summary: In order to test cost analysis results, currently we rely on having an arbitrary cost threshold (200) and report issues that exceed this cost. For instance, a cost of 201 is considered expensive and reported as `EXPENSIVE_EXECUTION_TIME` issue in cost tests. This means, if we change the cost analysis in a slight way that results in some constant cost increase under 200, we wouldn't able to detect it. I find this unsatisfactory and somewhat hacky. This diff adds the ability to write the result of `costs-report.json` into a separate `cost-issues.exp` and then compare the actual costs (not only than relying on this arbitrary threshold reporting mechanism). Reviewed By: skcho Differential Revision: D21816312 fbshipit-source-id: 93b531928	5 years ago
Jules Villard	b1e35a728d	[biabd] rename test directories from {biabduction,errors,infer} to {biabduction} Summary: The directory names had some interesting variety due to historical reasons. - {c,cpp,objc,objcpp}/errors/ date from the time when infer was only biabduction - java/infer/ dates from the time when we had an "--analyzer" option and "infer" was one of them (sic), and eg another was "eradicate". - c/biabduction/ dates from the time when the biabduction analysis was being migrated to the "checkers" (AI) framework. For some reasons the tests there are not a subset of c/infer/ but seem to be entirely new tests. The convention now dictates that we should name all of these */biabduction/. This diff moves the existing tests from c/biabduction/ into c/biabduction/misc/. Reviewed By: mityal Differential Revision: D21300147 fbshipit-source-id: 516d1cb15	5 years ago
Ezgi Çiçek	7deaae6598	[cost] Rename ZERO_* to *_UNREACHABLE_AT_EXIT Summary: This diff renames `ZERO_XXX` issues to more appropriately named and descriptive `XXX_UNREACHABLE_AT_EXIT` and replaces bottom with unreachable in cost kinds and issues. Reviewed By: skcho Differential Revision: D20140301 fbshipit-source-id: eb6076b30	5 years ago
Ezgi Çiçek	ebbc0fc7f2	[cost] Add traces for ZERO_* issues Summary: The issue type `ZERO_EXECUTION_TIME` actually corresponds to bottom state but has been mistakenly used to mean - unreachable nodes (program never reaching exit state) - having zero cost (e.g. for allocations). Note that, for execution costs, the latter doesn't make sense since we always incur a unit cost for the start node. Hence, a function with empty body will have unit cost. For allocations or IO however, we only incur costs for specific primitives, so a function with no allocations/IO could have a zero cost. However, there is no point reporting functions with zero cost as a specific issue type. Instead, what we want to track is the former, i.e. functions whose cost becomes 0 due to program never reaching exit state. This diff aims to split these cases into two by only reporting on the latter and adds traces to bottom/unreachable cost by creating a special category in polynomials. Next diff will rename `ZERO_XXX` to `XXX_UNREACHABLE_AT_EXIT`. Reviewed By: skcho Differential Revision: D20005774 fbshipit-source-id: 46b9abd5a	5 years ago
Ezgi Çiçek	cbd506011f	[cost] Add tests for ZERO_EXECUTION_COST Summary: We had no tests that resulted in `ZERO_EXECUTION_COST`. Let's fix that. Reviewed By: skcho Differential Revision: D20097504 fbshipit-source-id: 56c23fea0	5 years ago
Sungkeun Cho	88813fdaa7	[inferbo] Revise division by constant Summary: This diff tries to make less imprecise division by constants results. For example, the results of the division `[l, u] / c`, where `c` is a positive constant, are: 1. If `l/c` or `u/c` is representable in the bound domain, it uses the precise bounds, i.e., `[l/c, u/c]`. 2. If it is not representable, it tries to make conservative results: if `0<=l<=u`, it returns `[0, u]` because `0 <= [l/c, u/c] <= u` if `l<=u<=0`, it returns `[l, 0]` because `l <= [l/c, u/c] <= 0` if `l<=0<=u`, it returns `[l, u]` because `l <= [l/c, u/c] <= u` 3. otherwise, it returns top, `[-oo, +oo]` Reviewed By: ezgicicek Differential Revision: D18270380 fbshipit-source-id: 8fb14c0e4	5 years ago
Sungkeun Cho	21c890f23d	[inferbo] Revise widen of bounds Summary: This diff revises widening functions of bounds that have a linear form and a min/max form. For example, for lower bounds, * 3 ▽ (1+min(2, x)) = (1+min(2, x)) * 3+x ▽ (3+min(2, x)) = (3+min(2, x)) Reviewed By: jvillard Differential Revision: D17420786 fbshipit-source-id: ff9eebed3	5 years ago
Sungkeun Cho	f79871c5fa	[cost] Ignore character symbols in the cost results Summary: This diff ignores character symbols in the cost results, in order to avoid FPs from parser code. Reviewed By: ezgicicek Differential Revision: D17132053 fbshipit-source-id: d9cf8bd26	5 years ago
Ezgi Çiçek	5fa9f89285	[cost] Fix misleading test names Summary: Functions with empty body have unit cost, not zero. The unit cost comes from the start node. Reviewed By: skcho Differential Revision: D16855642 fbshipit-source-id: 6b5181faf	5 years ago
Ezgi Çiçek	89782dfff9	[cost] Mask min/max symbols when printing big O Summary: We want to keep big O notation as simple as possible in cost analysis reports (especially in diff time). Therefore, let's not show constants/min/max in big O notations even though the resulting asymptotic bound might be inaccurate. Developers can click on the trace and see the actual cost. Reviewed By: skcho Differential Revision: D16731351 fbshipit-source-id: 2e16f7eca	5 years ago
Ezgi Çiçek	9c5b704ddd	[cost] Record bigO in error trace description Summary: In order to test changes to bigO notation, let's record them in test results. Reviewed By: skcho Differential Revision: D16763972 fbshipit-source-id: c1376909b	5 years ago
Ezgi Çiçek	0f43930f40	[cost] Refactor cost issue types and enable detecting allocation complexity increase on cold start Summary: - Add allocation costs to `costs-report.json` and enable diffing over allocation costs. - Also, let's be more consistent and modular in naming our cost issues. - introduce a generic issue type `X_TIME_COMPLEXITY_INCREASE` where `X` can be one of the cost kinds. If the function is on the cold start, issue can have the `COLD_START` suffix. Similarly for infinite/zero/expensive calls. - Change `PERFORMANCE_VARIATION` -> `EXECUTION_TIME_COMPLEXITY_INCREASE` - Add new issue type for `ALLOCATION_COMPLEXITY_INCREASE_COLD_START` which will be enabled by default - Refactor cost issues to be more modular and succinct. This also makes addition of a new cost kind very easy by adding the kind into the `enabled_cost_kinds` list in `CostKind.ml` Reviewed By: mbouaziz Differential Revision: D15822681 fbshipit-source-id: cf89ece59	6 years ago
Josh Berdine	cfc1c8be36	[copyright] Remove years Reviewed By: jvillard Differential Revision: D15771884 fbshipit-source-id: e2997e3a3	6 years ago
Jules Villard	686231ec6e	[SIL] change `variable_initialization()` builtin to a new auxiliary instruction Summary: Instead of emitting an ad-hoc builtin on variable declaration emit a new metadata instruction. This allows us to remove the code matching on that ad-hoc builtin that had to be inserted in several checkers. Inferbo & pulse used that information meaningfully and had to undergo some minor changes to cope with the new metada instruction. Reviewed By: ezgicicek Differential Revision: D14833100 fbshipit-source-id: 9b3009d22	6 years ago
Ezgi Çiçek	1884994cc0	[cost] Allow program variables to occur in control variables Reviewed By: mbouaziz Differential Revision: D14439186 fbshipit-source-id: 41f33b1a8	6 years ago
Dino Distefano	67b42bf021	Added new issue types for Allocation and IO Reviewed By: ezgicicek Differential Revision: D14437988 fbshipit-source-id: 3e107d9e9	6 years ago
Mehdi Bouaziz	564d0113b4	[Cost] More precise traces for Top Reviewed By: ezgicicek Differential Revision: D14350180 fbshipit-source-id: 2cb8b0cd0	6 years ago
Mehdi Bouaziz	264a97794d	[inferbo] Exact result for (c1 - max(d, x)) + (c2 + x) Reviewed By: skcho Differential Revision: D14185196 fbshipit-source-id: 92d8430a1	6 years ago
Mehdi Bouaziz	b48884bce7	[Cost] Traces for Top values Reviewed By: ezgicicek Differential Revision: D14247738 fbshipit-source-id: 4270649d2	6 years ago
Sungkeun Cho	a56902dc9b	[inferbo] Widening threshold by comparison Summary: This diff adds a constant to the set of widening thresholds if the constant is compared to an abstract value in condition expressions. Each abstract value has its own set of thresholds. Reviewed By: mbouaziz Differential Revision: D14147150 fbshipit-source-id: ca0db34d4	6 years ago
Ezgi Çiçek	cd20abfc88	[cost] Add trace to symbols in polynomial bounds Summary: Record where each symbol in a polynomial is coming from: either a loop, function call or a modeled call. Reviewed By: mbouaziz Differential Revision: D14047420 fbshipit-source-id: 56d0bd926	6 years ago
Sungkeun Cho	ad08184d3b	[inferbo] Keep alias of simple plus/minus arithmetic Summary: It keeps alias of simple plus/minus arithmetic in order to pruning the value of "++i" expression. Reviewed By: mbouaziz Differential Revision: D14080230 fbshipit-source-id: d3af32a32	6 years ago
Mehdi Bouaziz	17fc4ca5cf	[cost] Simplify & optimize exit cost + threshold Summary: - There is no need to use AI to compute a dot product: let's just fold over all nodes, but still do it in order (using the WTO) to report at the right place - The previous version was computing a dot product on nodes for each node, which was quadratic, the new version is linear - Report only once, the first time the threshold is reached (if in a loop, report at the loop head) Reviewed By: ddino Differential Revision: D14028171 fbshipit-source-id: b4a840c6e	6 years ago
Mehdi Bouaziz	1b8927badd	[inferbo/cost] Do not produce inferbo issues on Cost and Purity analysis Reviewed By: skcho Differential Revision: D13827167 fbshipit-source-id: 734950a1e	6 years ago
Mehdi Bouaziz	5616940ec0	[inferbo] Symbols for one value Summary: For abstract values representing one concrete value, create only one symbol instead of two. Still create two symbols (lb, ub) for abstract values representing multiple concrete values (like array cells). As a consequence, comparisons of symbolic values are more precise (we can even prove equality). I expect to remove a bunch of FPs. Another consequence is the disappearance of `.lb` and `.ub` in many reports. Reviewed By: skcho Differential Revision: D13072084 fbshipit-source-id: 9bc0b9881	6 years ago
Sungkeun Cho	f9161b164f	[inferbo] On-demand heap symbol using path Summary: It materializes symbolic values of function parameters on-demand. The on-demand materialization is triggered when finding a value from an abstract memory and joining/widening abstract memories. Depends on D13294630 Main idea: * Symbolic values are on-demand-ly generated by a symbol path and its type * In order to avoid infinite generation of symbolic values, symbol paths are canonicalized by structure types and field names (which means they are abstracted to the same value). For example, in a linked list, a symbolic value `x->next->next` is canonicalized to `x->next` when the structures (`x` and `x->next`) have the same structure type and the same field name (`next`). Changes from the previous code: * `Symbol.t` does not include `id` and `pname` for distinguishing symbols. Now, all symbols are compared by `path:SymbolPath.partial` and `bound_end`. * `SymbolTable` is no longer used, which was used for generating symbolic values with new `id`s. Reviewed By: mbouaziz Differential Revision: D13294635 fbshipit-source-id: fa422f084	6 years ago
Ezgi Çiçek	6017c2ec54	[cost] Fix control variables to pick up global vars in prune instructions Reviewed By: ngorogiannis Differential Revision: D13322062 fbshipit-source-id: 4e8081103	6 years ago
Mehdi Bouaziz	5f60ffaa8f	[inferbo] Trace refactoring Reviewed By: skcho Differential Revision: D13116116 fbshipit-source-id: 0b885dcfb	6 years ago
Ezgi Çiçek	5fa89e2563	[purity] Disable clang Reviewed By: jvillard Differential Revision: D13118428 fbshipit-source-id: f4e86f286	6 years ago
Mehdi Bouaziz	fac9932168	[inferbo] Add traces to Conditions always true/false and Unreachable code Reviewed By: ezgicicek Differential Revision: D13082665 fbshipit-source-id: bb0e4cbf3	6 years ago
Mehdi Bouaziz	0ba4c2c892	[cost] Pretty-printing exponents Reviewed By: ezgicicek Differential Revision: D13050241 fbshipit-source-id: dbac027b7	6 years ago
Mehdi Bouaziz	5ed59b1655	[Inferbo/cost] Improve pretty-printing Reviewed By: skcho Differential Revision: D13045247 fbshipit-source-id: 5485b58c8	6 years ago
Sungkeun Cho	1cbcbe6fb3	[inferbo] Improve division on constant Reviewed By: mbouaziz, jvillard Differential Revision: D12921835 fbshipit-source-id: 9d0e85696	6 years ago
Jules Villard	9aa5582caa	[clang] leave markers of variable initialization for pulse Summary: When initialising a variable via semi-exotic means, the frontend loses the information that the variable was initialised. For instance, it translates: ``` struct Foo { int i; }; ... Foo s = {42}; ``` as: ``` s.i := 42 ``` This can be confusing for backends that need to know that `s` actually got initialised, eg pulse. The solution implemented here is to insert of dummy call to `__variable_initiazition`: ``` __variable_initialization(&s); s.i := 42; ``` Then checkers can recognise that this builtin function does what its name says. Reviewed By: mbouaziz Differential Revision: D12887122 fbshipit-source-id: 6e7214438	6 years ago
Sungkeun Cho	2401f6f6eb	[inferbo] Give a widening threshold of zero Reviewed By: mbouaziz Differential Revision: D12898540 fbshipit-source-id: 95bdaf4f0	6 years ago
Mehdi Bouaziz	3ee96263a7	[inferbo] Simplify and improve Itv.prune_comp Reviewed By: skcho Differential Revision: D10386789 fbshipit-source-id: f9c7e33ef	6 years ago
Sungkeun Cho	fd3f298156	[inferbo] Add narrowing Reviewed By: jvillard Differential Revision: D10334140 fbshipit-source-id: afb247866	6 years ago
Mehdi Bouaziz	3dd97cc40f	[inferbo] Use WTO abstract interpreter Reviewed By: jvillard Differential Revision: D10072723 fbshipit-source-id: aabf3605e	6 years ago
Sungkeun Cho	cd1981a567	[inferbo] Change pp of BinaryOperationCondition Summary: This diff changes pp of binary operation condition in order to avoid a `make test` failure. For the same `uint64_t` type, it is translated to `unsigned long long` in 64bit mac, but `unsigned long` in 64bit linux, which made a `make test` failure. Reviewed By: mbouaziz Differential Revision: D10459466 fbshipit-source-id: 449ab548e	6 years ago
Sungkeun Cho	fb4086c6f6	[inferbo] Add integer overflow issue type Reviewed By: mbouaziz Differential Revision: D10253878 fbshipit-source-id: 9905d7db4	6 years ago
Dino Distefano	3d07754275	Giving cost 1 to procedure with empty body Reviewed By: mbouaziz Differential Revision: D10378093 fbshipit-source-id: e6bff04da	6 years ago
Jules Villard	7615963bf4	[proc-cfg][2/5] fix duplicate symbols detection Summary: Fix the logic for computing duplicate symbols. It was broken at some point and some duplicate symbols creeped into our tests. Fix these, and add a test to avoid duplicate symbols detection to regress again. Also, this removes one use of `Cfg.load`, on the way to removing file-wide CFGs from the database. Reviewed By: ngorogiannis Differential Revision: D10173349 fbshipit-source-id: a0d2365b3	6 years ago
Jules Villard	a29e769b61	[kill -a][1/4] stop using `-a foo` in the infer repo Summary: Goal of the stack: deprecate the `--analyzer` option in favour of turning individual features on and off. This option is a mess: some of the options are now subcommands (compile, capture), others are aliases (infer and checkers), and they can all be replicated using some straightforward combination of other options. This diff: stop using `--analyzer` in tests. It's mostly `checkers` everywhere, which is already the default. `linters` becomes `--no-capture --linters-only`. `infer` is supposed to be `checkers` already. `crashcontext` is `--crashcontext-only`. Reviewed By: mbouaziz Differential Revision: D9942689 fbshipit-source-id: 048281761	6 years ago

1 2

83 Commits (581487ec612942c0c5cb56a38aa5d8a173d71866)