infer_clone

Commit Graph

Author	SHA1	Message	Date
Ezgi Çiçek	cec8cbeff2	[pre-analysis] Add models for `no_return` and handle throw-catch better Summary: - Add `no_return` models for Java's `exit(...)` methods (can be extended further later on) - handle throw-catch better by short-cutting throw nodes to not exit node but to all catch nodes that are reachable by the node. If there is no catch node, we short-cut to the exit node as before. This removes a FP from deadstore tests because before we simply were not able to handle CF from throw-> catch nodes at all. Reviewed By: skcho Differential Revision: D20769039 fbshipit-source-id: e978f6cdb	5 years ago
Daiva Naudziuniene	526af36061	[uninit] Use type information from locals if argument has type void* in function signature Summary: In an intra-procedural analysis we assume that parameters passed by reference to a function will be initialized inside that function. We use the type information of an actual parameter to initialize the fields of the struct. This does not work if a function has a parameter of type void* as the actual parameters also has type void*. To solve this issue, we use type information from local variables. Reviewed By: jvillard Differential Revision: D20670253 fbshipit-source-id: dc9f051ef	5 years ago
Ezgi Çiçek	d97e1c8fdb	[pulse][impurity] Add model for System.exit() Summary: - Model `System.exit()` as early_exit and add a test - Tweak message of methods that are impure due to having no pulse summary (and add a test) Reviewed By: skcho Differential Revision: D20668979 fbshipit-source-id: 6b5589aae	5 years ago
Jules Villard	c73feb82b4	[CLI] kill --pmd-xml option Summary: Hopefully no one uses this. This is in Python and we'd like to get rid of it. Easy enough to either re-implement if needed or to be re-implemented by a third party. Reviewed By: ngorogiannis Differential Revision: D20626344 fbshipit-source-id: 484022482	5 years ago
Ezgi Çiçek	b64ed0bbf2	[impurity] Consider functions with empty pulse summary as impure Summary: As exemplified by added tests, pulse computes an empty summary (with 0 disjuncts) whenever it discovers a contradiction which might be caused by: - discovering aliasing in memory - widening limited number of times in loops and concluding that loop exit conditions are never taken However, AFAIU, it is not possible to have a function with 0 disjunct apart from such anomalities. Even a function which does nothing like `void foo(){}` has 1 disjuncts: ``` Pulse: 1 pre/post(s) #0: PRE: { roots={ }; mem ={ }; attrs={ };} POST: { roots={ }; mem ={ }; attrs={ };} SKIPPED_CALLS: { } ``` The aim of this diff is to consider functions with 0 disjuncts as impure because most often such cases are impure, rather than actually pure. Reviewed By: skcho Differential Revision: D20619504 fbshipit-source-id: 3a8502c90	5 years ago
Ezgi Çiçek	cc815f5d20	[pulse] Only propagate existing WrittenTo attributes at function calls Summary: Previously, at each function call, we added a `WrittenTo` attribute for applying the address of the actuals. However, this results in mistakenly considering each function application that inspects its argument as impure. Instead, we should only propagate `WrittenTo` if the actuals have already `WrittenTo` attributes. For instance, for the following functions ``` public static boolean is_null(Byte a) { return a == null; } public static boolean call_is_null(Byte a) { return is_null(a); } ``` We used to get the following pulse summary for `call_is_null` (showing only one of the disjuncts): ``` #0: PRE: { roots={ &a=v1 }; mem ={ v1 -> { * -> v2 } }; attrs={ v1 -> { MustBeValid }, v2 -> { Arith =null, BoItv ([max(0, v2), min(0, v2)]) } };} POST: { roots={ &a=v1, &return=v8 }; mem ={ v1 -> { * -> v2 }, v8 -> { * -> v4 } }; attrs={ v2 -> { Arith =null, BoItv ([max(0, v2), min(0, v2)]), WrittenTo-----------WRONG }, v4 -> { Arith =1, BoItv (1), Invalid ConstantDereference(is the constant 1), WrittenTo-----------WRONG }, v8 -> { WrittenTo } };} SKIPPED_CALLS: { } ``` where we mistakenly recorded a `WrittenTo` for `v2` (what `a` points to). As a result, we considered `call_is_null` as impure :( This diff fixes that since the callee `is_null` doesn't have any `WrittenTo` attributes for its parameter `a`. So, we don't propagate `WrittenTo` and get the following summary ``` #0: PRE: { roots={ &a=v1 }; mem ={ v1 -> { * -> v2 } }; attrs={ v1 -> { MustBeValid }, v2 -> { Arith =null, BoItv ([max(0, v2), min(0, v2)]) } };} POST: { roots={ &a=v1, &return=v8 }; mem ={ v1 -> { * -> v2 }, v8 -> { * -> v4 } }; attrs={ v2 -> { Arith =null, BoItv ([max(0, v2), min(0, v2)]) }, v4 -> { Arith =1, BoItv (1), Invalid ConstantDereference(is the constant 1) }, v8 -> { WrittenTo } };} SKIPPED_CALLS: { } ``` Reviewed By: skcho Differential Revision: D20490102 fbshipit-source-id: 253d8ef64	5 years ago
Nikos Gorogiannis	d1e0375f4a	[racerd] stop reporting on locals Summary: There has never been a sufficient formal basis for soundness nor completeness of reports on locals. This diff changes the domain to effectively concern only expressions rooted at formals or globals. Reviewed By: ezgicicek Differential Revision: D19769201 fbshipit-source-id: 36ae04d8c	5 years ago
Ezgi Çiçek	a4c3925d9a	[impurity] Track unique accesses Summary: Impurity domain was tracking all changes to variables (with a list of traces that containing all write/invalid accesses). This results in having long traces with multiple access events for the same variable. For instance, ``` void swap_impure(int[] array, int i, int j) { int tmp = array[i]; array[i] = array[j]; \\ included in the trace array[j] = tmp; \\ included in the trace } ``` here we recorded both array accesses. This diff changes the domain to include accesses so that we only keep track of a single trace per access. Array accesses are only recorded once. Note that we want to record all unique accesses, not just the first one, because impurity will be used for hoisting/cost where we will invalidate impure arguments and consider all the rest as not changing. Reviewed By: jvillard Differential Revision: D20385745 fbshipit-source-id: d3647dad3	5 years ago
Ezgi Çiçek	e3c89b1f10	[impurity] Fix include_value_history Summary: D20362149 missed - to pass the optional argument `include_value_history` to the recursive call in `PulseTrace.add_to_errlog`. - to set `include_value_history=false` for skipped calls. This diff fixes these issues. Reviewed By: skcho Differential Revision: D20385604 fbshipit-source-id: 176e4d010	5 years ago
Jules Villard	e5b50d7d46	[quandaryBO] delete :( Summary: This was never quite finished and inferbo has a new way to do sort of the same thing. Reviewed By: skcho, ngorogiannis Differential Revision: D20362619 fbshipit-source-id: 7c7935d47	5 years ago
Ezgi Çiçek	b90d7c42d3	[impurity] Do not add value history in impurity traces Summary: Impurity traces are quite big due to recording values histories. Let's simplify the traces by removing pulse's value histories. Reviewed By: skcho Differential Revision: D20362149 fbshipit-source-id: 8a2a6115e	5 years ago
Mitya Lyubarskiy	bd83813b3e	[tech debt] Remove experimental nullability checkers Summary: These were not used (and were actually activated byt the same config param). They both are in experimental stage that never reached maturity. Since the team does not have immediate plans to work on ObjC nullability checker; and since "eradicate" (now known as nullsafe) is the main solution for Java, removing it is sensible. Reviewed By: jvillard Differential Revision: D20279866 fbshipit-source-id: 79e64992b	5 years ago
Sungkeun Cho	175af05b8b	[inferbo] Add model of vector resize Reviewed By: ezgicicek Differential Revision: D19942412 fbshipit-source-id: 0e3d6816e	5 years ago
Sungkeun Cho	9562ab4d68	[inferbo] Suppress integer overflow in hash functions Summary: This diff suppresses integer overflow issues in functions that includes "hash" in its name. Reviewed By: jvillard Differential Revision: D19942654 fbshipit-source-id: d86fa4f00	5 years ago
Sungkeun Cho	4311371d81	[inferbo] Revise split model semantics Reviewed By: ezgicicek Differential Revision: D19942999 fbshipit-source-id: c1d945160	5 years ago
Sungkeun Cho	22601846b2	[inferbo] Add model of emplace_back Reviewed By: ezgicicek Differential Revision: D19941739 fbshipit-source-id: b6067ad0d	5 years ago
Sungkeun Cho	b9db4fa4b9	[inferbo] Add model for C++ exception throw Reviewed By: ezgicicek Differential Revision: D19767841 fbshipit-source-id: b76e5a8f3	5 years ago
Sungkeun Cho	412e8a977c	[inferbo] Add model for std::shared_ptr<>::operator-> Summary: This diff gives semantics of `std::shared_ptr<>::operator->` as simple assignment. Reviewed By: ezgicicek Differential Revision: D19743415 fbshipit-source-id: 888444125	5 years ago
Sungkeun Cho	ca04002f6c	[inferbo] Revise finding constructors on std::make_shared Summary: When finding a proper constructor for `std::make_shared`, the given parameter types are sometimes slightly different, e.g., const int vs int. This diff loosens the condition of the types on finding constructors. Reviewed By: ngorogiannis Differential Revision: D19743198 fbshipit-source-id: f90213109	5 years ago
Sungkeun Cho	92e7aeeb3e	[infer] Fix clang frontend for switch statment Summary: This diff fixes the clang translation for switch statement. It assumed that `default:` comes always at last, which introduced some unreachable nodes inadvertently, e.g. when `default:` comes at first. Reviewed By: dulmarod Differential Revision: D19793138 fbshipit-source-id: 1e8b52c0d	5 years ago
Nikos Gorogiannis	a6da208e9d	[starvation] use access expressions instead of access paths Summary: The goals are: - Increase precision in C-languages by ditching access paths. - Help with eventually sharing the abstract address module with RacerD. - Reports are now language-mode specific (eg `->` in clang vs `.` in Java). It's not exactly access expressions used here. Instead the pattern `(base, access list)` is used where `access` is `HilExp.Access.t`. This is done to ease the way `deriving` is used for creating two comparison functions, one that cares about the root variable and one that doesn't; and also because the main function that recurses over accesses (`normalise_access_list`) visits the accesses from innermost to outermost. Also, kill some dead code. Reviewed By: skcho Differential Revision: D19741545 fbshipit-source-id: 013bf1a89	5 years ago
Sungkeun Cho	f94990a0c2	[inferbo] Add taint domain Summary: This diff adds a taint domain in Inferbo. The taint value will be used to find vulnerable array accesses in the following diffs. Reviewed By: ezgicicek Differential Revision: D19391028 fbshipit-source-id: 566b4c0fe	5 years ago
Nikos Gorogiannis	777eb33870	[racerd] use concurrent context for reporting in C++/ObjC Summary: To emulate the `ThreadSafe` contract in C++/ObjC, reporting was gated behind a check that ensured a C++/ObjC class has a `std::mutex` member (plus other filters). This is reasonable, but it has some drawbacks - other locks may be used, and therefore must be added to the member check; - locking mechanisms that use the object itself as a monitor cannot be modelled (`synchronized` in ObjC) RacerD already has `ThreadsDomain` which models our guess on whether a method is expected to run in a concurrent context, and which in C++/ObjC boils down to whether the method non-transitively acquires a lock. This should be a good enough indicator that the class should be checked regardless of whether the locks are member fields. This diff gates the C++/ObjC check on that abstract property. Reviewed By: dulmarod Differential Revision: D19558355 fbshipit-source-id: 229d7ff82	5 years ago
Sungkeun Cho	7f25684491	[inferbo] Check nested global array access Summary: This diff fixes the array access checking function for nested global arrays. We had assumed that RHS of `store` statement in SIL does not include array access expression, but that is not true: for global arrays, SIL can have statements like `*LHS = GlobalArray[n][m]`. Reviewed By: ezgicicek Differential Revision: D19300153 fbshipit-source-id: 256325642	5 years ago
Sungkeun Cho	f8ee0a14aa	[inferbo] Give semantics of std::make_shared as simple constructor Summary: This diff gives semantics of `std::make_shared` as simple constructor, i.e., it changes function call of `std::make_chared<C>(i)` to the constructor `C(i)`. Reviewed By: ngorogiannis Differential Revision: D19432338 fbshipit-source-id: 0d838e555	5 years ago
Nikos Gorogiannis	21cff2d659	[starvation] substitute arguments over parameters in locks Reviewed By: skcho Differential Revision: D19374478 fbshipit-source-id: 48f59327d	5 years ago
Ezgi Çiçek	a0fd5a0e6a	[pulse] Refactor attributes into domain Summary: Let's move attributes into Pulse's domain. Reviewed By: jvillard Differential Revision: D19533915 fbshipit-source-id: 995fd12da	5 years ago
Sungkeun Cho	c93c3163d6	[inferbo] Get global constant array values from initializers Summary: This diff gets global constant array values from their initializers. The `find_global_array` function is added to memory domain, which finds values of global array locations during the ondemand value generation. Reviewed By: ngorogiannis Differential Revision: D19300143 fbshipit-source-id: 7b0b84c42	5 years ago
Nikos Gorogiannis	279f1c85ce	[racerd] abbreviate procnames in report text Summary: If a race exists in two or more overloads of the same method and we use only the class and method name in the report text, then the current bug hashing algorithm will identify the two reports as duplicates. To avoid this, the report had the class, method and list of type parameters. This is unreadable, however, and redundant (the report is already located within the method in question). So at the risk of duplicates, use only class+method names. Also, fix a bug in `Procname.pp_simplified ~withclass` where `withclass` was ignored for C++/ObjC methods. Now: > Read/Write race. Non-private method `FrescoVitoImageSpec.onCreateInitialState(...)` indirectly reads with synchronization from `factory.AnimatedFactoryProvider.sImpl`. Potentially races with unsynchronized write in method `FrescoVitoImageSpec.onEnteredWorkingRange(...)`.@ [Litho components are required to be thread safe because of multi-threaded layout](https://fburl.com/background-layout). Reporting because current class is annotated `MountSpec`, so we assume that this method can run in parallel with other non-private methods in the class (including itself). Before > Read/Write race. Non-private method `void FrescoVitoImageSpec.onCreateInitialState(ComponentContext,StateValue,StateValue,Uri,MultiUri,ImageOptions,FrescoContext,Object,ImageListener)` indirectly reads with synchronization from `factory.AnimatedFactoryProvider.sImpl`. Potentially races with unsynchronized write in method `FrescoVitoImageSpec.onEnteredWorkingRange(...)`.@ [Litho components are required to be thread safe because of multi-threaded layout](https://fburl.com/background-layout). Reporting because current class is annotated `MountSpec`, so we assume that this method can run in parallel with other non-private methods in the class (including itself). Reviewed By: artempyanykh Differential Revision: D19462277 fbshipit-source-id: aebc20d89	5 years ago
Ezgi Çiçek	dd59a141f0	[impurity] Rely on set of skipped functions to determine impurity Summary: Currently, impurity analysis is oblivious to skipped functions which might e.g. return a non-deterministic value, write to memory or have some other side-effect. This diff fixes that by relying on Pulse's skipped functions to determine impurity. Any unknown function which is not modeled to be pure is assumed to be impure. This is a heuristic. We could have assumed them to be pure by default as well. Reviewed By: jvillard Differential Revision: D19428514 fbshipit-source-id: 82efe04f9	5 years ago
Jules Villard	3bf771bff4	[pulse] add model for std::vector<>::at() Summary: Kinda forgot to model this when `operator[]` was modelled. Reviewed By: skcho Differential Revision: D19433156 fbshipit-source-id: 49fbafc8a	5 years ago
Sungkeun Cho	1305db390a	[infer] Load global array constant Summary: This diff enables loading array contents of global constant. Reviewed By: ezgicicek Differential Revision: D19329014 fbshipit-source-id: 9dfae2754	5 years ago
Sungkeun Cho	16f18792f1	[infer] Capture global initializer ondemand Summary: This diff captures global initializers ondemand, like we do for functions defined in headers. Reviewed By: ezgicicek Differential Revision: D19346947 fbshipit-source-id: 05174e6a4	5 years ago
Ezgi Çiçek	6f64131ae6	[pulse] Do not havoc arguments of unknown functions that are pointers to const Reviewed By: skcho Differential Revision: D19331312 fbshipit-source-id: b450a819b	5 years ago
Nikos Gorogiannis	ca1ba2511b	[starvation] tests for a multi-file C++ deadlock Summary: Demonstrate that the per-file type environments don't prevent the deadlock report here. The fear was that when the analyser tries to locate the methods of the endpoint class, it might fail to do so because the types might be stored in different type environments (per file). Reviewed By: mityal Differential Revision: D19225908 fbshipit-source-id: 097e4aeea	5 years ago
Jules Villard	49fb5b7c85	[pulse] do arithmetic on pointers too Summary: A plus is a plus, no need to give up when +/- is about pointers. This gets rid of some false positives involving pointer arithmetic. However, the problem remains if we make things a bit more inter-procedural. This is documented in an added test. Reviewed By: ezgicicek Differential Revision: D18932877 fbshipit-source-id: 4ad1cfe72	5 years ago
Nikos Gorogiannis	ce39017611	[typ][fieldname] make java representation more sharing friendly and typesafe Summary: The `Typ.FIeldname` module has many issues. Among those: - It has 5 different string/printing functions and most of them do radically different things in Java and in Clang. - There is no type safety: creating a Clang field and calling a Java function on it will lead to a crash (`rindex_exn` etc, there are usually no dots in Clang fields). - It uses a single string for Java fields, containing the package, the class and the field, e.g., `java.lang.Object.field`. This is wasteful, because - there is no sharing of strings for packages/classes, and, - string operations need to be performed every time we need the field or the class or the package alone. This diff preserves the behaviour of the module's interface, so the API problems remain. However, by using a saner representation for Java fields we can get small performance and large memory gains (the type environment in Java is much smaller, about 30-40%). In addition, many functions on clang fields would previously do string manipulations (look for `.` and split on it) before returning the final field unchanged -- now they use the type of the field for that. Reviewed By: jvillard Differential Revision: D18908864 fbshipit-source-id: a72d847cc	5 years ago
Jules Villard	e06a43a677	[pulsebo] use inferbo more in summaries Summary: - Do most of the work of `solve_arithmetic_constraints` inside `subst_attribute` instead, since we need to re-use the latter function for post-conditions where the first function is not appropriate. - When substituting arithmetic constraints, we refine arithmetic information (both concrete intervals and inferbo), which can lead to inconsistent states. Instead of recording the new arithmetic facts by returning a new current state, just act as a map on attributes. This is to enable doing the point above. - All this lead to a somewhat messy refactoring... - Rename `CannotApplyPre` to `Contradiction` since it's used for post-conditions as well now Reviewed By: skcho Differential Revision: D18889120 fbshipit-source-id: d81647143	5 years ago
Jules Villard	a42e15147b	[pulse] fix test for by-ref automatic initialisation Summary: Pointers are hard... The previous test had no chance of doing initialisation of the pointer by reference and was in fact a false negative (and still is, fix incoming). Renamed functions to stress the false negative and added a test that is really (potentially) doing pointer initialisation by reference. Reviewed By: skcho Differential Revision: D18888008 fbshipit-source-id: 1e72408c7	5 years ago
Jules Villard	eb52b28f91	[pulsebo] use inferbo in prunes Summary: Finally use information from the inferbo intervals in pulse's domain to make decisions about whether conditionals are feasible or not. Reviewed By: skcho Differential Revision: D18811193 fbshipit-source-id: d80a28657	5 years ago
Martin Trojer	fbe1c92c17	clang9 Reviewed By: jvillard Differential Revision: D18748966 fbshipit-source-id: 23aa4574f	5 years ago
Sungkeun Cho	ab7c61b836	[inferbo] Extend bound domain to express multiplication of bounds Summary: This diff extends the bound domain to express multiplication of bounds in some simple cases. Reviewed By: ezgicicek Differential Revision: D18745246 fbshipit-source-id: 4f2dcb42c	5 years ago
Jules Villard	df49f318f6	[pulse] havoc formals passed by reference to unknown procedures Summary: This gets rid of false positives when something invalid (eg null) is passed by reference to an initialisation function. Havoc'ing what the contents of the pointer to results in being optimistic about said contents in the future. Also surprisingly gets rid of some FNs (which means it can also introduce FPs) in the `std::atomic` tests because a path condition becomes feasible with havoc'ing. There's a slight refinement possible where we don't havoc pointers to const but that's more involved and left as future work. Reviewed By: skcho Differential Revision: D18726203 fbshipit-source-id: 264b5daeb	5 years ago
Jules Villard	32f60f3d3c	[pulse] model the fact `free(0)` is a no-op Summary: It's a well-known fact that pulse should know too. To avoid splitting the abstract state systematically, only act if we know the pointer is exactly 0 to avoid reporting a nullptr dereference on `free(x)`. Reviewed By: ezgicicek Differential Revision: D18708575 fbshipit-source-id: 1cc3f6908	5 years ago
Jules Villard	3fbefbad34	[pulse] model some of `std::atomic` Summary: Turns out code uses atomics in important places, modelling it removes FPs. The tests are copied from biabduction and adapted and extended a bit. I didn't implement compare_exchange primitives for now (plus, giving them a sequential semantics like in biabduction is probably a bit cheeky). Reviewed By: skcho Differential Revision: D18708576 fbshipit-source-id: a3581b8a4	5 years ago
Sungkeun Cho	61ae040077	[pulse] Add bo_itv to pulse attributes Summary: This diff adds inferbo's interval values to pulse's attributes. The added values will be used to filter out infeasible passes in the following diffs. Reviewed By: jvillard Differential Revision: D18726667 fbshipit-source-id: c1125ac6e	5 years ago
Jules Villard	d79bd90b81	[pdesc] new pre-analysis to diverge after "noreturn" function calls Summary: A plugin update allows infer to know when a function doesn't return according to its attributes. This propagates this info all the way to the attributes of each function, and then use this information in a new pre-analysis that cuts the links to successor nodes of each `Call` instruction to a function that does not return. NOTE: The "no_return" `CallFlag.t` was dead code, following diffs deal with that (by removing it). Reviewed By: dulmarod Differential Revision: D18573922 fbshipit-source-id: 85ec64eca	5 years ago
Jules Villard	78a33acb77	[cfg] run pre-analysis lazily in ondemand Summary: This also prints the CFGs after pre-analysis for individual procedures in infer-out/captured/<filename>/<proc>.dot. One can also look up the CFGs before pre-analysis in infer-out/captured/proc_cfgs_frontend.dot. Context: I want to add a pre-analysis that needs to look at proc attributes inter-procedurally. For this to make sense it has to happen after all of capture, and before analysis. Thus, this diff brings back the lazy running of the pre-analysis like in D15803492, except that we still make sure to run the pre-analyses systematically regardless of the checkers being run by running the pre-analysis from ondemand.ml. Also we don't need to re-introduce the "did_preanalysis" proc attribute for the same reason that the pre-analysis is now run once and for all by ondemand.ml (instead of each individual checker back in the days). This has the benefit of running the pre-analysis only when needed, and the drawback that several concurrent processes analysing the same proc descs will duplicate work. Since pre-analyses are supposed to be very fast I assume that neither is a big deal. If they become more expensive then the benefit gets bigger and the drawback is just the same as with regular analyses. Reviewed By: skcho Differential Revision: D18573920 fbshipit-source-id: de350eaef	5 years ago
Jules Villard	b03ca78bf3	[pdesc][refactor] ability to set normal and exceptional succs independently Summary: - more flexible API - less error-prone thanks to named parameters - also takes care of adjusting predecessors of the previous successors! This fixes some (probably harmless) bugs in the frontends. Reviewed By: dulmarod Differential Revision: D18573923 fbshipit-source-id: ad97b3607	5 years ago
Jules Villard	f81c9d56e3	[pulse] arithmetic operations Summary: Model +/- when we know the concrete interval for a value. Reviewed By: skcho Differential Revision: D18528535 fbshipit-source-id: 7c67a7a54	5 years ago

1 2 3 4 5 ...

885 Commits (1e25cf2168c9ffe34afea5cb97f55ce192d80149)