infer_clone

Commit Graph

Author	SHA1	Message	Date
Dulma Churchill	0322e17e72	[IR] print only the passed_as_noescape_block_to attribute if it's set Reviewed By: jvillard Differential Revision: D20247386 fbshipit-source-id: b9299f0cd	5 years ago
Sungkeun Cho	9dbc3981cc	[infer] Add LRU hash table Reviewed By: ngorogiannis Differential Revision: D20199999 fbshipit-source-id: 93d26b822	5 years ago
Artem Pianykh	19093a2fa0	@update-submodule: facebook-clang-plugins Fix handling of non-literal `offsetof` expressions Summary: Update handling of `OffsetOfExpr` based on the new type definition from updated version of clang-plugin. Together with the change to clang-plugin, this essentially fixes hard crash while analysing C/C++ files with non-literal `offsetof` expression. Fixes GH issues [#1178](https://github.com/facebook/infer/issues/1178), [#1212](https://github.com/facebook/infer/issues/1212) Reviewed By: jvillard Differential Revision: D20159173 fbshipit-source-id: 65fc228a4	5 years ago
Jules Villard	2047f4c535	[preanal] inlining synthetic methods as a pre-analysis Summary: This is so that all pre-analyses are together instead of spread across several modules. PS: this function is the worst. Reviewed By: ngorogiannis Differential Revision: D19973285 fbshipit-source-id: b326e99cd	5 years ago
Mitya Lyubarskiy	73e78d9e20	[orchestration][refactoring] Introduce stronger contract for file-level callback Summary: # Current design Infer analysis is currently two staged: 1) proc-level callbacks calculate summary, including writing down the issues if applicable. 2) file-level callbacks (formerly cluster callbacks, see the prev diff) are executed next; they are supposed to emit additional issues that are impossible to emit based on mere proc-context. Currently RacerD and Starvation use file-level callback; in near future we plan to onboard Nullsafe checker as well. # Problem Contract of callback (1) is clear: given a proc and existing summary, the checker updates it and returns a modified summary. This summary later on gets serialized (in-memory + external) and can be consumed by other chechers. Issues written in summary will get reported when analysis is over. In constrast, contract of (2) is wild west: the function returns unit. In practice, what the checkers do is create IssueLog and serialize it to checker-specific directory. Then another part of program (InferPrint.ml) knows about this side effect, reads the error log for checkers and ultimately get it reported together with errors written at stage (1). This is problematic because it is hard to reason about the system and it makes onboarding new checkers to (2) error-prone. # This diff This diff brings (2) on par with (1): now file-level callback has a clear contract: it should be side effect free, and the only responsibility is to fill out and return IssueLog. Additionally, we make the notion of "checker-specific issue directory" an official thing, so the checker only needs to specify the name, everything else will be made automatically by orchestation layer, including cleanup. # Starvation Implementing the new contract is starvation is possible and desirable, but involved: see comment in the code, so we leave it up to the future work to fix that. Reviewed By: ngorogiannis Differential Revision: D20115024 fbshipit-source-id: fb2f9b7e6	5 years ago
Nikos Gorogiannis	e01311c431	[scheduler][callgraph] load graph directly from DB Summary: Currently the call graph of all captured procedures is loaded and then traversed to flag reachable procedures from modified files, followed by deleting the unflagged part, and unflagging the rest. This is a bit wasteful, and doesn't lend itself nicely to constructing directly the reverse call graph, which further diffs will do. This diff loads all captured procedures and callees in a hashconsed table, and performs a BFS from procedures in modified files, to build the call graph in one pass. Reviewed By: fgasperij Differential Revision: D19888965 fbshipit-source-id: eeb59356e	5 years ago
Mitya Lyubarskiy	d94b365b65	Add documentation and better naming around checker callbacks Summary: 1. Some invariants are tricky enough to be documented. This is especially important for cases related with error reporting. Lets document it. 2. Cluster callback -> File callback rename. Reviewed By: ngorogiannis Differential Revision: D20093932 fbshipit-source-id: e716f1f5b	5 years ago
Artem Pianykh	2572819a5b	[nullsafe] Directly model nullability of values from third-party code Summary: We need to be able to differentiate `UncheckedNonnull`s in internal vs third-party code. Previously, those were under one `UncheckedNonnull` nullability which led to hacks for optmistic third-party parameter checks in `eradicateChecks.ml` and lack of third-party enforcement in `Nullsafe(LOCAL, trust=all)` mode (i.e. we want to trust internal unchecked code, but don't want to trust unvetted third-party). Now such values are properly modelled and can be accounted for regularly within rules. Also, various whitelists are refactored using `Nullability.is_considered_nonnull ~nullsafe_mode nullability`. `ErrorRenderingUtils` became a tad more convoluted, but oh well, one step at a time. Reviewed By: mityal Differential Revision: D19977086 fbshipit-source-id: 8337a47b9	5 years ago
Artem Pianykh	b50f13eb18	[nullsafe] Support Nullsafe(Local, trust=all/none) mode Summary: Add support for nullsafe mode with `trust=all` and `trust=none` a case with a specific trust list is not supported yet and needs to be implemented separately. Tests introduce one unexpected `ERADICATE_INCONSISTENT_SUBCLASS_PARAMETER_ANNOTATION` issue which complains about `this` having incorrect nullability; it is a bug and needs to be fixed separately. Reviewed By: mityal Differential Revision: D19662708 fbshipit-source-id: 3bc1e3952	5 years ago
Nikos Gorogiannis	43b3ef60f8	[db][procname] kill dead optional argument Summary: No callers use optional argument. Reviewed By: ezgicicek Differential Revision: D20002257 fbshipit-source-id: e377db4fd	5 years ago
Ezgi Çiçek	ec950666a0	[infer] rely on driving for Procdesc.Node.equal_id Summary: Rather than explicitly defining `equal_id`, let's rely on deriving. Reviewed By: jvillard Differential Revision: D19977777 fbshipit-source-id: 3729b7f69	5 years ago
Nikos Gorogiannis	281385203f	[biabduction] kill guarded by check Summary: RacerD now has a check for this, so remove. Reviewed By: jberdine Differential Revision: D19974163 fbshipit-source-id: 06d7a203d	5 years ago
Nikos Gorogiannis	c10c7a39a6	[java] use a package/classname record for java classes instead of string Summary: Use a record of package, class name to store (qualified) Java class names. This saves the round trip of concatenating then splitting again, etc, as well as saves some memory in the type environment as now the package paths can be shared across classes of the same package (about 10% in tests). Also remove some unfortunate APIs. Reviewed By: jvillard Differential Revision: D19969325 fbshipit-source-id: f7b7f5a55	5 years ago
David Pichardie	64289cde4d	[Java frontend]Javalib's lambda rewritting is making his way through Infer Reviewed By: ngorogiannis Differential Revision: D19970219 fbshipit-source-id: b14bb36a4	5 years ago
Nikos Gorogiannis	ace23a1670	[java] use plain strings instead of mangled for JavaClassName Summary: The way `Mangled.t` is used in `JavaClassName` means that it's always a plain string (we never have a "mangled" part). Remove the indirection and extra allocation. Also, simplify the API by throwing away one function that was used just once and wastefully. Reviewed By: artempyanykh Differential Revision: D19950672 fbshipit-source-id: b61fcba6e	5 years ago
Dulma Churchill	00c52a52c2	[Infer] Dedup the reporting of Captured StrongSelf Summary: For each variable that we identify as a captured strong self, we want to report only the first occurrence. Reviewed By: skcho Differential Revision: D19940031 fbshipit-source-id: f38f642c9	5 years ago
Artem Pianykh	44f41d2929	[infer] Extend annotation framework to handle wider variety of param types Summary: Previous implementation supported only stringy params (strings and stringified bools). Current one exposes a proper variant `Annot.t`, with support for all possible param values in Java except numbers (more on that below). This change is required for implementing `Nullsafe(LOCAL)` as the annotation used to specify nullsafe behaviour has a more complex structure than what we've dealt with before. Why support for number values was not added: supporting numbers requires using `int64`. Unfortunately, adding another variant `Vnum int64` to `Annot.t` causes a runtime failure on assert in `MaximumSharing.ml:133`. It seems that it might be enough to flip `fail_on_nonstring` from `true` to `false`, but since this would require additional testing and is not required for my case, I'll leave checking this to whoever needs to use numeric annot params in future. Reviewed By: ezgicicek Differential Revision: D19855923 fbshipit-source-id: 878e33856	5 years ago
Jules Villard	2c5a297636	@allow-large-files [ocaml] upgrade core to v13 Summary: More newer = more better. This flips the Not_found -> Not_found_s switch, and forbids a bunch more polymorphic comparisons (mostly turned into `int` comparisons for convenience). Earlier diffs prepare for this so this diff is only about breaking changes in the API, of which there are only a few. Reviewed By: jberdine Differential Revision: D19861583 fbshipit-source-id: fe54ce8f0	5 years ago
Jules Villard	66361961b6	[ocaml] more Not_found_s Summary: Core v13 APIs stopped raising `Not_found` and instead raise `Not_found_s`, which wreaks havoc in our codebase. Carefully inspect each `Not_found` and add `Not_found_s` where needed (that way it's compatible with both Core v12 and v13 for now). Reviewed By: jberdine Differential Revision: D19861585 fbshipit-source-id: 9a5361ae9	5 years ago
Jules Villard	a684a1edf0	[ocaml] preparations for core v13 Summary: The big one: - stop using polymorphic `<>`, `<`, `>`, .. - add `<>` to `PolyVariantEqual` escape hatch now that `<>` is as taboo as `=` - Interestingly, there were a lot of uses of `Z.(x < y)`, which although they seem to use `Z.lt` actually used polymorphic comparison. The actual comparison infix operators of `Z` are cleverly hidden in `Z.Compare` instead, which makes them impractical to use... Reviewed By: jberdine Differential Revision: D19861584 fbshipit-source-id: 5dce08ad9	5 years ago
Sungkeun Cho	ca04002f6c	[inferbo] Revise finding constructors on std::make_shared Summary: When finding a proper constructor for `std::make_shared`, the given parameter types are sometimes slightly different, e.g., const int vs int. This diff loosens the condition of the types on finding constructors. Reviewed By: ngorogiannis Differential Revision: D19743198 fbshipit-source-id: f90213109	5 years ago
Dulma Churchill	a864823f38	[SelfInBlock] Fix a bug in exps_of_instr where some expressions were missed. Summary: This fixes false positives in the SelfInBlock checker. Reviewed By: jvillard Differential Revision: D19789906 fbshipit-source-id: 82d4da346	5 years ago
Dulma Churchill	682f8c5355	[SelfInBlock] Add the procname to the is_no_escape_block flag to improve the error message of the weakSelf In Noescape block check Summary: After looking at some reports with blocks inside blocks, it seemed more obvious that adding which method we are talking about makes more clear which block we are talking about. Reviewed By: mityal Differential Revision: D19789285 fbshipit-source-id: 20e0e6804	5 years ago
Nikos Gorogiannis	a6da208e9d	[starvation] use access expressions instead of access paths Summary: The goals are: - Increase precision in C-languages by ditching access paths. - Help with eventually sharing the abstract address module with RacerD. - Reports are now language-mode specific (eg `->` in clang vs `.` in Java). It's not exactly access expressions used here. Instead the pattern `(base, access list)` is used where `access` is `HilExp.Access.t`. This is done to ease the way `deriving` is used for creating two comparison functions, one that cares about the root variable and one that doesn't; and also because the main function that recurses over accesses (`normalise_access_list`) visits the accesses from innermost to outermost. Also, kill some dead code. Reviewed By: skcho Differential Revision: D19741545 fbshipit-source-id: 013bf1a89	5 years ago
Sungkeun Cho	5510223850	[infer] Get rid of the is_cpp_nothrow field and is_cpp_noexcept_method Summary: This diff removes a dead field, `is_cpp_nothrow` and `is_cpp_noexcept_method`. Reviewed By: jvillard Differential Revision: D19489417 fbshipit-source-id: 971a7f533	5 years ago
Jules Villard	81e3dc5069	[HIL] do not crash on confusing function call expressions Summary: When the expression resolving to a function to be called could not be translated to either a proc name or at least an access path, the HIL translation code would crash. However, this is perfectly possible. Moreover, no one actually uses the payload of the `Indirect` datatype so no complication arises from generilising its type to `HilExp.t`, as done in this diff. Reviewed By: ngorogiannis Differential Revision: D19691127 fbshipit-source-id: 4c0400ab7	5 years ago
Sungkeun Cho	f8ee0a14aa	[inferbo] Give semantics of std::make_shared as simple constructor Summary: This diff gives semantics of `std::make_shared` as simple constructor, i.e., it changes function call of `std::make_chared<C>(i)` to the constructor `C(i)`. Reviewed By: ngorogiannis Differential Revision: D19432338 fbshipit-source-id: 0d838e555	5 years ago
Dulma Churchill	05ea5ec844	[clang frontend] Add support for the clang attribute NS_NOESCAPE for Objective-C blocks in Sil Summary: This attribute is given to parameters of methods that take Objective-C blocks to show that they will be used only in the current context and won't "escape" the context. We translate it here, with the goal to use it in a new check later. The check is about not using weakSelf in non-escaping blocks, because retain cycles are not possible. The translation is a bit complex because the annotation comes in the parameter of a method, but in the checker we will need it in the block. So we pass it around in the frontend from the translation of the method call to the translation context and on to the block expression and the block declaration afterwards. Reviewed By: ngorogiannis Differential Revision: D19600377 fbshipit-source-id: dd49539bd	5 years ago
Ezgi Çiçek	43a99745b6	[infer] Get rid of verbose Typ.mk Tvoid Summary: Instead use `Typ.void` which does the same thing Reviewed By: ngorogiannis Differential Revision: D19640598 fbshipit-source-id: 01ff1f6a0	5 years ago
Nikos Gorogiannis	07e91cabf7	[starvation] no inner class normalisation for java Summary: The "access path" memory model (equal access paths iff equal object addresses) is suited to when aliasing occurs only at the roots (i.e. variables). When there is intentional aliasing in the middle of an access path, this model will miss the aliasing. For instance if `[x.f] == [y.g]`, then also `[x.f.h] == [y.g.h]`, but the latter access paths are unequal. In Java, non-static inner classes consistently alias `this.this$0` inside an inner class, which points to the "parent" outer-class object. So if two inner-class objects (belonging to different inner classes) access `this(type:InnerClassA).this$0.f` and `this(type:InnerClassB).this$0.f` the equality will be missed (many other combinations exist). This isn't strictly due to the memory model -- any alias analysis would have to do some class invariant inference to detect this. For this purpose `AccessPath.inner_class_normalize` exists (it replaces `this.this$0` with `this` of the appropriate type), but this breaks the invariant that we know which formal parameter is at the root (there may not even exist a `this` parameter if the method is static). So this was buggy. Here we simply recursively remove the synthetic field prefix of the accesses list, while computing forwards the object type. This is only applied when we check aliasing across threads. This will also allow actuals/parameters substitutions (stacked diff) which normalisation was breaking. Reviewed By: jberdine Differential Revision: D19601455 fbshipit-source-id: 7e42667b6	5 years ago
Nikos Gorogiannis	72a7a0eaab	[racerd] use typenames instead of strings in class map Summary: Keep the type name of the class as the key in the map constructed from class names to their methods in a file. This will be used later, and also why string? Reviewed By: dulmarod Differential Revision: D19557707 fbshipit-source-id: aa8569581	5 years ago
Sungkeun Cho	c93c3163d6	[inferbo] Get global constant array values from initializers Summary: This diff gets global constant array values from their initializers. The `find_global_array` function is added to memory domain, which finds values of global array locations during the ondemand value generation. Reviewed By: ngorogiannis Differential Revision: D19300143 fbshipit-source-id: 7b0b84c42	5 years ago
Nikos Gorogiannis	279f1c85ce	[racerd] abbreviate procnames in report text Summary: If a race exists in two or more overloads of the same method and we use only the class and method name in the report text, then the current bug hashing algorithm will identify the two reports as duplicates. To avoid this, the report had the class, method and list of type parameters. This is unreadable, however, and redundant (the report is already located within the method in question). So at the risk of duplicates, use only class+method names. Also, fix a bug in `Procname.pp_simplified ~withclass` where `withclass` was ignored for C++/ObjC methods. Now: > Read/Write race. Non-private method `FrescoVitoImageSpec.onCreateInitialState(...)` indirectly reads with synchronization from `factory.AnimatedFactoryProvider.sImpl`. Potentially races with unsynchronized write in method `FrescoVitoImageSpec.onEnteredWorkingRange(...)`.@ [Litho components are required to be thread safe because of multi-threaded layout](https://fburl.com/background-layout). Reporting because current class is annotated `MountSpec`, so we assume that this method can run in parallel with other non-private methods in the class (including itself). Before > Read/Write race. Non-private method `void FrescoVitoImageSpec.onCreateInitialState(ComponentContext,StateValue,StateValue,Uri,MultiUri,ImageOptions,FrescoContext,Object,ImageListener)` indirectly reads with synchronization from `factory.AnimatedFactoryProvider.sImpl`. Potentially races with unsynchronized write in method `FrescoVitoImageSpec.onEnteredWorkingRange(...)`.@ [Litho components are required to be thread safe because of multi-threaded layout](https://fburl.com/background-layout). Reporting because current class is annotated `MountSpec`, so we assume that this method can run in parallel with other non-private methods in the class (including itself). Reviewed By: artempyanykh Differential Revision: D19462277 fbshipit-source-id: aebc20d89	5 years ago
Ezgi Çiçek	5b86031798	[frontend] Move clang constants to Procname Reviewed By: ngorogiannis, jvillard Differential Revision: D19429744 fbshipit-source-id: f7e5bc41b	5 years ago
Artem Pianykh	592c746e6b	[java] Make override resolution consider parameter types Summary: Previously, _override resolution_ considered only the number of arguments. This led to many FPs in nullsafe's _Inconsistent Subclass Annotation_ check. Current version also checks that argument types match. However, we still don't handle type parameters and erasure, so in this sense the rules are incomplete. Reviewed By: ngorogiannis, mityal Differential Revision: D19393201 fbshipit-source-id: a0c75b8dd	5 years ago
Nikos Gorogiannis	a79a819679	[typ][javaclass] abstract typename Summary: The type-name definition for Java can be potentially improved (eg increase sharing, or comparison speed, much like `QualifiedCppName`) by switching away from `Mangled.t` which is essentially a string. First step is to abstract the type. Reviewed By: jberdine Differential Revision: D19087508 fbshipit-source-id: 91a81f63b	5 years ago
Nikos Gorogiannis	b8d51b0493	[starvation] use root component in lock order Summary: Now that we have the kind of lock stored (global/class obj/path rooted at parameter), use it for comparison/equality, while ignoring the root variable of the access path, which is only used for printing. Reviewed By: skcho Differential Revision: D19346801 fbshipit-source-id: c65661dc6	5 years ago
Sungkeun Cho	4ddf46268f	[infer] Create missing result directories Summary: This diff creates missing result directories in its running. The problem was that `infer-out/captured` and its sub-directories were not ready at the step 3 below, which crashed with exceptions. 1. `infer capture -- [target build]` 2. `infer analyze --merge` 3. `infer analyze --merge --debug --reanalyze --procedures-filter '.foo.'` Reviewed By: ngorogiannis Differential Revision: D19274672 fbshipit-source-id: af84000d7	5 years ago
Nikos Gorogiannis	91fa6a5404	[typ] extract Procname from Typ Summary: No reason for this to be in Typ Reviewed By: skcho Differential Revision: D19162727 fbshipit-source-id: d6940637a	5 years ago
Nikos Gorogiannis	33352623a5	[typ] extract Fieldname from Typ Summary: There is no reason to have this in Typ. Reviewed By: skcho Differential Revision: D19161946 fbshipit-source-id: 7d9b4f249	5 years ago
Nikos Gorogiannis	cef051dd1a	[typ] extract Struct module Summary: There is no reason for this to be in Typ. Reviewed By: ezgicicek Differential Revision: D19161751 fbshipit-source-id: de33f5fa1	5 years ago
Jules Villard	65d0d18326	[SIL] splitting off biabd stuff from SIL Summary: Move most of IR/Sil.ml into a new file biabduction/Predicates.ml to reflect the fact that they are only useful for the biabduction analysis. Unfortunately this is a huge change. I tried to keep the change to a minimum, it's mostly about doing s/Sil/Predicates/ in lots of places but sometimes I used the trick of specifying parameters or return value types to avoid specifying the module altogether. This isn't done consistently because there were just too many places to change for poor me. Reviewed By: ngorogiannis Differential Revision: D19158530 fbshipit-source-id: d6dbcfe72	5 years ago
Jules Villard	bc799fc6cd	[IR] `PredSymb.dangling_kind option` can be replaced by `bool` Summary: This one is just for fun. Reviewed By: ngorogiannis Differential Revision: D19158529 fbshipit-source-id: 2ccda60ca	5 years ago
Jules Villard	6c988160c1	[IR] kill unused `Sil.hpred` payload Summary: hmmhmmhmm Also `Sil.hpred` is going to move out of IR/ in a few diffs, into biabduction/ where it belongs. Reviewed By: ngorogiannis Differential Revision: D19158535 fbshipit-source-id: e2a889ee2	5 years ago
Jules Villard	0cab96b43e	[SIL] move some stuff to Pvar Summary: Sil.ml contained utility that belong in Pvar.ml Reviewed By: ngorogiannis Differential Revision: D19158532 fbshipit-source-id: 94772baba	5 years ago
Jules Villard	30b74413a5	[SIL] move some printing stuff to Exp Summary: Sil.ml contained utility that belong in Exp.ml. Reviewed By: ngorogiannis Differential Revision: D19158533 fbshipit-source-id: 364c3f350	5 years ago
Jules Villard	a6c8e7c98e	[pp] move utility function from Sil to Pp Summary: Part of making Sil.ml about SIL only. In order to not introduce a dependency istd/Pp -> base/Config, the utilities in Pp don't know when to introduce "diff" colours. Fix it by wrapping them in Sil using the Config option. (we may want to just kill that option at some point). Similarly, move stuff from Io_infer to Pp. Reviewed By: ngorogiannis Differential Revision: D19158534 fbshipit-source-id: 8110cb7f9	5 years ago
Radu Grigore	7bfef217de	[biabduction] Simplify postconditions after re-execution. Summary: This applies some simplifications that were previously done after footprint (and therefore lost), and some simplifications that require looking at both pre and post. Reviewed By: ngorogiannis Differential Revision: D19035494 fbshipit-source-id: bad79534a	5 years ago
Mitya Lyubarskiy	9285c51dfa	[nullsafe] Enum values can be used as non-null without strictification Summary: According to Java semantics, they are always non-null. Internally they are represented as static fields, so they have DeclaredNonnull nullability, which means NullsafeStrict mode would refuse to use them without strictification. Lets teach nullsafe that these guys are non-nullables. See also FN in test case. Reviewed By: ngorogiannis Differential Revision: D19024547 fbshipit-source-id: 8c120fa50	5 years ago
Nikos Gorogiannis	e42bd8cd6c	[typ][fieldname] further reduce and improve interface Summary: - Remove `to_flat_string` as there is `get_field_name` that unambiguously does the same thing. - Make `pp` print only the field in all languages. - Fix `to_full_string` so that it has unified behaviour across java/clang and so that it doesn't print `class Foo.x`, but rather `Foo.x`. Reviewed By: ezgicicek Differential Revision: D18963033 fbshipit-source-id: e2c803c7d	5 years ago
Nikos Gorogiannis	59a95b316c	[typ][fieldname] simplify and streamline interface Summary: Remove Clang and Java submodules of Typ.Fieldname. They are unnecessary and they reflect a fake dichotomy: there is only one fieldname type. To distinguish between fields of Java classes and other C constructs, there is a helper function provided, but the idea is simple: obtain the class type the field belongs to, and check if it's a Java class. This diff still preserves behaviour, but removes as many functions as possible from the interface, to leave a small surface. Reviewed By: mityal Differential Revision: D18962423 fbshipit-source-id: ffe6933ee	5 years ago
Nikos Gorogiannis	c45b55bff1	[typ][fieldname] unify clang and java fieldname types Summary: Unify treatment of Java and Clang fieldnames. Now a field is a struct with a class type-name and a string-field name. This diff is still behaviour preserving. Reviewed By: jvillard Differential Revision: D18953549 fbshipit-source-id: 8cae0d104	5 years ago
Nikos Gorogiannis	2c44035297	[typ][fieldname] eliminate uses of Java.from_string Summary: This function allows any string, and in particular empty class names. As a first step eliminate it in favour of a function that forces the caller to specify distinct class and field names. It turns out that the frontend already has them, so it saves effort along the way. Reviewed By: jvillard Differential Revision: D18953136 fbshipit-source-id: ff3cdfda5	5 years ago
Sungkeun Cho	bc5f740945	[infer] make deadcode is back Reviewed By: jvillard Differential Revision: D18957045 fbshipit-source-id: a6db07309	5 years ago
Sungkeun Cho	1f64acf3de	[litho] Moved is_build_called and added is_return_called Summary: In order to handle the example added: changed domain of `MethodCalled` from `CreatedLocation -> (IsBuildCalled X IsChecked X Set(MethodCall))` to `(CreatedLocation X IsBuildCalled) -> (IsChecked X Set(MethodCall))` This avoids joining of two method calls where one is build-called and the other is not, e.g., ``` if(b) { o.build(); } else { // no build call } ``` changed domain of `NewDomain` from `Created X MethodCalled` to `(Created X MethodCalled) X (Created X MethodCalled)` One is for no returned memory and the other is returned memory. This keeps precision some join points of branches, e.g., ``` if(b) { return; } else { // no return } ``` Reviewed By: ezgicicek Differential Revision: D18909768 fbshipit-source-id: c39d1a1ef	5 years ago
Nikos Gorogiannis	ce39017611	[typ][fieldname] make java representation more sharing friendly and typesafe Summary: The `Typ.FIeldname` module has many issues. Among those: - It has 5 different string/printing functions and most of them do radically different things in Java and in Clang. - There is no type safety: creating a Clang field and calling a Java function on it will lead to a crash (`rindex_exn` etc, there are usually no dots in Clang fields). - It uses a single string for Java fields, containing the package, the class and the field, e.g., `java.lang.Object.field`. This is wasteful, because - there is no sharing of strings for packages/classes, and, - string operations need to be performed every time we need the field or the class or the package alone. This diff preserves the behaviour of the module's interface, so the API problems remain. However, by using a saner representation for Java fields we can get small performance and large memory gains (the type environment in Java is much smaller, about 30-40%). In addition, many functions on clang fields would previously do string manipulations (look for `.` and split on it) before returning the final field unchanged -- now they use the type of the field for that. Reviewed By: jvillard Differential Revision: D18908864 fbshipit-source-id: a72d847cc	5 years ago
Jules Villard	1bde1ef0f0	[pulse] use inferbo's prune in `PRUNE` nodes Summary: After passing a `PRUNE` instruction we can refine the current inferbo intervals for the values involved. Reviewed By: ezgicicek Differential Revision: D18889103 fbshipit-source-id: b521046aa	5 years ago
Sungkeun Cho	2835468df9	[litho] Add substitution at function calls Summary: This diff adds a substitution at function calls. Reviewed By: ezgicicek Differential Revision: D18878045 fbshipit-source-id: e081d1500	5 years ago
Jules Villard	17bef4bd31	[SIL][trivial] rename `text` -> `to_string` and delete useless comments Summary: shrugcity Reviewed By: dulmarod Differential Revision: D18888788 fbshipit-source-id: 41851b3ee	5 years ago
Josh Berdine	3c6e2469de	[ocamlformat] Enable parsing and reformatting docstrings Summary: This diff enables parsing and auto-formatting documentation comments (aka docstrings). I have looked at this entire diff and manually made some changes to improve the formatting. In some cases it looked like it would take too much time, or benefit from someone more familiar with the code doing it, and I instead disabled auto-formatting docstrings in those files. Also, there are some source files where the docstrings are invalid, and some where the structure detected by the parser appears not to match what was intended. Auto-formatting has been disabled for these files. Reviewed By: ezgicicek Differential Revision: D18755888 fbshipit-source-id: 68d72465d	5 years ago
Nikos Gorogiannis	aef34d8384	[starvation][whole-program] analyze constructors for initial attribute state Summary: A current blind spot is when object construction stores specific executors / runnables to object fields, which are then never mutated and accessed from normal methods. IOW the attributes established in the constructor are necessary to report properly inside a normal method (assuming these attributes are not invalidated by method code). To achieve this, first we retain a subset of the final state attributes in the summary (only those that affect instance variables, in constructor methods). Then, when we analyse a non-constructor method: - we analyse all constructors - remove all attributes from the attribute map whose key is not an expression of the form `this.x. ...` - re-localise all remaining keys so that they appear as rooted in the `this` local variable of the current procedure - join (intersect) all such attribute maps - use the result in place of initial state as far as the attribute map is concerned for the analysis of the current procedure, which can now start. This means we can catch idioms that use side-effectful initialisation for configuring certain object fields like executors or runnables. Reviewed By: jvillard Differential Revision: D18707890 fbshipit-source-id: 42ac6108f	5 years ago
Ezgi Çiçek	fb56f42716	[infer] Rename value to arg_payload in ProcnameDispatcher.Call.FuncArg Reviewed By: jvillard Differential Revision: D18707511 fbshipit-source-id: 160a02e07	5 years ago
Ezgi Çiçek	eb8c8af117	[pulse] Move models to ProcnameDispatcher style Summary: Rather than repeatedly matching actuals, let's use `ProcnameDispatcher.ModeledCall` to pick up the actual arguments with their corresponding values. This simplifies the models. Reviewed By: jvillard Differential Revision: D18685855 fbshipit-source-id: 7788bd8bb	5 years ago
Sungkeun Cho	b15395ad60	[infer] Remove marker from procname dispatcher Summary: This diff removes `'markers` and `'captured_types` from the procname dispatcher. They are for checking an integrity when a type is captured from template parameters then it is used to match in parameters. However, we have not used that feature, so which simply complicates the types in the dispatcher without any gain at the moment. Reviewed By: jvillard Differential Revision: D18706254 fbshipit-source-id: f493778d7	5 years ago
Ezgi Çiçek	3d181bd831	[infer] Polymorphic value type for `FuncArg` Reviewed By: jvillard Differential Revision: D18706143 fbshipit-source-id: 96c91db77	5 years ago
Ezgi Çiçek	3792b9b17a	[infer] Record the value of function arguments in ProcnameDispatcher calls Summary: Preperation diff to use `ProcnameDispatcher` for Pulse: it changes function arguments, i.e. `ProcnameDispatcher.Call.FuncArg`, to a record in order to track the value of arguments. To do that, it changes `ProcnameDispatcher.Call` into a functor so that we can parametrize over the type of the value without making changes upwards. Reviewed By: jvillard Differential Revision: D18590224 fbshipit-source-id: 6a13fbc1a	5 years ago
Nikos Gorogiannis	20a7e9d75b	[starvation][whole-program] add a bit of typestate/dataflow Summary: - Unify treatment of modelled and annotated executors by making things go through attributes. - Add a return attribute to summaries, so that we can track flows of thread guards/executors/future stuff through returned values. - Dispatch modeled functions to model summaries. This will help in following diffs where runnables will also go through attributes. Reviewed By: skcho Differential Revision: D18660185 fbshipit-source-id: e26b1083e	5 years ago
Sungkeun Cho	6885fb4256	[infer] Distinguish dummy struct types from normal ones when merging tenv Summary: Some field types of structs are missing in Java. The reason is: * When capture, empty struct types are added without their fields. * The empty struct types are overwritten to the global tenv when merging all tenvs. As a fix, this diff add a boolean field, `dummy`, in `Typ.Struct.t`, then avoids that non-dummy types are replaced by dummy types. Reviewed By: ngorogiannis Differential Revision: D18657323 fbshipit-source-id: 4a263f8e7	5 years ago
Jules Villard	b3d0461317	[IR] kill PredSymb.func_attribute by moving sentinel attrs to its own ProcAttribute field Summary: This is a better home for knowing whether a function has sentinel args according to its prototype declaration. Reviewed By: dulmarod, artempyanykh Differential Revision: D18573919 fbshipit-source-id: 13f58eaa2	5 years ago
Jules Villard	a9df6a917f	[IR] kill never-true "no_return" flag of Tfun type desc Summary: Another dead flag that one could mistakenly think is accurate. Reviewed By: dulmarod Differential Revision: D18573925 fbshipit-source-id: 129a9cff5	5 years ago
Jules Villard	997948914f	[IR] remove dead no_return CallFlag Summary: This was never set to true except in a wrong way in the Java frontend (see previous diff). Reviewed By: dulmarod Differential Revision: D18573927 fbshipit-source-id: 4c9d1a855	5 years ago
Jules Villard	d79bd90b81	[pdesc] new pre-analysis to diverge after "noreturn" function calls Summary: A plugin update allows infer to know when a function doesn't return according to its attributes. This propagates this info all the way to the attributes of each function, and then use this information in a new pre-analysis that cuts the links to successor nodes of each `Call` instruction to a function that does not return. NOTE: The "no_return" `CallFlag.t` was dead code, following diffs deal with that (by removing it). Reviewed By: dulmarod Differential Revision: D18573922 fbshipit-source-id: 85ec64eca	5 years ago
Jules Villard	78a33acb77	[cfg] run pre-analysis lazily in ondemand Summary: This also prints the CFGs after pre-analysis for individual procedures in infer-out/captured/<filename>/<proc>.dot. One can also look up the CFGs before pre-analysis in infer-out/captured/proc_cfgs_frontend.dot. Context: I want to add a pre-analysis that needs to look at proc attributes inter-procedurally. For this to make sense it has to happen after all of capture, and before analysis. Thus, this diff brings back the lazy running of the pre-analysis like in D15803492, except that we still make sure to run the pre-analyses systematically regardless of the checkers being run by running the pre-analysis from ondemand.ml. Also we don't need to re-introduce the "did_preanalysis" proc attribute for the same reason that the pre-analysis is now run once and for all by ondemand.ml (instead of each individual checker back in the days). This has the benefit of running the pre-analysis only when needed, and the drawback that several concurrent processes analysing the same proc descs will duplicate work. Since pre-analyses are supposed to be very fast I assume that neither is a big deal. If they become more expensive then the benefit gets bigger and the drawback is just the same as with regular analyses. Reviewed By: skcho Differential Revision: D18573920 fbshipit-source-id: de350eaef	5 years ago
Sungkeun Cho	b1698ab0ea	[inferbo] Get static value of EMPTY from class initializer in Java Summary: This diff get static value with `EMPTY` field from class initializer. Reviewed By: ngorogiannis Differential Revision: D18616588 fbshipit-source-id: 26414c9b2	5 years ago
Jules Villard	8289c7e7c7	[dot] move "dot" render of biabduction specs Summary: This allows us to move the CFG rendering to IR/. The parts of that file concerning CFGs and those concerning Biabduction specs were entirely disjoint, it turns out, so that was easy. Reviewed By: jberdine Differential Revision: D18573924 fbshipit-source-id: 0a5ab6478	5 years ago
Jules Villard	b03ca78bf3	[pdesc][refactor] ability to set normal and exceptional succs independently Summary: - more flexible API - less error-prone thanks to named parameters - also takes care of adjusting predecessors of the previous successors! This fixes some (probably harmless) bugs in the frontends. Reviewed By: dulmarod Differential Revision: D18573923 fbshipit-source-id: ad97b3607	5 years ago
Jules Villard	6ecf4066e8	[pulse] model std::integral_constant Summary: cpp_initialization Reviewed By: skcho Differential Revision: D18528537 fbshipit-source-id: ab5f8038a	5 years ago
Dulma Churchill	bf581e0b72	[self in block] Add a check for strongSelf not checked for null Summary: The variable strongSelf, because it is equal to a weak captured pointer, needs to be checked for null before being used, otherwise it could be null by the time the block is executed. Added this check to the SelfInBlock checker, and removed it from biabduction. We want to migrate all the objc checks from biabduction, so it will be easier to change and faster and more reliable. Moreover, this check is more general, it will flag any use of unchecked strongSelf, not just a dereference. Reviewed By: skcho Differential Revision: D18403849 fbshipit-source-id: a9cf5d80b	5 years ago
Josh Berdine	8d20e4d64d	[ocamlformat] Upgrade ocamlformat version Reviewed By: jvillard Differential Revision: D18162727 fbshipit-source-id: ffb9f7541	5 years ago
Sungkeun Cho	773766e3f7	[inferbo] Function call of Java enum values in class initializer Summary: This diff adds semantics of Java function calls of enum `values` inside class initializers. * Java class initializer function initializes a specific field `$VALUES`, which points to the list of enum values. * The `values` function of enum class returns the value of `$VALUES`. The problem is when the `values` function is called inside the class initializer, for example: ``` enum Color { RED, GREEN, BLUE; static { for (Color c : Color.values()) {} } } ``` This introduces a recursive dependency: the class initializer calls `Color.values` and the function returns `Color.$VALUES` the value of which should be initialized in the class initializer. To address the problem, this diff finds the value of `$VALUES` in its abstract memory when `values` is called inside the class initializer. Reviewed By: ezgicicek Differential Revision: D18349281 fbshipit-source-id: 21766c20f	5 years ago
Mitya Lyubarskiy	027ff479d1	[nullsafe] 3rd party annotations from the repo are respected in nullsafe Summary: Follow ups will include error messaging that makes the choice clear Reviewed By: artempyanykh Differential Revision: D18347664 fbshipit-source-id: b6f005726	5 years ago
Dulma Churchill	43823266ec	[self in block] Add a new checker to detect correct uses of when ObjC blocks capture self. Reviewed By: skcho Differential Revision: D18245267 fbshipit-source-id: 6e3f1a7f7	5 years ago
Mitya Lyubarskiy	0c3e568fa4	[Pp] Rename Pp.to_string Summary: Now that we have two similar functions, it becomes confusing, because `Pp.to_string` and `Pp.string_of_pp` can seem to do the same stuff, while in reality they do the opposite. Well, it is still bit confusing, because the proper names would be `Pp.pp_of_to_string` and `Pp.to_string_of_pp`, but I think this high level order names are not necessary given that in most cases they will be used as concrete functions. I think `Pp.of_string` captures such usages better than `to_string` used to do: you need to pp stuff, but you have a string (or, technically, a function that returns a string), so you pretty print OF that string, aren't you? Reviewed By: jvillard Differential Revision: D18245876 fbshipit-source-id: fd4b6ab68	5 years ago
Nikos Gorogiannis	d154415cd0	[starvation] add path sensitivity restricted to thread status Summary: Steal a page from RacerD (and improve interface of) on using certain calls to assert execution on a particular thread. Reduces FPs and FNs too. Reviewed By: dulmarod Differential Revision: D18199843 fbshipit-source-id: 5bdff0dfe	5 years ago
Jules Villard	2e4fbb7fe5	[pulse] intervals! Summary: This adds a more interesting value domain to pulse: concrete intervals. There are still two main limitations: 1. arithmetic operations are all over-approximated: any assignment involving arithmetic operations is replaced by non-determinism 2. abstract values that are discovered to be equal are not merged into one Reviewed By: skcho Differential Revision: D18058972 fbshipit-source-id: 0492a590f	5 years ago
Nikos Gorogiannis	e9b0ca9ce4	[AI] rename Domain.( <= ) to Domain.leq Summary: The way `<=` is used in `AbstractDomain` prevents infix use and forces bracketing it everywhere. Replace with simple `leq`. Reviewed By: jvillard Differential Revision: D18201854 fbshipit-source-id: 8175224e4	5 years ago
Sungkeun Cho	96668ed7d8	[cost] Fix function name matching Summary: `Str.regexp_string` should be used to find a method name instead of `Str.regexp`. Reviewed By: ezgicicek Differential Revision: D18136598 fbshipit-source-id: c4b56dd64	5 years ago
Jules Villard	b818102bad	[pvar] simplified names for generated variables Summary: This will avoid printing stuff like "0$?%__sil_tmpSIL_materialize_temp__n$2 declared" to the poor unsuspecting user. The non-verbose stuff is used only by pulse so far as far I can tell so hopefully this doesn't break anything. Reviewed By: ezgicicek Differential Revision: D17908943 fbshipit-source-id: 8ef4f1a8f	5 years ago
Nikos Gorogiannis	9dbe55c419	[java tracing] goodbye Summary: Unused and in the way. Reviewed By: jvillard Differential Revision: D17878363 fbshipit-source-id: 1b6410e08	5 years ago
Sungkeun Cho	c509f1c178	[cost] Add FB-specific cost models Summary: This diff adds some FB-specific cost models. Reviewed By: ezgicicek Differential Revision: D17787903 fbshipit-source-id: cc49fad83	5 years ago
Sungkeun Cho	dda1486a67	[inferbo] Introduce inequality for size alias target Summary: This diff introduces an inequality for the size alias targets, in order to get preciser array lengths after loops. The alias domain in inferbo was able to express strict equality between alias source and its targets, e.g. x=size(array). Now, for the size alias target, it can express less than or equal relations, e.g. x>=size(array). Reviewed By: ezgicicek Differential Revision: D17606222 fbshipit-source-id: 2557d3bd0	5 years ago
Ezgi Çiçek	8c1fdab0a8	[java] Enhance annotation parsing with the ability to pick up parameter names Summary: When we have an annotation like `Prop(varArg = X)` or ` ThreadSafe(enableChecks = true)`, we were not able to pick up the names of the parameters like `varArg` or `enableChecks`. This diff fixes that. Reviewed By: skcho, ngorogiannis Differential Revision: D17571377 fbshipit-source-id: 5293b5810	5 years ago
Jules Villard	c19d9254b4	[typ] make use of pretty printers instead of strings Summary: As per previous diff, attempt to allocate fewer strings. This doesn't seem to affect perf although allocating less might reduce memory pressure. Reviewed By: mityal Differential Revision: D17423973 fbshipit-source-id: e2e37b071	5 years ago
Jules Villard	088b083d87	[typ] prefer pretty printing to string building Summary: My spidey senses were tingling. Next diff uses the `pp` functions everywhere it was kind of obvious how to change the code to do so. It doesn't improve perf but is less clowny that way. It might lessen memory pressure since allocating strings is expensive and this code was doing a lot of it. Reviewed By: ngorogiannis Differential Revision: D17450324 fbshipit-source-id: 632cee584	5 years ago
Mitya Lyubarskiy	fc651cb876	[nullsafe] Remove deadcode Summary: deadcode was introduced in D17313660 Reviewed By: ngorogiannis Differential Revision: D17395584 fbshipit-source-id: eeb4fa0eb	5 years ago
Sungkeun Cho	962e56cb1b	[infer] Use typ instead of root_typ if possible Summary: This diff makes the checkers, except biabduction, to use `typ` instead of `root_typ` of `Load`/`Store` statemetns. Reviewed By: dulmarod Differential Revision: D17203105 fbshipit-source-id: 8be9b5158	5 years ago
Sungkeun Cho	3916d1b3bc	[infer] Add type field in Sil.Store Summary: It adds typ field in Sil.Store. The field will be used by the analyzer in the following diffs. Motivation: Interbo generates a symbolic value when evaluating expressions including parameter symbols. At that time, it is done with depending on their types, e.g., an integer, a pointer to struct or a pointer to array. Without the type, it is hard to generate a correct symbolic value that will be instantiated later in call sites. Thus, evaluating RHS of the store statement, the type of RHS is better to be given. Reviewed By: dulmarod Differential Revision: D17185346 fbshipit-source-id: f0945c40f	5 years ago
Dulma Churchill	27ea5d041b	[biabduction] Rename use_after_free to avoid name clash with Pulse Summary: Use_after_free was used both for biabduction and pulse, and the biabduction version is blacklisted by default. As a result, the Pulse version was also disabled unintentionally. This changes the name of the old use_after_free so that now we can get use_after_free bugs whenever pulse is enabled. Reviewed By: skcho Differential Revision: D17182687 fbshipit-source-id: 539ca69de	5 years ago
Sungkeun Cho	3250ff35d2	[infer] Add typ field in Sil.Load Summary: It adds `typ` field in Sil.Load. The field will be used by the analyzer in the following diffs. Motivation: Interbo generates a symbolic value when evaluating expressions including parameter symbols. At that time, it is done with depending on their types, e.g., an integer, a pointer to struct or a pointer to array. Without the type, it is hard to generate a correct symbolic value that will be instantiated later in call sites. Thus, evaluating RHS of the load statement, the type of RHS is better to be given. Reviewed By: jvillard Differential Revision: D17163350 fbshipit-source-id: f7f0f1429	5 years ago
Sungkeun Cho	a50fcaf2dd	[infer] Use inline record for Sil.Load and Sil.Store Summary: It uses inline record for Sil.Load and Sil.Store for preparing the following extention. Reviewed By: dulmarod Differential Revision: D17161288 fbshipit-source-id: 637ea7bfa	5 years ago
Sungkeun Cho	78cfc867a5	[inferbo] Print non-verbose program variables Summary: It prints non-verbose program variables in the report. Reviewed By: ngorogiannis Differential Revision: D17163943 fbshipit-source-id: c3f3c2887	5 years ago
Nikos Gorogiannis	b8954e714e	[sqlite] write-server implementation Summary: Implementation of write-serializer for Sqlite. Points of note: - A Unix socket is used for communication. This avoids buffer-size limitations, as the objects we send for writing may exceed said limits. - No daemon is used if running under buck or in genrule mode, as this usually means a single-threaded job capturing into the DB. - When the daemon is running, read-only access is not enforced for other processes. This makes starting and stopping the daemon during Infer execution easier and more robust. In WAL mode this should not have any effect on performance. - This version is not economical with connections, it uses one per query, todo. Reviewed By: jvillard Differential Revision: D17077183 fbshipit-source-id: fa9877d6c	5 years ago
Nikos Gorogiannis	83aea33c68	[sqlite] move all writes to one module Summary: Write contention is becoming a problem in parallel capture (eg when make runs with high parallelism) or when analysis writes CFGs to the DB in parallel (eg when analysing blocks in ObC). This is believed to lead to BUSY errors in Sqlite. This is step 1 of a process where all writes are cordoned-off in one module, and fixing the interface for that module. Reviewed By: skcho Differential Revision: D16985034 fbshipit-source-id: 3d7ce381b	5 years ago
Mitya Lyubarskiy	356ec9afe5	[easy] make method with side-effects looks like it has side-effects Summary: `from_string` is too benign in constrast with what this method is really doing (and oh my what it is really doing). There are a lot of potential follow ups to clean this up even more, but this is beyond the scope of this diff Reviewed By: jvillard Differential Revision: D17070826 fbshipit-source-id: 3d190039e	5 years ago
Nikos Gorogiannis	ccc7dcbc1e	[racerd] use access expressions in place of paths Summary: Access paths are too coarse to properly address C/C++ instructions, and lead to false positives and negatives. Begin the process of porting the underlying domains to access expressions, in a results-preserving way. This roughly consists in: - Adding missing functions in `AccessExpression` to mirror those in `AccessPath`. - Replacing `AccessExpression` for `AccessPath` and removing conversions from the former to the latter except in: - Printing functions, to ensure formatting issues won't change tests/CI. - Reporting/deduplication still happens through access path conversion, as we need an analogue of `ModuloThis` for `AccessExpression`. - In selected places, ignore any access type not present in `AccessPath` (ie. dereference/take address of). Reviewed By: jberdine Differential Revision: D16856721 fbshipit-source-id: 5e3a88b75	5 years ago
Jules Villard	41c003ace1	[biabd] rename models-related things to "biabduction-..." Summary: The models are only for biabduction so try to make that clearer in the code and documentation. Reviewed By: skcho Differential Revision: D16603147 fbshipit-source-id: 4a2be53de	5 years ago
Sungkeun Cho	a3229fc43a	[inferbo] Suppress intended integer underflow of unsigned integer Summary: Sometimes programmers use integer underflow to get a maximum number of that type. This diff assumes that integer underflows from the syntactical form `(unsigned 0) - constant` is intended by the programmer, and suppresses the alarms of which. Reviewed By: ezgicicek Differential Revision: D16560639 fbshipit-source-id: 206f30dbc	5 years ago
Jules Villard	128f37985d	[ocaml] upgrade most dependencies Summary: newer is better, right? All the code changes in infer are because of core being bumped to v0.12. Reviewed By: jberdine Differential Revision: D16223183 fbshipit-source-id: f3c339966	5 years ago
Martin Trojer	124036ea0b	New faster version of Diff/Test-Determinator Reviewed By: jvillard Differential Revision: D15876508 fbshipit-source-id: f5d407025	5 years ago
Nikos Gorogiannis	ae4f7561b3	[hil] class constant types Reviewed By: jvillard Differential Revision: D16073011 fbshipit-source-id: a05ec2b6a	5 years ago
Jules Villard	7f12ced394	[pulse] move to SIL proper Summary: [apologies for the unreviewable diff...] Get rid of HIL expressions in pulse. This finishes the HIL -> SIL migration. The first step made pulse start from SIL instructions but would translate most accesses to HIL to re-use most of the existing pulse code. This diff gets rid of the intermediate translation of SIL expressions to HIL expressions. Big changes: 1. `PulseOperations` mostly rewritten, driven by using `Exp.t` instead of `HilExp.AccessExpression.t` for everything. 2. Stop trying to reverse-engineer what addresses mean in terms of access paths from program variables. Rely on the trace pointing at the right places in the code to be enough. This is because it wasn't that useful (and could even be misleading when wrong) but could be prohibitively expensive in degenerate cases (eg nodes with tens of thousands of successive array accesses...) 3. `PulseAbductiveDomain.apply_post` now returns the computed return value instead of recording it itself. 4. Change of vocabulary: `materialize` -> `eval`, `crumb` -> `event` 5. Function calls arguments are now evaluated prior to doing anything else, which saves everything else from having to (remember to) do that. In particular, this changes how models look quite a bit. Reviewed By: mbouaziz Differential Revision: D15986373 fbshipit-source-id: 1d79935de	6 years ago
Radu Grigore	10d87eec4e	[topl] Simple error reporting. Reviewed By: jvillard Differential Revision: D15875271 fbshipit-source-id: 148206be9	6 years ago
Mehdi Bouaziz	0efd8960e1	[Tenv] Maximum sharing Summary: Reduces the size of the `tenv` by sharing values as most as possible, in an untyped - but supposedly safe - way, by using black magic on objects. Can be reused for other things later. Reviewed By: ngorogiannis Differential Revision: D15855870 fbshipit-source-id: 169a4b86b	6 years ago
Radu Grigore	384b3c5798	Assert that there is at most one flowgraph per procedure name. Reviewed By: jvillard Differential Revision: D15695839 fbshipit-source-id: 979531edb	6 years ago
Mehdi Bouaziz	5f8514a8c2	[sqlite] Normalize blobs used for comparison Summary: Using `Marshal.to_string` to create SQLite values used in comparisons is brittle as there is no guarantee that it will return the same value for structurally equal values. When adding sharing, this will definitely break. From the SQLite queries I found, only `SourceFile` and `Procname` are used in comparisons. I haven't tested performance. It shouldn't change anything for `SourceFile` as there is no possible sharing. It shouldn't change much for `Procname` as they are pretty small anyway. Reviewed By: ngorogiannis Differential Revision: D15923122 fbshipit-source-id: ce4af1fe3	6 years ago
Jules Villard	04233ee49b	[clang] destroy C++ temporaries Summary: Inject destructor calls to destroy a temporary when its lifetime ends. Reviewed By: mbouaziz Differential Revision: D15674209 fbshipit-source-id: 0f783a906	6 years ago
Jules Villard	0592bac25e	[pulse] explain SIL logical variables in terms of program access paths Summary: Now that HIL doesn't help us anymore we need to reconstruct its mapping "SIL logical var -> program access path". We already have everything we need in pulse: it suffices to walk the current memory graph starting from program variables until we find the value of the temporary we are interested in. This diff also builds some type machinery to make sure all accesses are explained. Reviewed By: mbouaziz Differential Revision: D15824959 fbshipit-source-id: 722c81b39	6 years ago
Jules Villard	c9f4768be7	[pulse] move to SIL Summary: It turns out HIL gets in the way of a precise heap analysis. For instance, instead of: ``` n$0 = &x.f _ = delete(&x) &y = n$0 ``` HIL tries hard to forget about intermediate variables and shows instead ``` _ = delete(&x) &y = &x.f ``` Oops, that's a use-after-delete, whereas the original code was safe. While it's easy to write SIL programs that are completely unsound for HIL, they are not generated very often from the frontends. In fact, the problem became apparent only when making the clang frontend translate C++ temporaries destructors, which produces the situation above routinely. This diff makes the minimal amount of change to make Pulse build and produce equivalent results (minus HIL bugs) starting from SIL instead of HIL. The reporting sucks for now because we need to translate SIL temporaries back into program access paths. This is done in the next diff. Reviewed By: mbouaziz Differential Revision: D15824961 fbshipit-source-id: 8e4e2a3ed	6 years ago
Ezgi Çiçek	fedb8e5136	[infer] Cleanup preanalysis Summary: Preanalysis is performed at the frontend now. Hence, we don't need to repeatedly check/set when/if it is performed. Reviewed By: mbouaziz Differential Revision: D15863175 fbshipit-source-id: f9c6b7ae1	6 years ago
Nikos Gorogiannis	013d153538	[buck/java2] hashcons the global tenv during merging Summary: One "interesting" feature of the approach of merging the captured targets in Java, is that we union their type environments, as opposed to store partial tenvs together with each source file, which is the case for Clang. This means - the final global type environment is potentially huge because it contains all the types in all targets. - all analysis workers start by loading that tenv in memory, meaning we consume `\|size of tenv\| x #cpus` memory, which can tip the balance towards OOMs This diff attempts to economise on global tenv size. This is done by increasing sharing which is then preserved by marshalling. It's done in a brute force way, with hashtables for each struct component, and is not fully effective due to the recursion amongst types and types names, as well types appearing inside other constructs such as procnames. This is done when calling `Tenv.store` so that - the computation can be parallelised somewhat (capture is parallel, merging is not) - buck caching will benefit from smaller tenvs. This saves about 24% of total memory devoted to the type environment. Reviewed By: mbouaziz Differential Revision: D15840054 fbshipit-source-id: 6f03be1a4	6 years ago
Jules Villard	db800f138b	[clang] rewrite scope computations Summary: This started as an attempt to understand how to modify the frontend to inject destructors for C++ temporaries (see next diffs). This diff rewrites the existing logic for computing the list of variables that should be destroyed at the end of each statement, either because it's the end of their syntactic scope or because control flow branches outside of their syntactic scope. The frontend translates a function from the last instructions to the first, but scope computation needs to be done in the other direction, so it's done in a separate pass before the main translation happens. That first pass creates a map from statements in the AST to the list of variables that should be destroyed at the end of these statements. This is still the case now. Before, that map would be computed in a bit of a weird way: scopes are naturally a stack but instead of that the structure maintained was a flat list + a counter to know where the current scope ended in that list. In this diff, redo the computation maintaining a stack of scopes instead, which is a bit cleaner. Also treat more instructions as introducing a new scope, eg if, for, ... Reviewed By: mbouaziz Differential Revision: D15674208 fbshipit-source-id: c92429e82	6 years ago
Jules Villard	eaa5c32432	[clang] some more debug info Summary: Somewhat trivial: add a string to "Destruction" nodes to indicate why they were created. Rename the main `instruction_aux` function into `instruction_translate` (see next diff for why). Reviewed By: mbouaziz Differential Revision: D15674211 fbshipit-source-id: 8a7eda72c	6 years ago
Jules Villard	696731523d	[pname dispatcher] more permissive templated function match Summary: This allows to match `foo<int_&>` and many other horrible names. Reviewed By: mbouaziz Differential Revision: D15825403 fbshipit-source-id: c892033aa	6 years ago
Josh Berdine	cfc1c8be36	[copyright] Remove years Reviewed By: jvillard Differential Revision: D15771884 fbshipit-source-id: e2997e3a3	6 years ago
Ezgi Çiçek	d2eb3c8cc6	[inefficient-keyset-iterator] New checker for finding inefficient keySet iterator Summary: This is a simple checker that identifies inefficient uses of `keySet` iterator where (not only the key but also) the value is accessed via `get(key)`. It is more efficient to use `entrySet` iterator which already returns both key-value pairs. This optimization would get rid of many extra lookups which can be expensive. We simply traverse the CFG starting from the loop head upwards and pick up the map that is iterated over. Then, we check in the loop nodes if there is a call to `get(...)` over this map. If, so we report. Reviewed By: ngorogiannis Differential Revision: D15737779 fbshipit-source-id: 702465b4e	6 years ago
Radu Grigore	d86e2f0d1c	[topl] Generate monitor. Summary: The synthetic methods from `topl.Property` are now nonempty: they simulate a nondeterministic automaton. Reviewed By: jvillard Differential Revision: D15668471 fbshipit-source-id: 050408283	6 years ago
Radu Grigore	047c64c528	[topl] Instrument SIL. Summary: Instrument SIL according to TOPL properties. Roughly, the instrumentation is a set of calls into procedures that simulate a nondeterministic automaton. For now, those procedures are NOP dummies. Reviewed By: jvillard Differential Revision: D15063942 fbshipit-source-id: d22c2f6fa	6 years ago
Ezgi Çiçek	99bc7363bf	[cost] Suppress reports on Java access methods Reviewed By: ngorogiannis Differential Revision: D15696182 fbshipit-source-id: 2f84789a7	6 years ago
Nikos Gorogiannis	bc61543875	[buckjava2] refactor Reviewed By: jberdine Differential Revision: D15516135 fbshipit-source-id: e8067cf66	6 years ago
Radu Grigore	3cf774a142	Fixed typos in comments. Reviewed By: ddino Differential Revision: D15512983 fbshipit-source-id: aa693cc5a	6 years ago
Jules Villard	d586630edf	[pules] do not print templated part of function names Summary: This messes with the deduplication heuristic when templated function names show up in the error messages, since the heuristic demands that the error messages are the same. Reviewed By: mbouaziz Differential Revision: D15374333 fbshipit-source-id: 70232d254	6 years ago
Jules Villard	5de9bc29d2	[pulse] better error messages Summary: Improve the error messages, change is more or less documented in the code. Reviewed By: mbouaziz Differential Revision: D15374334 fbshipit-source-id: f1dd54180	6 years ago
Jules Villard	b700af9ffb	[hil] do not put parens around trivial expressions Summary: `(x)` -> `x` `&(x)` -> `&x` everything else unchanged Reviewed By: mbouaziz Differential Revision: D15374360 fbshipit-source-id: af5ef4e66	6 years ago
Martin Trojer	e7ad99eed0	Using DB to store modified functions Reviewed By: jvillard Differential Revision: D15181951 fbshipit-source-id: 96be170c9	6 years ago
Nikos Gorogiannis	7106de35a3	[issuelogs] less imperative Reviewed By: jvillard Differential Revision: D15278599 fbshipit-source-id: 54b190d94	6 years ago
Nikos Gorogiannis	d082f36448	[sqlite] calls in the db Reviewed By: mbouaziz, martintrojer Differential Revision: D15199334 fbshipit-source-id: 7938a2024	6 years ago
Nikos Gorogiannis	8450ac36d8	[trivial] procname should implement Hashable Summary: No reason to use custom function name and not implement `Hashable`. Reviewed By: mbouaziz Differential Revision: D15097603 fbshipit-source-id: 7303fc15e	6 years ago
Jeremy Dubreil	95ddfd04ca	Revert "[topl] Synthesize trivial procedures." Reviewed By: mbouaziz Differential Revision: D15087665 fbshipit-source-id: 001f31093	6 years ago
Mehdi Bouaziz	2a0ec8c0db	Fix infer explore --source-files-procedure-names Reviewed By: jeremydubreil Differential Revision: D15050147 fbshipit-source-id: 61a44a81a	6 years ago
Radu Grigore	86aae0b8ed	[topl] Synthesize trivial procedures. Summary: TOPL properties are essentially automata, which will be modeled as a set of procedures. The code-to-analyze makes calls into these procedures, thereby driving the automaton. In this commit, these calls do not do anything. The point is to prepare the hook-up mechanism. Reviewed By: jvillard Differential Revision: D14819650 fbshipit-source-id: d95ecdb3d	6 years ago
Jules Villard	1e3fafb558	[report] avoid embarrassing "object `null` could be null" message Summary: A long-standing easter egg from infer error messages is the "object `null` could be null and is dereferenced at line ...". I tried to fix this but the part that generates the first "null" in the message and the part that generates the second one are very far apart and it's hard to see how to make the second part aware of the first in a clean way. Instead, hack around it by detecting if the string representing the value is literally `null` and in that case chop `could be null ` from the error messages... Reviewed By: jeremydubreil Differential Revision: D14972324 fbshipit-source-id: ccc48ce6b	6 years ago
Jules Villard	95132bc3f0	[report] restore missing "could be null and is dereferenced" message for nullable dereference Summary: We get messages like " object returned by `getArguments()` at line 101." instead of " object returned by `getArguments()` could be null and is dereferenced at line 101.". Tracking it down, it happens for nullable-looking values, but I don't know why. It seems that something regressed but I couldn't track it down. So, just generate the error message in the same way as for non-nullable objects in this case to fix the non-sensical message. Reviewed By: jeremydubreil Differential Revision: D14972325 fbshipit-source-id: 2a97501cc	6 years ago
Jules Villard	b5589661ce	[pulse] improve error messages and traces Summary: Feedback from peterogithub: - mention which access path is being invalidated and accessed in the message - mention the line at which it was invalidated (the line at which it's accessed is already the line at which we report) - traces for stack variable/C++ temporary address escapes - delete double implementation of the same functionality in `PulseTrace`: `location_of_action_start` is the same as `outer_location_of_action`... Reviewed By: jberdine Differential Revision: D14800294 fbshipit-source-id: 3d9ab9b3d	6 years ago
Jules Villard	ada032ee2c	[pulse] improve error messages and traces Summary: The previous message formatting had regressed and produced non-sensical messages. More importantly, remove template parameters from error messages to trigger the heuristic in `InferPrint` that deduplicates errors that are on the same line with the same error type and message. Without this we get hundreds of reports that correspond to as many instantiations of the same code. Reviewed By: ngorogiannis Differential Revision: D14747979 fbshipit-source-id: 3c4aad2b1	6 years ago
Jules Villard	53b1577b4c	[pulse][interproc 3/3] interproc call Summary: biggest_diff Reviewed By: jberdine Differential Revision: D14387150 fbshipit-source-id: 6d6ddeffc	6 years ago
Jules Villard	686231ec6e	[SIL] change `variable_initialization()` builtin to a new auxiliary instruction Summary: Instead of emitting an ad-hoc builtin on variable declaration emit a new metadata instruction. This allows us to remove the code matching on that ad-hoc builtin that had to be inserted in several checkers. Inferbo & pulse used that information meaningfully and had to undergo some minor changes to cope with the new metada instruction. Reviewed By: ezgicicek Differential Revision: D14833100 fbshipit-source-id: 9b3009d22	6 years ago
Jules Villard	2151be9c25	[issues] do not dedup issues when `Config.filtering` is unset Summary: Deduplication can make debugging infer findings trickier. Reviewed By: ngorogiannis Differential Revision: D14773548 fbshipit-source-id: 731c7b749	6 years ago
Jules Villard	ebe5028ca1	[SIL] add `Skip` metadata instruction Summary: springcleaning2 Reviewed By: ezgicicek Differential Revision: D14827673 fbshipit-source-id: 0d3cf730b	6 years ago
Jules Villard	b665e1c575	[SIL][HIL] distinguish auxiliary instructions as `Metadata` Summary: Bundle all non-semantic-bearing instructions into a `Metadata _` instruction in SIL. - On a documentation level this makes clearer the distinction between instructions that encode the semantics of the program and those that are just hints for the various backend analysis. - This makes it easier to add more of these auxiliary instructions in the future. For example, the next diff introduces a new `Skip` auxiliary instruction to replace the hacky `ExitScope([], Location.dummy)`. - It also makes it easier to surface all current and future such auxiliary instructions to HIL as the datatype for these syntactic hints can be shared between SIL and HIL. This diff brings `Nullify` and `Abstract` to HIL for free. Reviewed By: ngorogiannis Differential Revision: D14827674 fbshipit-source-id: f68fe2110	6 years ago
David Lively	5d4a27ea54	RFC: stop using _ to separate ObjC/C++ class name from method in Typ.Procname.to_string Reviewed By: jvillard Differential Revision: D14736442 fbshipit-source-id: 500df354b	6 years ago

1 2 3 4 5 ...

1042 Commits (b71521a90a04b103d397bcf16377a729e546b363)