infer_clone

Commit Graph

Author	SHA1	Message	Date
Nikos Gorogiannis	386f303b1d	[scheduler][restart] use proc_uids instead of serialised procnames as targets Summary: Eliminate the need to serialise procnames when sending work from the restart scheduler to the workers, by sending the proc_uid instead. This is (much) shorter than the byte representation of the proc_name and it's the primary DB key of the procedures table, so it can be used by the worker to obtain the full procname. Also, reduce GC churn by using folds in the scheduler startup instead of copying lists over and over. Reviewed By: jberdine Differential Revision: D23566131 fbshipit-source-id: 1472aa990	5 years ago
Nikos Gorogiannis	5406fa3224	[scheduler][restart] use filenames instead of procnames for dependencies Summary: Limit communication bandwidth and serialisation burden by sending procedure filename strings (which are bounded at ~100 bytes) instead of serialising procnames through the socket to the scheduler (which are unbounded and have been seen to reach ~30kB in the worst case for templated procedures). Context: Under the restart scheduler, a worker working on a procedure X that discovers a race on a dependency Y it needs fails the computation of X and sends to the scheduler the procname Y. The next time X is about to be rescheduled, the scheduler checks whether Y is still being analysed, by checking if the lock for Y still exists. This check uses the procedure filename already, so we can send that instead. Reviewed By: jvillard Differential Revision: D23554995 fbshipit-source-id: 9828e71a2	5 years ago
Jules Villard	de47214bcd	[absint] do not log restart scheduler exceptions Summary: These exceptions were caught earlier before but D21257474 made absint log an error every time before reraising them. The exception type had to move to IR/ or absint/, so I moved "SchedulerTypes" to "absint/TaskSchedulerTypes" and added the restart scheduler's exception there. There is already a "Scheduler.ml" file in absint/ so to address the ambiguity I added "Task" in front of that one. Reviewed By: ngorogiannis Differential Revision: D21348593 fbshipit-source-id: 58055c9b7	5 years ago
Nikos Gorogiannis	e01311c431	[scheduler][callgraph] load graph directly from DB Summary: Currently the call graph of all captured procedures is loaded and then traversed to flag reachable procedures from modified files, followed by deleting the unflagged part, and unflagging the rest. This is a bit wasteful, and doesn't lend itself nicely to constructing directly the reverse call graph, which further diffs will do. This diff loads all captured procedures and callees in a hashconsed table, and performs a BFS from procedures in modified files, to build the call graph in one pass. Reviewed By: fgasperij Differential Revision: D19888965 fbshipit-source-id: eeb59356e	5 years ago
Nikos Gorogiannis	3f4458361c	[scheduler][callgraph] only load defined procs Summary: To ease scheduling, it would be best to only load the procnames of procedures that are (a) defined and (b) reachable from the modified files. The frontends play various games with the DB properties: - In Clang all methods have a CFG even if they are undefined. Also, looking for non-NULL CFG rows in the DB brings up methods unreachable from modified files (?). - In Java, some procedures have NULL CFGs. In addition, some of those have `attr_kind!=0`. We only load those procedures that have both non-NULL CFGs and `attr_kind!=0`. That seems to give meaningful numbers, esp. wrt reachable procedures from files. Reviewed By: jberdine Differential Revision: D20068376 fbshipit-source-id: 992b65b4a	5 years ago
Fernando Gasperi Jabalera	5c5609591e	[scheduler][restart] Reduce live-locking by using data produced on failure Summary: When a worker fails because it can't a get the lock of a `Procname` it will include it in the exception that it throws so the `RestartScheduler` can record it as a dependency. Then when scheduling a new work item from `RestartScheduler.next` it will check if this dependency is already met, if it isn't it will not schedule the `Procname` yet. Reviewed By: ngorogiannis Differential Revision: D19820331 fbshipit-source-id: b48cacc9a	5 years ago
Nikos Gorogiannis	ddfa6fc96e	[scheduler][callgraph] use counter instead of set of scheduled procs Summary: No reason to use a set when an integer will suffice. This further reduces GC churn. Reviewed By: fgasperij Differential Revision: D19888300 fbshipit-source-id: 9fc8c73f5	5 years ago
Nikos Gorogiannis	a44e138dd4	[scheduler][callgraph] use a queue for graph leaves instead of a list Summary: Queues are implemented using a circular array, so should be less GC-heavy than continually allocating/freeing list nodes. Reviewed By: jberdine, fgasperij Differential Revision: D18504104 fbshipit-source-id: 93d29c253	5 years ago
Nikos Gorogiannis	757f6ee829	[scheduler][static callgraph] remove lazy init now that it's enforced in ProcessPool Summary: Building the call graph should be done only in the scheduler process after having forked all workers. This was achieved by a lazy init pattern, whereby the first time `next` was called, it would build the call graph, on the assumption that `next` is only ever called in the scheduler after forking. D19769741 made this compulsory regardless the scheduler by passing a thunk to `ProcessPool` which is called to obtain the actual scheduler, on the right process and after the fork. This means we don't need the custom lazy init logic any more. In addition, that set up used a DB query to overapproximate the number of procedures to analyse, because this was supposed to be provided before forking. Now this is also not needed, and on top of that we can provide the exact number after building the call graph. Reviewed By: ezgicicek Differential Revision: D19833974 fbshipit-source-id: 7f6d51d93	5 years ago
Fernando Gasperi Jabalera	11300370ed	Add task_result type for scheduler analysis tasks Summary: The RestartScheduler needs to know if the worker finished it's task because: 1. there was no more work to do or 2. found that a needed Procname was already taken (this part is not yet implemented) This need was addressed by (i) making the functions that the workers execute return a value of task_result.t intead of unit and (ii) adding a constructor to the worker_message.t (FinishedTask). Reviewed By: ngorogiannis Differential Revision: D19467783 fbshipit-source-id: a76b02b6c	5 years ago
Fernando Gasperi Jabalera	87b29a2d72	Add --scheduler option Reviewed By: ngorogiannis Differential Revision: D19330599 fbshipit-source-id: f185b92ab	5 years ago
Nikos Gorogiannis	91fa6a5404	[typ] extract Procname from Typ Summary: No reason for this to be in Typ Reviewed By: skcho Differential Revision: D19162727 fbshipit-source-id: d6940637a	5 years ago
Nikos Gorogiannis	e7874c74f4	[call-graph sched] small simplifications Summary: There was some over-general treatment of reachability, in anticipation of changes that didn't happen. In particular, we only need to flag/remove single nodes, as they must be leaves to be scheduled, therefore we never need to traverse their successors, because there aren't any. Reviewed By: jvillard Differential Revision: D18425905 fbshipit-source-id: b86490542	5 years ago
Nikos Gorogiannis	be43364d05	[sched] refactor into a more sane structure Summary: - Convert `task_generator` into a module of `ProcessPool` and collect inside the two combinators which were in semi-random places. - Make `SyntacticCallGraph` export a `task_generator` as opposed to a call-graph builder. - Separate `target` type and put it in its own module to avoid dependency cycles. Reviewed By: skcho Differential Revision: D18425718 fbshipit-source-id: 7957edac8	5 years ago
Phoebe Nichols	1415be9153	Log the reverse analysis call graph for tests Summary: The reverse analysis call-graph is logged if `--debug-level-analysis` > 0, so that its value can be inspected for tests Reviewed By: jvillard Differential Revision: D16440567 fbshipit-source-id: 1ec6af1f3	6 years ago
Phoebe Nichols	578b1c95f1	Add function to add an edge to the call graph Summary: The reverse call graph will be constructed by adding edges one-by-one, so expose functionality in CallGraph to add a single edge to the graph Reviewed By: jvillard Differential Revision: D16285016 fbshipit-source-id: 553fe1ecf	6 years ago
Phoebe Nichols	82eb91fe71	Move core CallGraph API from SyntacticCallGraph.ml to CallGraph.ml Summary: Move the logic that is general to any call graph from SyntacticCallGraph.ml into CallGraph.ml This will allow the call graph logic to be re-used in a later diff Reviewed By: ezgicicek Differential Revision: D16265150 fbshipit-source-id: 10a067f28	6 years ago
Phoebe Nichols	3e7f500ae8	Rename CallGraph.ml to SyntacticCallGraph.ml Summary: `CallGraph.ml` computes a call graph using the explicit procedure calls in the source code (ie computes a syntactic call graph) I am going to be adding code for an 'analysis call graph' that gives the callees of a procedure from the perspective of the analyses in infer This diff renames `CallGraph.ml` to avoid confusion with the new analysis call graph logic Reviewed By: ngorogiannis, jvillard Differential Revision: D16204436 fbshipit-source-id: 67bed8e28	6 years ago

18 Commits (990d0fbed5f6686e6710347270f61b3545203157)