This is an archive of the discontinued LLVM Phabricator instance.

Paths

Table of Contentst

-
clang/lib/Analysis/
-
lib/
-
Analysis/
1/4
LiveVariables.cpp

Differential D87519

[analyzer][Liveness][NFC] Enqueue the CFGBlocks post-order
Needs ReviewPublic

Authored by Szelethus on Sep 11 2020, 9:12 AM.

Download Raw Diff

Details

Reviewers

xazax.hun
NoQ
vsavchenko
balazske
martong
baloghadamsoftware
steakhal

Diff Detail

Event Timeline

Szelethus created this revision.Sep 11 2020, 9:12 AM

Herald added subscribers: cfe-commits, ASDenysPetrov, Charusso and 8 others. · View Herald TranscriptSep 11 2020, 9:12 AM

Szelethus requested review of this revision.Sep 11 2020, 9:12 AM

Harbormaster completed remote builds in B71382: Diff 291241.Sep 11 2020, 9:12 AM

xazax.hun added inline comments.Sep 11 2020, 11:10 AM

clang/lib/Analysis/LiveVariables.cpp
522	With `BackwardDataflowWorklist`, each `enqueueBlock` will insert the block into a `llvm::PriorityQueue`. So regardless of the insertion order, `dequeue` will return the nodes in the reverse post order. Inserting elements in the right order into the heap might be beneficial is we need to to less work to "heapify". But on the other hand, we did more work to insert them in the right order, in the first place. All in all, I am not sure whether the comment is still valid and whether this patch would provide any benefit over the original code.

martong mentioned this in D87518: [analyzer][Liveness][NFC] Remove an unneeded pass to collect variables that appear in an assignment.Sep 14 2020, 5:55 AM

martong added inline comments.Sep 14 2020, 6:43 AM

clang/lib/Analysis/LiveVariables.cpp
522	Yes, what Gabor says makes sense. On the other hand I don't see any overhead - I might be wrong though - in the post order visitation. And it makes the code more consistent IMHO. Well, it would be important to know why the original author put the // FIXME: we should enqueue using post order. there. The blamed commit 77ff930fff15c3fc76101b38199dad355be0866b is not saying much.

baloghadamsoftware added inline comments.Sep 28 2020, 8:39 AM

clang/lib/Analysis/LiveVariables.cpp
522	Please compare the execution time with and without this patch. I think it is difficult do decide in theory which one costs more: the heapification during insertion or the reverse ordering before the insertion.

I don't insist on this patch, though I will end up removing the FIXME even if I leave the actual code unchanged, as it seems to be outdated.

clang/lib/Analysis/LiveVariables.cpp
522	I think the performance hit, given that there is any, must be negligible. On the project c4, a tiny C compiler written in 4 large functions, the current liveness analysis runtimes on my machine in debug mode look like this: Liveness analysis on next: 7.397604e-03s Liveness analysis on expr: 1.203549e-02s Liveness analysis on stmt: 5.470070e-04s Liveness analysis on main: 1.344798e-02s Liveness analysis on main: 1.415095e-02s Liveness analysis on stmt: 5.550660e-04s Liveness analysis on expr: 1.197334e-02s Liveness analysis on next: 7.181059e-03s After this patch, they look like this: Liveness analysis on next: 7.313751e-03s Liveness analysis on expr: 1.211920e-02s Liveness analysis on stmt: 5.582670e-04s Liveness analysis on main: 1.372210e-02s Liveness analysis on main: 1.437104e-02s Liveness analysis on stmt: 5.685340e-04s Liveness analysis on expr: 1.269498e-02s Liveness analysis on next: 7.094738e-03s Mind that this measured the entire analysis, not just enqueuing. I think its fair to say that even on relatively large functions, the difference is within the margin of error, and is most definitely incomparable to its only user's runtime, the static analyzer itself. Well, it would be important to know why the original author put the [TODO] there. Yep, I think its just an outdated comment.

What about analyzing a Sema translation unit? That should be beefy enough to have a longer runtime.

Revision Contents

Path

Size

clang/

lib/

Analysis/

LiveVariables.cpp

9 lines

Diff 291241

clang/lib/Analysis/LiveVariables.cpp

	//=- LiveVariables.cpp - Live Variable Analysis for Source CFGs ----------*-==//			//=- LiveVariables.cpp - Live Variable Analysis for Source CFGs ----------*-==//
	//			//
	// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.			// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.
	// See https://llvm.org/LICENSE.txt for license information.			// See https://llvm.org/LICENSE.txt for license information.
	// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception			// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception
	//			//
	//===----------------------------------------------------------------------===//			//===----------------------------------------------------------------------===//
	//			//
	// This file implements Live Variables analysis for source-level CFGs.			// This file implements Live Variables analysis for source-level CFGs.
	//			//
	//===----------------------------------------------------------------------===//			//===----------------------------------------------------------------------===//

	#include "clang/Analysis/Analyses/LiveVariables.h"			#include "clang/Analysis/Analyses/LiveVariables.h"
	#include "clang/AST/Stmt.h"			#include "clang/AST/Stmt.h"
	#include "clang/AST/StmtVisitor.h"			#include "clang/AST/StmtVisitor.h"
				#include "clang/Analysis/Analyses/PostOrderCFGView.h"
	#include "clang/Analysis/AnalysisDeclContext.h"			#include "clang/Analysis/AnalysisDeclContext.h"
	#include "clang/Analysis/CFG.h"			#include "clang/Analysis/CFG.h"
	#include "clang/Analysis/FlowSensitive/DataflowWorklist.h"			#include "clang/Analysis/FlowSensitive/DataflowWorklist.h"
	#include "llvm/ADT/DenseMap.h"			#include "llvm/ADT/DenseMap.h"
	#include "llvm/Support/raw_ostream.h"			#include "llvm/Support/raw_ostream.h"
	#include <algorithm>			#include <algorithm>
	#include <vector>			#include <vector>

	▲ Show 20 Lines • Show All 473 Lines • ▼ Show 20 Lines

	std::unique_ptr<LiveVariables>			std::unique_ptr<LiveVariables>
	LiveVariables::computeLiveness(AnalysisDeclContext &AC, bool killAtAssign) {			LiveVariables::computeLiveness(AnalysisDeclContext &AC, bool killAtAssign) {

	// No CFG? Bail out.			// No CFG? Bail out.
	CFG *cfg = AC.getCFG();			CFG *cfg = AC.getCFG();
	if (!cfg)			if (!cfg)
	return nullptr;			return nullptr;
				assert(AC.getAnalysis<PostOrderCFGView>() &&
				"If the CFG exists, we should be able to create a post order view of "
				"it!");

	// The analysis currently has scalability issues for very large CFGs.			// The analysis currently has scalability issues for very large CFGs.
	// Bail out if it looks too large.			// Bail out if it looks too large.
	if (cfg->getNumBlockIDs() > 300000)			if (cfg->getNumBlockIDs() > 300000)
	return nullptr;			return nullptr;

	LiveVariablesImpl *LV = new LiveVariablesImpl(AC, killAtAssign);			LiveVariablesImpl *LV = new LiveVariablesImpl(AC, killAtAssign);

	// Construct the dataflow worklist. Enqueue the exit block as the			// Construct the dataflow worklist. Enqueue the exit block as the
	// start of the analysis.			// start of the analysis.
	BackwardDataflowWorklist worklist(*cfg, AC);			BackwardDataflowWorklist worklist(*cfg, AC);
	llvm::BitVector everAnalyzedBlock(cfg->getNumBlockIDs());			llvm::BitVector everAnalyzedBlock(cfg->getNumBlockIDs());

	// FIXME: we should enqueue using post order.			for (const CFGBlock B : AC.getAnalysis<PostOrderCFGView>()) {
				xazax.hunUnsubmitted Not Done Reply Inline Actions With `BackwardDataflowWorklist`, each `enqueueBlock` will insert the block into a `llvm::PriorityQueue`. So regardless of the insertion order, `dequeue` will return the nodes in the reverse post order. Inserting elements in the right order into the heap might be beneficial is we need to to less work to "heapify". But on the other hand, we did more work to insert them in the right order, in the first place. All in all, I am not sure whether the comment is still valid and whether this patch would provide any benefit over the original code. xazax.hun: With `BackwardDataflowWorklist`, each `enqueueBlock` will insert the block into a `llvm…
				martongUnsubmitted Not Done Reply Inline Actions Yes, what Gabor says makes sense. On the other hand I don't see any overhead - I might be wrong though - in the post order visitation. And it makes the code more consistent IMHO. Well, it would be important to know why the original author put the // FIXME: we should enqueue using post order. there. The blamed commit 77ff930fff15c3fc76101b38199dad355be0866b is not saying much. martong: Yes, what Gabor says makes sense. On the other hand I don't see any overhead - I might be wrong…
				baloghadamsoftwareUnsubmitted Not Done Reply Inline Actions Please compare the execution time with and without this patch. I think it is difficult do decide in theory which one costs more: the heapification during insertion or the reverse ordering before the insertion. baloghadamsoftware: Please compare the execution time with and without this patch. I think it is difficult do…
				SzelethusAuthorUnsubmitted Done Reply Inline Actions I think the performance hit, given that there is any, must be negligible. On the project c4, a tiny C compiler written in 4 large functions, the current liveness analysis runtimes on my machine in debug mode look like this: Liveness analysis on next: 7.397604e-03s Liveness analysis on expr: 1.203549e-02s Liveness analysis on stmt: 5.470070e-04s Liveness analysis on main: 1.344798e-02s Liveness analysis on main: 1.415095e-02s Liveness analysis on stmt: 5.550660e-04s Liveness analysis on expr: 1.197334e-02s Liveness analysis on next: 7.181059e-03s After this patch, they look like this: Liveness analysis on next: 7.313751e-03s Liveness analysis on expr: 1.211920e-02s Liveness analysis on stmt: 5.582670e-04s Liveness analysis on main: 1.372210e-02s Liveness analysis on main: 1.437104e-02s Liveness analysis on stmt: 5.685340e-04s Liveness analysis on expr: 1.269498e-02s Liveness analysis on next: 7.094738e-03s Mind that this measured the entire analysis, not just enqueuing. I think its fair to say that even on relatively large functions, the difference is within the margin of error, and is most definitely incomparable to its only user's runtime, the static analyzer itself. Well, it would be important to know why the original author put the [TODO] there. Yep, I think its just an outdated comment. Szelethus: I think the performance hit, given that there is any, must be negligible. On the project…
	for (const CFGBlock *B : cfg->nodes()) {
	worklist.enqueueBlock(B);			worklist.enqueueBlock(B);
	}			}

	while (const CFGBlock *block = worklist.dequeue()) {			while (const CFGBlock *block = worklist.dequeue()) {
	// Determine if the block's end value has changed. If not, we			// Determine if the block's end value has changed. If not, we
	// have nothing left to do for this block.			// have nothing left to do for this block.
	LivenessValues &prevVal = LV->blocksEndToLiveness[block];			LivenessValues &prevVal = LV->blocksEndToLiveness[block];

	▲ Show 20 Lines • Show All 92 Lines • Show Last 20 Lines