diff options
| author | Andrew Trick <atrick@apple.com> | 2011-05-06 17:09:08 +0000 |
|---|---|---|
| committer | Andrew Trick <atrick@apple.com> | 2011-05-06 17:09:08 +0000 |
| commit | aab77fe5749d48bd8ca8556e8954319505c3fc44 (patch) | |
| tree | c0aa04e229eeac9d12b71f68d3c5ad071d333463 /llvm/lib/CodeGen/PostRASchedulerList.cpp | |
| parent | 17b532728b2d5bba2539eb04f672e1ec4c19756a (diff) | |
| download | llvm-aab77fe5749d48bd8ca8556e8954319505c3fc44.zip llvm-aab77fe5749d48bd8ca8556e8954319505c3fc44.tar.gz llvm-aab77fe5749d48bd8ca8556e8954319505c3fc44.tar.bz2 | |
Post-RA scheduler compile time fix. Quadratic computation of DAG node depth.
The post-ra scheduler was explicitly updating the depth of a node's
successors after scheduling it, regardless of whether the successor
was ready. This is quadratic for DAGs with transitively redundant
edges. I simply removed the useless update of depth, which is lazilly
computed later.
Fixes <rdar://problem/9044332> compiler takes way too long to build TextInput.
llvm-svn: 130992
Diffstat (limited to 'llvm/lib/CodeGen/PostRASchedulerList.cpp')
| -rw-r--r-- | llvm/lib/CodeGen/PostRASchedulerList.cpp | 14 |
1 files changed, 10 insertions, 4 deletions
diff --git a/llvm/lib/CodeGen/PostRASchedulerList.cpp b/llvm/lib/CodeGen/PostRASchedulerList.cpp index 60c24b7..848af2a 100644 --- a/llvm/lib/CodeGen/PostRASchedulerList.cpp +++ b/llvm/lib/CodeGen/PostRASchedulerList.cpp @@ -540,10 +540,16 @@ void SchedulePostRATDList::ReleaseSucc(SUnit *SU, SDep *SuccEdge) { #endif --SuccSU->NumPredsLeft; - // Compute how many cycles it will be before this actually becomes - // available. This is the max of the start time of all predecessors plus - // their latencies. - SuccSU->setDepthToAtLeast(SU->getDepth() + SuccEdge->getLatency()); + // Standard scheduler algorithms will recomute the depth of the successor + // here as such: + // SuccSU->setDepthToAtLeast(SU->getDepth() + SuccEdge->getLatency()); + // + // However, we lazily compute node depth instead. Note that + // ScheduleNodeTopDown has already updated the depth of this node which causes + // all descendents to be marked dirty. Setting the successor depth explicitly + // here would cause depth to be recomputed for all its ancestors. If the + // successor is not yet ready (because of a transitively redundant edge) then + // this causes depth computation to be quadratic in the size of the DAG. // If all the node's predecessors are scheduled, this node is ready // to be scheduled. Ignore the special ExitSU node. |
