Enable ThinLTO with incremental compilation. #53673

michaelwoerister · 2018-08-24T15:10:04Z

This is an updated version of #52309. This PR allows rustc to use (local) ThinLTO and incremental compilation at the same time. In theory this should allow for getting compile-time improvements for small changes while keeping the runtime performance of the generated code roughly the same as when compiling non-incrementally.

The difference to #52309 is that this version also caches the pre-LTO version of LLVM bitcode. This allows for another layer of caching:

if the module itself has changed, we have to re-codegen and re-optimize.
if the module itself has not changed, but a module it imported from during ThinLTO has, we don't need to re-codegen and don't need to re-run the first optimization phase. Only the second (i.e. ThinLTO-) optimization phase is re-run.
if neither the module itself nor any of its imports have changed then we can re-use the final, post-ThinLTO version of the module. (We might have to load its pre-ThinLTO version though so it's available for other modules to import from)

michaelwoerister · 2018-08-24T15:10:24Z

@bors try

bors · 2018-08-24T15:10:35Z

⌛ Trying commit ee14d4a5f2818fa96865ad03544716f2f124f32b with merge 2d7e52fcdbcbb66446270bc3f48b60f5ec50c119...

kennytm · 2018-08-24T15:37:51Z

@bors try

bors · 2018-08-24T15:39:10Z

⌛ Trying commit 1d0f3fd0ae87ccd0f8823e045ad675a05f2926c3 with merge 937c465de94cf2edd249c4355619eda8fa8616ee...

rust-highfive · 2018-08-24T16:30:10Z

The job x86_64-gnu-llvm-5.0 of your PR failed on Travis (raw log). Through arcane magic we have determined that the following fragments from the build log may contain information about the problem.

Click to expand the log.

[00:05:23] * memmap 
[00:05:23] some tidy checks failed
[00:05:23] 
[00:05:23] 
[00:05:23] command did not execute successfully: "/checkout/obj/build/x86_64-unknown-linux-gnu/stage0-tools-bin/tidy" "/checkout/src" "/checkout/obj/build/x86_64-unknown-linux-gnu/stage0/bin/cargo" "--no-vendor" "--quiet"
[00:05:23] 
[00:05:23] 
[00:05:23] failed to run: /checkout/obj/build/bootstrap/debug/bootstrap test src/tools/tidy
[00:05:23] Build completed unsuccessfully in 0:00:50
[00:05:23] Build completed unsuccessfully in 0:00:50
[00:05:23] make: *** [tidy] Error 1
[00:05:23] Makefile:79: recipe for target 'tidy' failed

The command "stamp sh -x -c "$RUN_SCRIPT"" exited with 2.
travis_time:start:00a20dae
$ date && (curl -fs --head https://google.com | grep ^Date: | sed 's/Date: //g' || true)
---
travis_time:end:3453066c:start=1535127707846023606,finish=1535127707855087437,duration=9063831
travis_fold:end:after_failure.3
travis_fold:start:after_failure.4
travis_time:start:2db993c0
$ ln -s . checkout && for CORE in obj/cores/core.*; do EXE=$(echo $CORE | sed 's|obj/cores/core\.[0-9]*\.!checkout!\(.*\)|\1|;y|!|/|'); if [ -f "$EXE" ]; then printf travis_fold":start:crashlog\n\033[31;1m%s\033[0m\n" "$CORE"; gdb -q -c "$CORE" "$EXE" -iex 'set auto-load off' -iex 'dir src/' -iex 'set sysroot .' -ex bt -ex q; echo travis_fold":"end:crashlog; fi; done || true
travis_fold:end:after_failure.4
travis_fold:start:after_failure.5
travis_time:start:09be6192
travis_time:start:09be6192
$ cat ./obj/build/x86_64-unknown-linux-gnu/native/asan/build/lib/asan/clang_rt.asan-dynamic-i386.vers || true
cat: ./obj/build/x86_64-unknown-linux-gnu/native/asan/build/lib/asan/clang_rt.asan-dynamic-i386.vers: No such file or directory
travis_fold:end:after_failure.5
travis_fold:start:after_failure.6
travis_time:start:01198ad8
$ dmesg | grep -i kill

I'm a bot! I can only do what humans tell me to, so if this was not helpful or you have suggestions for improvements, please ping or otherwise contact @TimNN. (Feature Requests)

alexcrichton · 2018-08-24T17:18:03Z

src/librustc_codegen_llvm/base.rs

+        Err(e) => {
+            let msg = format!("Error while trying to load ThinLTO import data \
+                               for incremental compilation: {}", e);
+            sess.fatal(&msg)


Should this perhaps fall back on returning a new ThinLTOImports instance? It seems like if a previous compiler is ctrl-c'd at just the right time it could poison future compilers to return this error message

Yes, good catch.

alexcrichton · 2018-08-24T17:24:01Z

src/librustc_codegen_llvm/back/write.rs

@@ -983,6 +1006,9 @@ pub fn start_async_codegen(tcx: TyCtxt,
        allocator_config.emit_bc_compressed = true;
    }

+    modules_config.emit_pre_thin_lto_bc =
+        need_pre_thin_lto_bitcode_for_incr_comp(sess);


Should this be swapped with save_temps above so -C save-temps always emits it?

foo.pre-thin-lto.bc should actually be exactly the same as foo.thin-lto-input.bc, so I didn't really make an effort to add it to the save-temps output. Do you think it's worth the trouble?

I see what you mean now. Yes, it should be swapped so we don't overwrite the value.

alexcrichton · 2018-08-24T17:28:38Z

src/librustc_codegen_llvm/back/write.rs

+            execute_copy_from_cache_work_item(cgcx, work_item, timeline)
+        }
+        work_item @ WorkItem::LTO(_) => {
+            execute_lto_work_item(cgcx, work_item, timeline)


It looks like each of these methods takes the bound work_item but quickly unwraps it, could instead they be bound in this match and the value passed in here?

(ping on this comment)

alexcrichton · 2018-08-24T17:34:11Z

src/librustc_codegen_llvm/back/lto.rs

+                len: module.data().len(),
+            });
+            serialized.push(module);
+            module_names.push(name);


This looks the same as the loop above, so could chain be used to process both in one go? The modules_to_optimize local variable looks like it can be hoisted above the loop too perhaps?

alexcrichton · 2018-08-24T17:38:14Z

src/librustc_codegen_llvm/back/lto.rs

+            // the cache instead of having been recompiled...
+            let current_imports = ThinLTOImports::from_thin_lto_data(data);
+
+            // ... so we load this additional information from the previous


I'm not sure I'm following what's going on here. Aren't all CGUs loaded into the ThinLTOData instance? Is this perhaps an older comment?

AFAIK when we redo ThinLTO we have to unconditionally load the ThinLTO buffer for all CGUs coming inas input, so I can't quite figure out where some would be missing, but you can likely enlighten me!

alexcrichton · 2018-08-24T17:39:59Z

src/librustc_codegen_llvm/back/lto.rs

+    pub fn save_to_file(&self, path: &Path) -> io::Result<()> {
+        use std::io::Write;
+        let file = File::create(path)?;
+        let mut writer = io::BufWriter::new(file);


For this and load_from_file below I'd imagine that these maps are pretty small (on the order of CGUs, not symbols) which probably means that we can reasonable hold the entire contents of this serialized file in memory. In that case it's probably much faster to read/write the file in one go (and do all other operations in memory)

alexcrichton · 2018-08-24T17:59:01Z

src/librustc_codegen_llvm/base.rs

+    No,
+    PreThinLto,
+    PostThinLto,
+    PostThinLtoButImportedFrom,


I'm slightly confused by this enum variant, but I think this is the same confusion that I had before perhaps?

If any CGU is either "no" reusable or pre-thin-lto, I think that means that all CGUs need to be loaded for the ThinLTO data collection stage.

In thinking about this as well, I think this function below may not work in general? I think we can only determine CGU post-thin-lto CGU reuse after the ThinLTO data is created, right? Put another way, I think the possible reuse states are:

Everything is green, nothing has changed.

All modified CGUs need to be re-codegen'd

Afterwards, ThinLTOData is created, using the cached ThinLTO buffers for unmodified CGUs and freshly created buffers for re-codegen'd CGUs.

Now there's a graph of CGU to CGUs-imported, as well as whether each CGU is red/green (green for cached, red for just codegen'd)

Any red CGU is re-thin-LTO'd.

Any green CGU which imports from a red CGU is re-thin-LTO'd

Here, before we create the ThinLTOData, I don't think we can determine that a green CGU only imports from other green CGUs? LLVM seems like it could do fancy things such as:

Let's have three CGUs, A, B, and C.

A/B are green and C is red

Previously, A imported from B and not C

Afterwards, though, A ends up importing from both B and C (for whatever reason)

I think that this classification below would mean that A is "as green as can be" but it actually needs to be re-thin-LTO'd?

I may also just be lost with the classification of names here...

alexcrichton · 2018-08-24T18:00:52Z

src/librustc_codegen_llvm/base.rs

+                });
+                true
+            }
+            CguReUsable::PostThinLtoButImportedFrom => {


I suppose to elaborate on my comment above, the way I expected this to work these two latter states wouldn't be possible. It seems like don't really need to handle the case that literally nothing changed as it's not so important. In that case we can assume something changed which means that everything will either be codegen'd or sent as a pre-thin-lto module to the backend. After the synchronization point we'd then make another decision about CGU reuse and such.

bors · 2018-08-24T18:14:49Z

☀️ Test successful - status-travis
State: approved= try=True

alexcrichton · 2018-08-24T18:44:30Z

@rust-timer build 937c465de94cf2edd249c4355619eda8fa8616ee

rust-timer · 2018-08-24T18:44:31Z

Success: Queued 937c465de94cf2edd249c4355619eda8fa8616ee with parent 57e13ba, comparison URL.

michaelwoerister · 2018-08-24T19:50:36Z

Yes, I can see one case where your A/B/C example would be handled sub-optimally: A references functions in both B and C, but in session 1 ThinLTO classifies no exported functions in C (and called from A) as potentially inlineable. Therefore the import data will show no edge from A to C. Then, in session 2, C is changed and now some function there has become small enough to be elegible for inlining. The algorithm in the PR would re-translate C (because it changed) but it would take the cached version of A since it has no edge to C. It would therefore not be able to inline functions from C into A although that might be possible now.

There are a couple of factors that somewhat lessen the negative effect:

If there is any potentially inlinable function in C that is called from A then an edge from A to C will exist and all functions in C will be covered because A will be included in the LTO phase.
If there is a function in C that is explicitly marked with #[inline] then changing that will invalidate A completely (as well as C) and both will be part of the LTO phase again.

That being said, deferring the classification to until after the index is built would solve the problem reliably (and is probably more in line with how the linker-plugin works). Unless I'm overlooking something, it shouldn't be too hard to implement it this way fortunately :) Good catch!

michaelwoerister · 2018-08-29T11:01:08Z

OK, so the perf results (which don't contain the proposed changes yet but should be kind of valid anyway) look better than last time:

Some cases profit a lot from incr. comp. (e.g. webrender, encoding, crates.io). The patched incremental: println case compiles 4-6 times faster than a non-incremental build. Many of the other cases are still twice as fast. If the runtime performance of the generated code is acceptable, that would be pretty good! Some crates though just hate incr. comp. it seems (yes, I'm looking at you style-servo).

michaelwoerister · 2018-08-31T13:22:13Z

@alexcrichton, I just pushed a commit that implements the algorithm as suggested by you. The code actually got simpler :) Let's see how it performs.

@bors try

bors · 2018-08-31T13:22:18Z

🔒 Merge conflict

This pull request and the master branch diverged in a way that cannot be automatically merged. Please rebase on top of the latest master branch, and let the reviewer approve again.

How do I rebase?

Assuming self is your fork and upstream is this repository, you can resolve the conflict following these steps:

git checkout incr-thinlto-2000 (switch to your branch)
git fetch upstream master (retrieve the latest master)
git rebase upstream/master -p (rebase on top of it)
Follow the on-screen instruction to resolve conflicts (check git status if you got lost).
git push self incr-thinlto-2000 --force-with-lease (update this PR)

You may also read Git Rebasing to Resolve Conflicts by Drew Blessing for a short tutorial.

Please avoid the "Resolve conflicts" button on GitHub. It uses git merge instead of git rebase which makes the PR commit history more difficult to read.

Sometimes step 4 will complete without asking for resolution. This is usually due to difference between how Cargo.lock conflict is handled during merge and rebase. This is normal, and you should still perform step 5 to update this PR.

Error message

warning: Cannot merge binary files: src/Cargo.lock (HEAD vs. heads/homu-tmp)
Auto-merging src/librustc/session/mod.rs
Auto-merging src/librustc/session/config.rs
Auto-merging src/Cargo.lock
CONFLICT (content): Merge conflict in src/Cargo.lock
Automatic merge failed; fix conflicts and then commit the result.

michaelwoerister · 2018-09-03T12:51:36Z

I think all nits should be addressed now. I added some info!() output that shows which CGUs are loaded from cache and which are re-compiled.

@bors r=alexcrichton

bors · 2018-09-03T12:51:36Z

📌 Commit 21d05f6 has been approved by alexcrichton

bors · 2018-09-03T14:00:03Z

⌛ Testing commit 21d05f6 with merge ee73f80...

…hton Enable ThinLTO with incremental compilation. This is an updated version of #52309. This PR allows `rustc` to use (local) ThinLTO and incremental compilation at the same time. In theory this should allow for getting compile-time improvements for small changes while keeping the runtime performance of the generated code roughly the same as when compiling non-incrementally. The difference to #52309 is that this version also caches the pre-LTO version of LLVM bitcode. This allows for another layer of caching: 1. if the module itself has changed, we have to re-codegen and re-optimize. 2. if the module itself has not changed, but a module it imported from during ThinLTO has, we don't need to re-codegen and don't need to re-run the first optimization phase. Only the second (i.e. ThinLTO-) optimization phase is re-run. 3. if neither the module itself nor any of its imports have changed then we can re-use the final, post-ThinLTO version of the module. (We might have to load its pre-ThinLTO version though so it's available for other modules to import from)

bors · 2018-09-03T16:31:28Z

☀️ Test successful - status-appveyor, status-travis
Approved by: alexcrichton
Pushing ee73f80 to master...

alexcrichton · 2018-09-03T17:34:18Z

😲

ljedrz · 2018-09-03T21:00:41Z

What happened to the perf? Plenty of stuff turned very red, is this expected?

nnethercote · 2018-09-04T03:38:45Z

Indeed, this had a calamitous effect on compile times for incremental opt builds, and I don't understand how this was deemed acceptable prior to landing. I think it should be backed out ASAP.

sentry-cli-opt
        avg: 254.9%     min: -0.0%      max: 972.3%
cargo-opt
        avg: 180.3%     min: 0.1%       max: 662.0%
syn-opt
        avg: 174.1%?    min: -2.4%?     max: 646.3%?
regex-opt
        avg: 108.3%     min: 0.1%       max: 326.0%
clap-rs-opt
        avg: 87.4%      min: 0.1%       max: 313.8%
regression-31157-opt
        avg: 78.0%      min: -1.2%      max: 262.6%
crates.io-opt
        avg: 71.9%      min: -0.2%      max: 243.2%
tokio-webpush-simple-opt
        avg: 65.9%      min: -0.1%      max: 156.1%
webrender-opt
        avg: 53.7%      min: 0.0%       max: 152.2%
hyper-opt
        avg: 40.4%      min: -0.0%      max: 119.7%
piston-image-opt
        avg: 19.5%      min: 0.0%       max: 50.0%
issue-46449-opt
        avg: 21.5%      min: -0.1%      max: 42.0%
ripgrep-opt
        avg: 14.6%      min: 0.1%       max: 39.5%
encoding-opt
        avg: 8.3%       min: 0.0%       max: 23.6%
inflate-opt
        avg: 9.1%?      min: 0.4%?      max: 18.2%?
html5ever-opt
        avg: 5.6%       min: -0.0%      max: 16.9% 
deeply-nested-opt
        avg: 5.7%       min: -0.1%      max: 16.6%
futures-opt
        avg: 2.4%       min: -0.1%      max: 8.9%
ucd-opt
        avg: 1.4%       min: -0.1%      max: 3.7%
keccak-opt
        avg: 1.1%       min: -0.1%      max: 3.2%
helloworld-opt
        avg: 1.2%       min: 0.2%       max: 2.6%

michaelwoerister · 2018-09-04T07:21:04Z

Yes, the effects on compile times were expected. Let me explain what's going on here: This PR enables a new combination of compiler settings (ThinLTO + incremental compilation) that we've wanted to have for years and that, as per the existing rules, is now selected as the default when doing optimized, incremental builds. The old behavior (optimized, incremental builds without the additional ThinLTO pass) is still available when compiling with -Clto=no and it's performance should not be affected by the changes in here. In fact, if you look at the non-incremental opt benchmarks, performance has gone up quite a bit in some cases (regex -1.5%, cargo -3.2%, crates.io -4.1%).

Without ThinLTO, incremental opt builds produce much slower code. In many cases benchmarks performed 2-3 times worse because of reduced IPO opportunities. If that code is fast enough for your needs, great, but there was no way we could make incremental compilation the default for optimized builds in Cargo. With ThinLTO enabled this might change. Once this is part of a nightly compiler, we'll test what runtime performance of code produced this way looks like; if it's close enough to non-incremental builds, we can make incr. comp. the default for opt builds in Cargo, giving compile time reductions of 50-85% for small changes!

Note that Cargo still defaults to non-incremental compilation for opt builds, so none of this will be visible to end users yet.

nnethercote · 2018-09-04T08:37:09Z

Huh, ok.

In fact, if you look at the non-incremental opt benchmarks, performance has gone up quite a bit in some cases (regex -1.5%, cargo -3.2%, crates.io -4.1%).

If you look at the perf results for just this PR, there are no improvements. (The few green entries are almost certainly noise, belonging to benchmarks that have high variance.)

michaelwoerister · 2018-09-04T09:21:07Z

If you look at the perf results for just this PR, there are no improvements.

Yeah, I was wondering why I hadn't seen those improvements in the try builds before :) That makes more sense anyway.

But it looks like script-servo-opt and style-servo-opt failed...

alexcrichton · 2018-09-05T16:09:52Z

@michaelwoerister hm oh I also just realized, this didn't actually add any tests? Would it be possible to add a few incremental + optimized tests to exercise these code paths? (I don't think we can really test it works without a disassembly and brittle tests), but we can at least try to run it through the ringer!

michaelwoerister · 2018-09-06T09:41:51Z

The existing incremental tests will actually test some of this when optimize-tests = true is set in config.toml. All tests are then compiled with ThinLTO, so we at least see if we get strange linker errors. I actually had to track down a few of them.

I'll think about how to test this some more. Maybe expand codegen tests a little so they properly deal with multiple CGUs? Or a run-make test if everything else fails...

alexcrichton · 2018-09-06T16:30:51Z

Oh nevermind then, carry on! So long as something broke when implementing this sounds like it's being exercised which is all I would look for :)

eddyb · 2018-09-08T11:32:03Z

src/librustc_codegen_llvm/llvm/ffi.rs

@@ -1622,6 +1626,11 @@ extern "C" {
        Data: &ThinLTOData,
        Module: &Module,
    ) -> bool;
+    pub fn LLVMRustGetThinLTOModuleImports(
+        Data: *const ThinLTOData,


This should be &ThinLTOData.

michaelwoerister · 2018-09-10T18:05:24Z

I do have a few ideas for 2 or 3 tests. I'll make a PR this week, if I get to it. It requires exposing the different caching levels to the test framework. That's a good idea anyway but it's not totally trivial because of that.

@alexcrichton

…nerics-for-incr-comp, r=alexcrichton incr.comp.: Don't automatically enable -Zshare-generics for incr. comp. builds. So far the compiler would automatically enable sharing of monomorphizations for incremental builds. That was OK because without (Thin)LTO this could have very little impact on the runtime performance of the generated code. However, since rust-lang#53673, ThinLTO and incr. comp. can be combined, so the trade-off is not as clear anymore. This PR removes the automatic tie between the two options. Whether monomorphizations are shared between crates or not now _only_ depends on the optimization level. r? @alexcrichton

…t-lto-imports, r=michaelwoerister save LTO import info and check it when trying to reuse build products Fix rust-lang#59535 Previous runs of LTO optimization on the previous incremental build can import larger portions of the dependence graph into a codegen unit than the current compilation run is choosing to import. We need to take that into account when we choose to reuse PostLTO-optimization object files from previous compiler invocations. This PR accomplishes that by serializing the LTO import information on each incremental build. We load up the previous LTO import data as well as the current LTO import data. Then as we decide whether to reuse previous PostLTO objects or redo LTO optimization, we check whether the LTO import data matches. After we finish with this decision process for every object, we write the LTO import data back to disk. ---- What is the scenario where comparing against past LTO import information is necessary? I've tried to capture it in the comments in the regression test, but here's yet another attempt from me to summarize the situation: 1. Consider a call-graph like `[A] -> [B -> D] <- [C]` (where the letters are functions and the modules are enclosed in `[]`) 2. In our specific instance, the earlier compilations were inlining the call to`B` into `A`; thus `A` ended up with a external reference to the symbol `D` in its object code, to be resolved at subsequent link time. The LTO import information provided by LLVM for those runs reflected that information: it explicitly says during those runs, `B` definition and `D` declaration were imported into `[A]`. 3. The change between incremental builds was that the call `D <- C` was removed. 4. That change, coupled with other decisions within `rustc`, made the compiler decide to make `D` an internal symbol (since it was no longer accessed from other codegen units, this makes sense locally). And then the definition of `D` was inlined into `B` and `D` itself was eliminated entirely. 5. The current LTO import information reported that `B` alone is imported into `[A]` for the *current compilation*. So when the Rust compiler surveyed the dependence graph, it determined that nothing `[A]` imports changed since the last build (and `[A]` itself has not changed either), so it chooses to reuse the object code generated during the previous compilation. 6. But that previous object code has an unresolved reference to `D`, and that causes a link time failure! ---- The interesting thing is that its quite hard to actually observe the above scenario arising, which is probably why no one has noticed this bug in the year or so since incremental LTO support landed (PR rust-lang#53673). I've literally spent days trying to observe the bug on my local machine, but haven't managed to find the magic combination of factors to get LLVM and `rustc` to do just the right set of the inlining and `internal`-reclassification choices that cause this particular problem to arise. ---- Also, I have tried to be careful about injecting new bugs with this PR. Specifically, I was/am worried that we could get into a scenario where overwriting the current LTO import data with past LTO import data would cause us to "forget" a current import. ~~To guard against this, the PR as currently written always asserts, at overwrite time, that the past LTO import-set is a *superset* of the current LTO import-set. This way, the overwriting process should always be safe to run.~~ * The previous note was written based on the first version of this PR. It has since been revised to use a simpler strategy, where we never attempt to merge the past LTO import information into the current one. We just *compare* them, and act accordingly. * Also, as you can see from the comments on the PR itself, I was quite right to be worried about forgetting past imports; that scenario was observable via a trivial transformation of the regression test I had devised.

…ts, r=mw save LTO import info and check it when trying to reuse build products Fix #59535 Previous runs of LTO optimization on the previous incremental build can import larger portions of the dependence graph into a codegen unit than the current compilation run is choosing to import. We need to take that into account when we choose to reuse PostLTO-optimization object files from previous compiler invocations. This PR accomplishes that by serializing the LTO import information on each incremental build. We load up the previous LTO import data as well as the current LTO import data. Then as we decide whether to reuse previous PostLTO objects or redo LTO optimization, we check whether the LTO import data matches. After we finish with this decision process for every object, we write the LTO import data back to disk. ---- What is the scenario where comparing against past LTO import information is necessary? I've tried to capture it in the comments in the regression test, but here's yet another attempt from me to summarize the situation: 1. Consider a call-graph like `[A] -> [B -> D] <- [C]` (where the letters are functions and the modules are enclosed in `[]`) 2. In our specific instance, the earlier compilations were inlining the call to`B` into `A`; thus `A` ended up with a external reference to the symbol `D` in its object code, to be resolved at subsequent link time. The LTO import information provided by LLVM for those runs reflected that information: it explicitly says during those runs, `B` definition and `D` declaration were imported into `[A]`. 3. The change between incremental builds was that the call `D <- C` was removed. 4. That change, coupled with other decisions within `rustc`, made the compiler decide to make `D` an internal symbol (since it was no longer accessed from other codegen units, this makes sense locally). And then the definition of `D` was inlined into `B` and `D` itself was eliminated entirely. 5. The current LTO import information reported that `B` alone is imported into `[A]` for the *current compilation*. So when the Rust compiler surveyed the dependence graph, it determined that nothing `[A]` imports changed since the last build (and `[A]` itself has not changed either), so it chooses to reuse the object code generated during the previous compilation. 6. But that previous object code has an unresolved reference to `D`, and that causes a link time failure! ---- The interesting thing is that its quite hard to actually observe the above scenario arising, which is probably why no one has noticed this bug in the year or so since incremental LTO support landed (PR #53673). I've literally spent days trying to observe the bug on my local machine, but haven't managed to find the magic combination of factors to get LLVM and `rustc` to do just the right set of the inlining and `internal`-reclassification choices that cause this particular problem to arise. ---- Also, I have tried to be careful about injecting new bugs with this PR. Specifically, I was/am worried that we could get into a scenario where overwriting the current LTO import data with past LTO import data would cause us to "forget" a current import. ~~To guard against this, the PR as currently written always asserts, at overwrite time, that the past LTO import-set is a *superset* of the current LTO import-set. This way, the overwriting process should always be safe to run.~~ * The previous note was written based on the first version of this PR. It has since been revised to use a simpler strategy, where we never attempt to merge the past LTO import information into the current one. We just *compare* them, and act accordingly. * Also, as you can see from the comments on the PR itself, I was quite right to be worried about forgetting past imports; that scenario was observable via a trivial transformation of the regression test I had devised.

michaelwoerister changed the title ~~Incr thinlto 2000~~ Enable ThinLTO with incremental compilation. Aug 24, 2018

This comment has been minimized.

Sign in to view

michaelwoerister force-pushed the incr-thinlto-2000 branch from ee14d4a to 1d0f3fd Compare August 24, 2018 15:35

kennytm added S-waiting-on-review Status: Awaiting review from the assignee but also interested parties. and removed S-waiting-on-author Status: This is awaiting some action (such as code changes or more information) from the author. labels Aug 24, 2018

alexcrichton reviewed Aug 24, 2018

View reviewed changes

TimNN added S-waiting-on-author Status: This is awaiting some action (such as code changes or more information) from the author. and removed S-waiting-on-review Status: Awaiting review from the assignee but also interested parties. labels Aug 28, 2018

michaelwoerister force-pushed the incr-thinlto-2000 branch from 1d0f3fd to 8fdf3e6 Compare August 31, 2018 13:19

michaelwoerister added 4 commits August 31, 2018 15:22

Provide a way of accessing the ThinLTO module import map in rustc.

2e587df

Persist ThinLTO import data in incr. comp. session directory.

d97d1e1

Make codegen not be a query (since it's not a real query anyway).

72c1993

Support local ThinLTO with incremental compilation.

64a738d

incr.ThinLTO: Do some cleanup and add some logging.

21d05f6

michaelwoerister force-pushed the incr-thinlto-2000 branch from 737f1ef to 21d05f6 Compare September 3, 2018 11:27

bors added S-waiting-on-bors Status: Waiting on bors to run and complete tests. Bors will change the label on completion. and removed S-waiting-on-author Status: This is awaiting some action (such as code changes or more information) from the author. labels Sep 3, 2018

bors merged commit 21d05f6 into rust-lang:master Sep 3, 2018

alexcrichton mentioned this pull request Sep 3, 2018

Make Vec derefing inlinable #52704

Closed

eddyb mentioned this pull request Sep 8, 2018

Tracking issue for MIR-only RLIBs #38913

Closed

eddyb reviewed Sep 8, 2018

View reviewed changes

michaelwoerister mentioned this pull request Sep 25, 2018

incr.comp.: Don't automatically enable -Zshare-generics for incr. comp. builds. #54557

Merged

michaelwoerister mentioned this pull request Oct 22, 2018

Tracking Issue for Incremental Compilation #47660

Open

32 tasks

alexcrichton mentioned this pull request Dec 18, 2018

Enable ThinLTO and incremental compilation #48996

Closed

pnkfelix mentioned this pull request Dec 4, 2019

save LTO import info and check it when trying to reuse build products #67020

Merged

Enable ThinLTO with incremental compilation. #53673

Enable ThinLTO with incremental compilation. #53673

Conversation

michaelwoerister commented Aug 24, 2018 • edited Loading

michaelwoerister commented Aug 24, 2018

bors commented Aug 24, 2018

This comment has been minimized.

kennytm commented Aug 24, 2018 • edited Loading

bors commented Aug 24, 2018

rust-highfive commented Aug 24, 2018

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

bors commented Aug 24, 2018

alexcrichton commented Aug 24, 2018

rust-timer commented Aug 24, 2018

michaelwoerister commented Aug 24, 2018

michaelwoerister commented Aug 29, 2018

michaelwoerister commented Aug 31, 2018

bors commented Aug 31, 2018

michaelwoerister commented Sep 3, 2018

bors commented Sep 3, 2018

bors commented Sep 3, 2018

bors commented Sep 3, 2018

alexcrichton commented Sep 3, 2018

ljedrz commented Sep 3, 2018 • edited Loading

nnethercote commented Sep 4, 2018

michaelwoerister commented Sep 4, 2018

nnethercote commented Sep 4, 2018

michaelwoerister commented Sep 4, 2018

alexcrichton commented Sep 5, 2018

michaelwoerister commented Sep 6, 2018

alexcrichton commented Sep 6, 2018

Choose a reason for hiding this comment

michaelwoerister commented Sep 10, 2018

michaelwoerister commented Aug 24, 2018 •

edited

Loading

kennytm commented Aug 24, 2018 •

edited

Loading

ljedrz commented Sep 3, 2018 •

edited

Loading