The pass is structured to drain an instruction worklist and perform a
sequence of operations on each popped instruction. Extract that
sequence of operations into a new processInstruction function. This
enables testing the sequence on a single instruction.
This sequence consists of
* removing redundant `address_to_pointer`-`pointer_to_address` pairs (see the sketch below)
* optimizing an `index_raw_pointer` with a manually computed stride to `index_addr`
* removing or increasing the alignment based on an `assumeAlignment` builtin
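For illustration, a minimal sketch of the pair removal (values and types are hypothetical): all uses of %2 are rewritten to use %0 directly, after which both conversion instructions become dead and are deleted.
```
%1 = address_to_pointer %0 : $*T to $Builtin.RawPointer
%2 = pointer_to_address %1 : $Builtin.RawPointer to $*T
```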
This is a big code cleanup, but it also has some functional differences for the `address_to_pointer`-`pointer_to_address` pair removal:
* It's not done if the resulting SIL would contain a (detectable) use-after-dealloc_stack memory lifetime failure (see the sketch below).
* It's not done if `copy_value`s would have to be inserted or borrow scopes would have to be extended to comply with ownership rules (this was the task of the OwnershipRAUWHelper).
Inserting copies is bad anyway.
Extending borrow scopes would only be required if the original lifetime of the pointer extends beyond a borrow scope - which shouldn't happen in safe code. Therefore this is a very rare case which is not worth handling.
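A sketch of the first bail-out condition (hypothetical SIL): replacing %3 with %1 would create a use of the stack slot after its dealloc_stack - a detectable memory lifetime failure - so the pair is kept.
```
%1 = alloc_stack $T
%2 = address_to_pointer %1 : $*T to $Builtin.RawPointer
dealloc_stack %1 : $*T
%3 = pointer_to_address %2 : $Builtin.RawPointer to $*T
%4 = load %3
```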
Canonicalize a `fix_lifetime` of an address to a `load` + `fix_lifetime`:
```
%1 = alloc_stack $T
...
fix_lifetime %1
```
->
```
%1 = alloc_stack $T
...
%2 = load %1
fix_lifetime %2
```
This transformation is done for `alloc_stack` and `store_borrow` (which always has an `alloc_stack` operand).
The benefit of this transformation is that it enables other optimizations, like mem2reg.
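For example, once the `fix_lifetime` no longer uses the address, mem2reg can promote the stack slot away entirely; a hypothetical end result:
```
// the alloc_stack is gone; fix_lifetime now operates directly on
// the SSA value %0 that was originally stored to the stack slot
fix_lifetime %0
```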
This peephole optimization was already done in SILCombine, but that version didn't handle `store_borrow` - a good opportunity to turn it into an instruction simplification.
This is part of fixing regressions when enabling OSSA modules:
rdar://140229560
* Remove dead `load_borrow` instructions (replaces the old peephole optimization in SILCombine)
* If the `load_borrow` is followed by a `copy_value`, combine both into a `load [copy]` (see the sketch below)
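A minimal sketch of the second simplification (hypothetical values):
```
%1 = load_borrow %0
%2 = copy_value %1
end_borrow %1
```
->
```
%2 = load [copy] %0
```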
Enable KeyPath/AnyKeyPath/PartialKeyPath/WritableKeyPath in Embedded Swift, but
for compile-time use only:
- Add keypath optimizations into the mandatory optimizations pipeline
- Allow keypath optimizations to look through begin_borrow, to make them work
even in OSSA.
- If a use of a KeyPath doesn't optimize away, diagnose in PerformanceDiagnostics
- Make UnsafePointer.pointer(to:) transparent to allow the keypath optimization
to happen in the callers of UnsafePointer.pointer(to:).
This is phase-1 of switching from llvm::Optional to std::optional in the
next rebranch. llvm::Optional was removed from upstream LLVM, so we need
to migrate off rather soon. On Darwin, std::optional and llvm::Optional
have the same layout, so we don't need to be as concerned about ABI
beyond the name mangling. `llvm::Optional` is only returned from one
function:
```
getStandardTypeSubst(StringRef TypeName,
                     bool allowConcurrencyManglings);
```
It's the return value, so it should not impact the mangling of the
function, and the layout is the same as `std::optional`, so it should be
mostly okay. This function doesn't appear to have users, and the ABI
was already broken two years ago for concurrency without anyone seeming
to notice, so this should be "okay".
I'm doing the migration incrementally so that folks working on main can
cherry-pick back to the release/5.9 branch. Once 5.9 is done and locked
away, then we can go through and finish the replacement. Since `None`
and `Optional` show up in contexts where they are not `llvm::None` and
`llvm::Optional`, I'm preparing the work now by going through and
removing the namespace unwrapping and making the `llvm` namespace
explicit. This should make it fairly mechanical to go through and
replace llvm::Optional with std::optional, and llvm::None with
std::nullopt. It's also a change that can be brought onto the
release/5.9 branch with minimal impact. This should be an NFC change.
Previously, the utility bailed out on lexical lifetimes because it
didn't respect deinit barriers. Here, deinit barriers are found and
added to liveness if the value is lexical. This enables copies to be
propagated without hoisting destroys over deinit barriers.
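A sketch, assuming %v is a lexical owned value and the apply is a deinit barrier:
```
%1 = copy_value %v
destroy_value %v
%2 = apply %f() : $@convention(thin) () -> ()
destroy_value %1
```
->
```
%2 = apply %f() : $@convention(thin) () -> ()
destroy_value %v
```
The copy is propagated away, but because the barrier was added to liveness, the remaining destroy stays below the apply instead of being hoisted above it.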
rdar://104630103
* split the `PassContext` into multiple protocols and structs: `Context`, `MutatingContext`, `FunctionPassContext` and `SimplifyContext`
* change how instruction passes work: implement the `simplify` function in conformance to `SILCombineSimplifyable`
* add a mechanism to add a callback for inserted instructions
And a few other small related changes:
* remove libswiftPassInvocation from SILInstructionWorklist (because it's not needed)
* replace start/finishPassRun with start/finishFunction/InstructionPassRun
NFC
Required before fixing/re-enabling OSSA RAUW utilities.
Make sure the SILCombine worklist canonicalizes all copies and
guarantees termination.
Run canonicalization once on every existing copy_value
and once for every new copy_value added during SILCombine.
Only add copies and their uses back to the worklist if
canonicalization deleted an instruction.
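A sketch of the per-copy canonicalization (hypothetical SIL): uses of %1 are rewritten to use %0 and the dead copy/destroy pair is deleted; only this deletion triggers re-adding %0 and its uses to the worklist.
```
%1 = copy_value %0
%2 = apply %f(%1) : $@convention(thin) (@guaranteed T) -> ()
destroy_value %1
```
->
```
%2 = apply %f(%0) : $@convention(thin) (@guaranteed T) -> ()
```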
Add tracing for sinking forwarding instructions.
ARC operations don't have an effect on immortal objects, like the empty array singleton or statically allocated arrays.
Therefore we can freely remove retain/release instructions on such objects, even if there is no paired balancing ARC operation.
This optimization can only be done with a minimum deployment target of Swift 5.1, because that version introduced immortal ref-count bits.
The optimization is implemented in libswift. Additionally, the remaining logic for simplifying strong_retain and strong_release is also ported to libswift.
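A sketch with a statically allocated array buffer (names are hypothetical):
```
%0 = global_value @static_array_buffer : $_ContiguousArrayStorage<Int>
strong_retain %0 : $_ContiguousArrayStorage<Int>
```
->
```
%0 = global_value @static_array_buffer : $_ContiguousArrayStorage<Int>
// the retain is removed even though no matching release is removed:
// the object is immortal, so unbalanced ARC operations have no effect
```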
rdar://81482156
* unify FunctionPassContext and InstructionPassContext
* add a modification API: PassContext.setOperand
* automatic invalidation notifications when the SIL is modified
And fix the way it handles borrow scopes so we can enable borrow
scope rewriting. Make sure SILCombine only does canonicalization that
operates on a self-contained single-value-lifetime. It's important to
limit SILCombine to transformations where each individual step
converges quickly to a more canonical form. Rewriting borrow scopes
requires the copy propagation pass to coordinate all the individual
transformations.
Make canonicalizeLifetimes a SILCombine utility. This moves complexity
out of the main loop. SILCombine knows which values it wants to
canonicalize and can directly call either canonicalizeValueLifetime or
canonicalizeFunctionArgument for each one.
Respect the -enable/disable-copy-propagation options.
Instruction passes are essentially visit functions in SILCombine for a specific instruction type.
With the SWIFT_INSTRUCTION_PASS macro, such a pass can be declared in Passes.def.
SILCombine then calls the pass's run function in libswift.
Track in-use iterators and update them both when instructions are
deleted and when they are added.
Safe iteration in the presence of arbitrary changes now looks like
this:
for (SILInstruction *inst : deleter.updatingRange(&bb)) {
  modify(inst);
}
Fix innumerable latent bugs with iterator invalidation and callback invocation.
Removes dead code earlier and chips away at all the redundant copies the compiler generates.
To be more explicit, canonicalizeOSSALifetimes is a utility that
re-canonicalizes, all at once, a set of defs that the caller found by
applying CanonicalizeOSSALifetime::getCanonicalCopiedDef(copy). The
reason I am doing this is that when we RAUW in OSSA, we sometimes
insert additional copies to make the problem easier for a utility to
handle. This lets us canonicalize away any such copies before we even
leave the pass.
Without this, when constructing an InstModCallback it is hard to
distinguish which closure is meant for which operation when passed to
the constructor of InstModCallback (if this were in Swift, we could use
argument labels, but we do not have such things in C++).
This new value-type-style formulation makes it unambiguous which
callback is used for what when constructing one of these.
Instead, put the archetype->instruction map into SILModule.
SILOpenedArchetypesTracker tried to maintain and reconstruct the mapping locally, e.g. during a use of SILBuilder.
Having a "global" map in SILModule makes the whole logic _much_ simpler.
I wonder why we didn't do this in the first place.
This requires that opened archetypes must be unique in a module - which makes sense. This was the case anyway, except for keypath accessors (which I fixed in the previous commit) and in some sil test files.
Instead, make `findJointPostDominatingSet` a stand-alone function.
There is no need to keep the temporary SmallVector alive across multiple calls of findJointPostDominatingSet for the purpose of re-using malloc'ed memory. The worklist usually contains far fewer elements than its small size.
The key point is that, while all of these modify the branches of the
CFG, they do not invalidate block-level CFG analyses like dominance and
dead-end blocks.
All of the non-SILCombiner specific helpers have already been updated for OSSA,
so this was not too bad.
NOTE: I also added two small combines that delete copy_value and
destroy_value instructions with .none arguments. I added these because
they are a pretty small addition and many of the tests of this code
rely on SILCombine being able to eliminate such operations on
thin_to_thick_function.
NOTE: I also disabled TypePropagation in OSSA; we are going to redo
that code when we bring up opaque values.
In OSSA, we enforce that addresses derived from interior pointer
instructions are scoped within a borrow scope. This means that it is
invalid to use such an address outside of its parent borrow scope, and
as a result one cannot simply RAUW an address value with a dominating
address value, since the latter may not be valid at the former's
location. I foresee that I am going to have to solve this problem, so I
decided to write this API to handle the vast majority of cases.
The way this API works is that it:
1. Computes an access path with base for the new value. If we do not
have both a base value and a valid access path, we bail.
2. Then we check if our base value is the result of an interior pointer
instruction. If it isn't, we are immediately done and can RAUW without further
delay.
3. If we do have an interior pointer instruction, we see if the
immediate guaranteed value we projected from has a single
borrow-introducing value. If not, we bail. I think this is reasonable
since, with time, all guaranteed values will have only a single
borrow-introducing value (once struct, tuple, destructure_struct,
destructure_tuple become reborrows).
4. Then we gather up all inner uses of our access path. If for some reason that
fails, we bail.
5. Then we see if all of those uses are within our borrow scope. If so, we can
RAUW without any further worry.
6. Otherwise, we perform a copy+borrow of our interior pointer's
operand value at the interior pointer, create a copy of the interior
pointer instruction upon this new borrow, and then RAUW oldValue with
that instead. By construction, all uses of oldValue will be within this
new interior pointer's scope (see the sketch below).
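A sketch of step 6 (hypothetical SIL), where oldValue's uses extend past the original borrow scope:
```
%b = begin_borrow %0 : $C
%new = ref_element_addr %b : $C, #C.f
end_borrow %b : $C
// ... oldValue and its uses are down here, outside %b's scope
```
->
```
%c = copy_value %0 : $C
%b2 = begin_borrow %c : $C
%new2 = ref_element_addr %b2 : $C, #C.f
// oldValue is RAUWed with %new2; by construction all of its former
// uses lie inside the new scope, which is ended (and %c destroyed)
// after the last of them
```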
b644c80f90fb7099ec956bb44065b50e432c5146 caused all owned forwarding
instructions to be sunk to their uses in exactly the cases where we can
eliminate the parent forwarding instruction (namely, when the value we
want to fold has no non-debug, non-consuming users). So I was able to
implement this with a single-basic-block algorithm that works via a
planner struct using said canonicalization. One initializes the planner
struct with the instruction that is going to either be eliminated or
have its forwarding operand set. Then one adds each of the individual
chains that lead to the use we wish to fold, each time checking that we
can eliminate the instruction.
Once the user has added all of the intermediate forwarding
instructions, by construction (see the paragraph above) we know we can
optimize. So we eliminate all intermediate values and then, depending
on whether the user called the set-value or the replace-value method,
we either set front's operand to the passed-in value or RAUW/erase
front with that value. Note that until one of those two operations is
performed, front's operand is undef, so exactly one of them must be
performed.
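A minimal single-block sketch in the shorthand used below: the intermediate upcast's only non-debug use is the consuming second upcast, so the planner deletes it and sets front's operand directly.
```
%0 = upcast %x : $X->Y
%1 = upcast %0 : $Y->Z
destroy_value %1
```
->
```
%1 = upcast %x : $X->Z
destroy_value %1
```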
There are a bunch of optimizations in SILCombine where we try to fold an
ownership forwarding instruction A into another ownership forwarding instruction
B without deleting A. Consider the upcasts in the example below:
```
%0 = upcast %x : $X->Y
%1 = upcast %0 : $Y->Z
```
These sorts of optimizations fold the first instruction into the second like so:
```
%0 = upcast %x : $X->Y
%1 = upcast %x : $X->Z
```
This creates a problem when we are dealing with owned values since we have just
introduced two consumes for %x. To work around this, we have two options:
1. Introduce extra copies.
2. We recognize the situations where we can guarantee that we can delete the
first upcast.
The first choice, I believe, is not really a choice, since breaking a
forwarding chain of ownership in favor of extra copies is a less
canonical form. That leaves us with the second option. What are the
necessary/sufficient conditions for deleting the first upcast? Simply
that the upcast cannot have any non-debug, non-consuming uses! In such
a case, we know that along all paths through the program the value has
exactly one non-debug use: one of its consuming uses. If, when
optimizing upcasts, we could recognize that pattern, we could duplicate
the instruction along paths not through our second upcast and thus
delete the original upcast, fixing the ownership error!
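A sketch of that pattern (hypothetical blocks): %0's only uses are consuming, so the first upcast can be duplicated into bb2 and the original deleted.
```
bb0:
  %0 = upcast %x : $X->Y
  cond_br %c, bb1, bb2
bb1:
  %1 = upcast %0 : $Y->Z
  ...
bb2:
  destroy_value %0
  ...
```
->
```
bb0:
  cond_br %c, bb1, bb2
bb1:
  %1 = upcast %x : $X->Z
  ...
bb2:
  %0 = upcast %x : $X->Y
  destroy_value %0
  ...
```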
While this is all nice and good, there is a problem: it doesn't scale.
As I was writing a few optimizations like this, I began to note that I
had to write different versions of the same helper for many of the
visitors (they generally varied in how many forwarding instructions
they looked through).
As I pondered the above, I chatted a bit with @atrick and during our
conversation we both realized that it is much easier to solve this
problem in one block, and that the condition above would allow us to
sink these instructions into the same block. Thus, if we check for this
condition and canonicalize the IR to sink these instructions before
visiting, we can use a single helper to handle all of these cases.