Usually `explicit_copy_addr` and `explicit_copy_value` don't survive until the first run of the SILCombine pass anyway.
But if they do, their simplifications need to be registered; otherwise SILCombine will complain.
This is needed in Embedded Swift because the `witness_method` convention requires passing the witness table to the callee.
However, the witness table is not necessarily available.
A witness table is only generated if an existential value of a protocol is created.
This is a rare situation because only witness thunks have the `witness_method` convention, and those thunks are created as "transparent" functions, which means they are always inlined (after de-virtualization of a witness method call).
However, inlining - even of transparent functions - can fail for various reasons.
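As a hedged illustration (the protocol, class, and function names below are made up, not from the change itself): in Embedded Swift, a program like the following never creates an existential of `P`, so no witness table is emitted; if inlining of the transparent witness thunk fails, a direct `witness_method` call survives.
```
// Hypothetical example: no existential of P is created, so no witness
// table for C: P is emitted. The call t.foo() devirtualizes to C's
// transparent witness thunk; if inlining it fails, a direct call with
// witness_method convention remains, which this pass then specializes.
protocol P { func foo() }
final class C: P { func foo() {} }

func callFoo<T: P>(_ t: T) { t.foo() }

callFoo(C())
```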
This change adds a new EmbeddedWitnessCallSpecialization pass:
If a function with the `witness_method` convention is called directly, the function is specialized by changing the convention to `method`, and the call is replaced by a call to the specialized function:
```
%1 = function_ref @callee : $@convention(witness_method: P) (@guaranteed C) -> ()
%2 = apply %1(%0) : $@convention(witness_method: P) (@guaranteed C) -> ()
...
sil [ossa] @callee : $@convention(witness_method: P) (@guaranteed C) -> () {
...
}
```
->
```
%1 = function_ref @$e6calleeTfr9 : $@convention(method) (@guaranteed C) -> ()
%2 = apply %1(%0) : $@convention(method) (@guaranteed C) -> ()
...
// specialized callee
sil shared [ossa] @$e6calleeTfr9 : $@convention(method) (@guaranteed C) -> () {
...
}
```
Fixes a compiler crash
rdar://165184147
It eliminates dead access scopes if they do not conflict with other scopes.
Removes:
```
%2 = begin_access [modify] [dynamic] %1
... // no uses of %2
end_access %2
```
However, dead _conflicting_ access scopes are not removed.
If a conflicting scope becomes dead because an optimization, e.g. redundant-load-elimination, removed a load, it is still important to get an access violation at runtime.
Even a propagated value of a redundant load from a conflicting scope is undefined.
```
%2 = begin_access [modify] [dynamic] %1
store %x to %2
%3 = begin_access [read] [dynamic] %1 // conflicting with %2!
%y = load %3
end_access %3
end_access %2
use(%y)
```
After redundant-load-elimination:
```
%2 = begin_access [modify] [dynamic] %1
store %x to %2
%3 = begin_access [read] [dynamic] %1 // now dead, but still conflicting with %2
end_access %3
end_access %2
use(%x) // propagated from the store, but undefined here!
```
In this case the scope `%3` is not removed because it's important to get an access violation error at runtime before the undefined value `%x` is used.
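A hedged Swift-level analog of this situation (variable and function names are made up):
```
// Hypothetical source pattern: `value` is passed inout (a dynamic
// [modify] access) and read again inside the callee (a dynamic [read]
// access). This exclusivity violation must still trap at runtime, even
// if the optimizer eliminates the conflicting load itself.
var value = 0

func conflict(_ a: inout Int) {
    a = 1
    print(value)  // conflicting read while `value` is exclusively accessed
}

conflict(&value)  // traps with a dynamic exclusivity violation
```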
This pass considers potentially conflicting access scopes in called functions.
But it does not consider potentially conflicting accesses in callers (because it can't!).
However, optimizations like redundant-load-elimination can only do such transformations if the outer access scope is within the function, e.g.
```
bb0(%0 : $*T): // an inout from a conflicting scope in the caller
  store %x to %0
  %3 = begin_access [read] [dynamic] %1 // %1: some other address which may alias %0
  %y = load %3 // cannot be propagated because it cannot be proved that %1 is the same address as %0
  end_access %3
```
All those checks are only done for dynamic access scopes, because they matter for runtime exclusivity checking.
Dead static scopes are removed unconditionally.
It hoists `destroy_value` instructions for non-lexical values.
```
%1 = some_ownedValue
...
last_use(%1)
... // other instructions
destroy_value %1
```
->
```
%1 = some_ownedValue
...
last_use(%1)
destroy_value %1 // <- moved after the last use
... // other instructions
```
In contrast to non-mandatory optimization passes, this is the only pass which hoists destroys over deinit barriers.
This ensures consistent behavior in -Onone and optimized builds.
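A possible source-level illustration, assuming the temporary below is non-lexical (class and function names are made up):
```
// Hypothetical example: the temporary C() is never bound to a variable,
// so (assuming it is non-lexical) its destroy_value may be hoisted over
// the print call, a deinit barrier. This pass does that consistently,
// so the deinit runs at the same point at -Onone and -O.
class C {
    let x = 0
    deinit { print("deinit") }
}

func f() {
    _ = C().x      // last use of the temporary
    print("after") // deinit barrier; the destroy may be hoisted above it
}
```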
This is done by splitting the `begin_borrow` of the whole struct into individual borrows of the fields (for trivial fields no borrow is needed), and then sinking the `struct` instruction to its consuming use(s).
```
%3 = struct $S(%nonTrivialField, %trivialField) // owned
...
%4 = begin_borrow %3
%5 = struct_extract %4, #S.nonTrivialField
%6 = struct_extract %4, #S.trivialField
use %5, %6
end_borrow %4
...
end_of_lifetime %3
```
->
```
...
%5 = begin_borrow %nonTrivialField
use %5, %trivialField
end_borrow %5
...
%3 = struct $S(%nonTrivialField, %trivialField)
end_of_lifetime %3
```
This optimization is important for Array code where the Array buffer is constantly wrapped into structs and then extracted again to access the buffer.
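A rough Swift-level sketch of that pattern (the types here are made-up stand-ins for the real Array buffer types):
```
// Hypothetical sketch: a buffer reference is wrapped into a struct and
// immediately borrowed again to extract its fields. After the
// optimization, only the non-trivial `buffer` field needs a borrow, and
// the struct construction sinks to its consuming use (if any).
final class Buffer {}

struct Wrapper {
    var buffer: Buffer  // non-trivial field
    var count: Int      // trivial field
}

func use(_ b: Buffer, _ n: Int) {}

func f(buffer: Buffer, count: Int) {
    let w = Wrapper(buffer: buffer, count: count)
    use(w.buffer, w.count)  // borrow of w + field extractions
}
```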
Besides supporting OSSA, this change significantly simplifies the pass.
The main change is that instead of starting at a closure (e.g. `partial_apply`) and finding all call sites, we now start at a call site and look for closures in all arguments. This makes a lot of things much simpler, e.g. far fewer intermediate data structures are required to track all the states.
I needed to remove the 3 unit tests because the things those tests were testing don't exist anymore. However, the pass is exercised by a lot of SIL tests (and I added quite a few), which should give good test coverage.
The old ClosureSpecializer pass is still kept in place, because at that point in the pipeline we don't have OSSA yet. Once we have that, we can replace the old pass with the new one.
However, the autodiff closure specializer already runs in the OSSA pipeline and there the new changes take effect.
(old name: CapturePropagation)
The pass is now rewritten in Swift, which makes the code smaller and simpler.
Compared to the old pass it has two improvements:
* It can constant propagate whole structs (and not only builtin literals). This is important for propagating "real" Swift constants which have a struct type such as `Int`; see the sketch after this list.
* It constant propagates keypaths even if there are other non-constant closure captures which are not propagated. This is something the old pass didn't do.
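A minimal sketch of the first improvement, with made-up function names:
```
// Hypothetical example: the captured constant has struct type Int (a
// struct wrapping a builtin literal). The new pass can propagate it
// into a specialized closure body; the old pass handled only the
// builtin literal itself.
func runTwice(_ body: () -> Int) -> Int {
    return body() + body()
}

func caller() -> Int {
    let constant = 42             // a "real" Swift constant of struct type Int
    return runTwice { constant }  // the capture can be constant-propagated
}
```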
rdar://151185177
This allows moving many SIL APIs and utilities which require a context to the SIL module.
The SIL-part of SwiftPassInvocation is extracted into a base class SILContext which now lives in SIL.
Also: simplify the begin/end-pass functions of the SwiftPassInvocation.
We insert end_cow_mutation_addr for lifetime-dependent values that depend on mutable addresses.
end_cow_mutation_addr can be simplified to end_cow_mutation after other optimizations like inlining, specialization, etc.
This PR adds an instruction simplification to transform end_cow_mutation_addr to end_cow_mutation.
This can enable array optimizations which look for end_cow_mutation.
This pass replaces `alloc_box` with `alloc_stack` if the box is not escaping.
The original implementation had some limitations. It could not handle cases of local functions which are called multiple times or even recursively, e.g.
```
public func foo() -> Int {
  var i = 1
  func localFunction() { i += 1 }
  localFunction()
  localFunction()
  return i
}
```
The new implementation (done in Swift) fixes this problem with a new algorithm.
It's not only more powerful, but also simpler: the new pass has less than half the lines of code of the old pass.
The pass is invoked in the mandatory pipeline and later in the optimizer pipeline.
The new implementation provides a module pass for the mandatory pipeline (whereas the "regular" pass is a function pass).
This is required because the mandatory pass needs to remove the originals of specialized closures, which cannot be done from a function pass.
In the old implementation this was done with a hack by adding a semantic attribute and deleting the function later in the pipeline.
I still kept the sources of the old pass so that the compiler can be bootstrapped without a host compiler.
rdar://142756547
* re-implement the pass in Swift
* support alloc_stack live ranges which span multiple basic blocks
* support `load`-`store` pairs copying from the alloc_stack (in addition to `copy_addr`)
Those improvements help to reduce temporary stack allocations, especially for InlineArrays.
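A hedged illustration (assuming a toolchain that has `InlineArray`; the function is made up):
```
// Hypothetical example: the array literal is materialized in a temporary
// alloc_stack and copied out with load/store pairs. With support for
// multi-block live ranges and load/store pairs, the temporary can be
// eliminated.
func makeArray() -> InlineArray<4, Int> {
    let a: InlineArray<4, Int> = [1, 2, 3, 4]
    return a
}
```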
rdar://151606382
Introduce a new pass MandatoryTempRValueElimination, which works like the original TempRValueElimination, except that it does not remove alloc_stack instructions which are associated with source variables.
Running this pass at -Onone helps to reduce copies of large structs, e.g. InlineArrays or structs containing InlineArrays.
Copying large structs can be a performance problem, even at Onone.
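A hedged sketch of the distinction (all names are made up):
```
// Hypothetical example: the temporary copy created to pass `copy` to
// process(...) can be removed even at -Onone, while the alloc_stack
// backing the source variable `copy` itself is kept, preserving the
// debugging experience.
struct Big {
    var storage: InlineArray<256, Int>
}

func process(_ b: Big) {}

func f(_ b: Big) {
    let copy = b   // alloc_stack associated with a source variable: kept
    process(copy)  // temporary rvalue copy for the call: can be eliminated
}
```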
rdar://151629149
Reimplement the simplification in Swift and add a new transformation:
```
%1 = unchecked_addr_cast %0 : $*Builtin.FixedArray<N, Element> to $*Element
```
->
```
%1 = vector_base_addr %0 : $*Builtin.FixedArray<N, Element>
```
Besides cleaning up the source code, the motivation for the translation into Swift is to make it easier to improve the pass with some InlineArray-specific optimizations (though I'm not sure yet if we really need those).
Also, the new implementation doesn't contain the optimize-store-into-temp optimization anymore, because this is covered by redundant load elimination.
1. move embedded diagnostics out of the PerformanceDiagnostics pass. It was completely separate from the other logic in that pass anyway.
2. rewrite it in Swift
3. fix several bugs, i.e. missed diagnostics which led to IRGen crashes:
* look at all methods in witness tables, including base protocols and associated conformances
* visit all functions in the call tree, including generic functions with class bound generic arguments
* handle all instructions, e.g. concurrency builtins
4. improve error messages by adding meaningful call-site information. For example:
* if the error is in a specialized function, report where the generic function is originally specialized with concrete types
* if the error is in a protocol witness method, report where the existential is created
Fixes a false alarm in case of recursive calls with different type parameters.
For example:
```
protocol P {
  associatedtype E: P
}

func noRecursionMismatchingTypeArgs1<T: P>(_ t: T.Type) {
  if T.self == Int.self {
    return
  }
  noRecursionMismatchingTypeArgs1(T.E.self)
}
```
* Reimplement most of the logic in Swift as an instruction simplification and remove the old code from SILCombine
* support more cases of existential archetype replacements:
For example:
```
%0 = alloc_stack $any P
%1 = init_existential_addr %0, $T
use %1
```
is transformed to
```
%0 = alloc_stack $T
use %0
```
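A hedged source-level analog of this transformation (type names are made up):
```
// Hypothetical source pattern: the existential is initialized with a
// known concrete type and used only locally, so the alloc_stack $any P
// can be replaced by an alloc_stack $T.
protocol P {}
struct T: P {}

func use(_ p: any P) {}

func f() {
    let p: any P = T()  // alloc_stack $any P + init_existential_addr
    use(p)
}
```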
Also, if the alloc_stack is already an opened existential and the concrete type is known,
replace it as well:
```
%0 = metatype $@thick T.Type
%1 = init_existential_metatype %0, $@thick any P.Type
%2 = open_existential_metatype %1 : $@thick any P.Type to $@thick (@opened("X", P) Self).Type
...
%3 = alloc_stack $@opened("X", any P) Self
use %3
```
is transformed to
```
...
%3 = alloc_stack $T
use %3
```
If an apply uses an existential archetype (`@opened("...")`) and the concrete type is known, replace the existential archetype with the concrete type
1. in the apply's substitution map
2. in the arguments, e.g. by inserting address casts
For example:
```
%5 = apply %1<@opened("...")>(%2) : <τ_0_0> (τ_0_0) -> ()
```
->
```
%4 = unchecked_addr_cast %2 to $*ConcreteType
%5 = apply %1<ConcreteType>(%4) : <τ_0_0> (τ_0_0) -> ()
```
Replace an `unconditional_checked_cast` to an existential metatype with an `init_existential_metatype`, if the source is a conforming type.
Note that init_existential_metatype is better than unconditional_checked_cast because it does not need to do any runtime casting.
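A hedged source-level example (type and function names are made up):
```
// Hypothetical example: after specializing f for T = S, the source of
// the unconditional_checked_cast is statically known to conform to P,
// so it can be replaced by init_existential_metatype - no runtime
// casting is needed.
protocol P {}
struct S: P {}

func f<T>(_ t: T.Type) -> any P.Type {
    return t as! any P.Type  // unconditional_checked_cast to any P.Type
}

let meta = f(S.self)
```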
Which consists of
* removing redundant `address_to_pointer`-`pointer_to_address` pairs
* optimizing an `index_raw_pointer` of a manually computed stride to `index_addr` (see the sketch after this list)
* removing or increasing the alignment based on an "assumeAlignment" builtin
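A hedged sketch of the `index_raw_pointer` case (the function is made up):
```
// Hypothetical example: the manually computed byte stride lowers to an
// index_raw_pointer, which the simplification can rewrite to an
// index_addr on a typed address.
func element(_ base: UnsafeRawPointer, at i: Int) -> Int {
    return base.advanced(by: i * MemoryLayout<Int>.stride)
        .assumingMemoryBound(to: Int.self).pointee
}
```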
This is a big code cleanup but also has some functional differences for the `address_to_pointer`-`pointer_to_address` pair removal:
* It's not done if the resulting SIL would contain a (detectable) use-after-dealloc_stack memory lifetime failure.
* It's not done if `copy_value`s must be inserted or borrow-scopes must be extended to comply with ownership rules (this was the task of the OwnershipRAUWHelper).
Inserting copies is bad anyway.
Extending borrow scopes would only be required if the original lifetime of the pointer extends beyond a borrow scope, which shouldn't happen in safe code. Therefore this is a very rare case which is not worth handling.
`String(describing:)` does a bunch of dynamic casts
that can be pretty slow. Use interpolation instead,
which bypasses them.
For `swift-frontend`, this brings the time taken
for type-checking an empty file down from ~100ms
to ~70ms.
For `swift build`, this brings the time taken for
a null build down from ~600ms to ~450ms (the
larger delta is presumably due to the fact that
there's much more Swift code in `swift-package`).
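The change in a nutshell, as a representative sketch (not the exact diff):
```
let value: Any = 42  // a hypothetical value being stringified

// Before: String(describing:) performs several dynamic casts internally
// to find the best description.
let slow = String(describing: value)

// After: interpolation produces the same text for these call sites but
// bypasses those casts.
let fast = "\(value)"
```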
Canonicalize a `fix_lifetime` of an address to a `load` + `fix_lifetime`:
```
%1 = alloc_stack $T
...
fix_lifetime %1
```
->
```
%1 = alloc_stack $T
...
%2 = load %1
fix_lifetime %2
```
This transformation is done for `alloc_stack` and `store_borrow` (which always has an `alloc_stack` operand).
The benefit of this transformation is that it enables other optimizations, like mem2reg.
This peephole optimization was already done in SILCombine, but it didn't handle store_borrow.
A good opportunity to make an instruction simplification out of it.
This is part of fixing regressions when enabling OSSA modules:
rdar://140229560
* Remove dead `load_borrow` instructions (replaces the old peephole optimization in SILCombine)
* If the `load_borrow` is followed by a `copy_value`, combine both into a `load [copy]`
It hoists `destroy_value` instructions without shrinking an object's lifetime.
This is done if it can be proved that another copy of a value (either in an SSA value or in memory) keeps the referenced object(s) alive until the original position of the `destroy_value`.
```
%1 = copy_value %0
...
last_use_of %0
// other instructions
destroy_value %0 // %1 is still alive here
```
->
```
%1 = copy_value %0
...
last_use_of %0
destroy_value %0
// other instructions
```
The benefit of this optimization is that it can enable copy-propagation by moving destroys above deinit barriers and access scopes.
It removes a `copy_value` where the source is a guaranteed value, if possible:
```
%1 = copy_value %0 // %0 = a guaranteed value
// uses of %1
destroy_value %1 // borrow scope of %0 is still valid here
```
->
```
// uses of %0
```
This optimization is very similar to the LoadCopyToBorrow optimization.
Therefore I merged both optimizations into a single file and renamed it to "CopyToBorrowOptimization".