swift-mirror

mirror of https://github.com/apple/swift.git synced 2025-12-21 12:14:44 +01:00

Author	SHA1	Message	Date
Michael Gottesman	97c73768af	Merge pull request #35253 from gottesmm/pr-7d30d56fcab9d42f3788104b987f62df27540e00 [sil-inst-opt] Improve performance of InstModCallbacks by eliminating indirect call along default callback path.	2021-01-04 15:25:53 -08:00
Michael Gottesman	0de00d1ce4	[sil-inst-opt] Improve performance of InstModCallbacks by eliminating indirect call along default callback path. Specifically before this PR, if a caller did not customize a specific callback of InstModCallbacks, we would store a static default std::function into InstModCallbacks. This means that we always would have an indirect jump. That is unfortunate since this code is often called in loops. In this PR, I eliminate this problem by: 1. I made all of the actual callback std::function in InstModCallback private and gave them a "Func" postfix (e.g.: deleteInst -> deleteInstFunc). 2. I created public methods with the old callback names to actually call the callbacks. This ensured that as long as we are not escaping callbacks from InstModCallback, this PR would not result in the need for any source changes since we are changing a call of a std::function field to a call to a method. 3. I changed all of the places that were escaping inst mod's callbacks to take an InstModCallback. We shouldn't be doing that anyway. 4. I changed the default value of each callback in InstModCallbacks to be a nullptr and changed the public helper methods to check if a callback is null. If the callback is not null, it is called, otherwise the getter falls back to an inline default implementation of the operation. All together this means that the cost of a plain InstModCallback is reduced and one pays the indirect function call cost only as one customizes it further, which scales better. P.S. as a little extra thing, I added a madeChange field onto the InstModCallback. Now that we have the helpers calling the callbacks, I can easily insert instrumentation like this, allowing for users to pass in InstModCallback and see if anything was RAUWed without needing to specify a callback.	2021-01-04 12:51:55 -08:00
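The pattern described in commit 0de00d1ce4 can be sketched roughly as follows. This is a minimal illustration, not the actual InstModCallbacks implementation: `Inst`, `ModCallbacks`, and `hadChange` are hypothetical stand-ins, only one callback is modeled, and the `madeChange` flag is set on any deletion here, whereas the commit describes it in terms of RAUW.

```cpp
#include <functional>
#include <utility>

// Hypothetical stand-in for an instruction; not the real SILInstruction.
struct Inst {
  int id = 0;
  bool erased = false;
};

// Sketch of a callback bundle whose callbacks are empty by default.
class ModCallbacks {
  // Private and empty unless the caller customizes it ("Func" postfix as in
  // the commit message).
  std::function<void(Inst *)> deleteInstFunc;
  // Instrumentation flag; in this sketch it records any deletion.
  bool madeChange = false;

public:
  ModCallbacks() = default;
  explicit ModCallbacks(std::function<void(Inst *)> deleteFn)
      : deleteInstFunc(std::move(deleteFn)) {}

  // Public method with the old callback name. Callers keep writing
  // callbacks.deleteInst(inst), so call sites need no source changes.
  void deleteInst(Inst *inst) {
    madeChange = true;
    if (deleteInstFunc) {
      deleteInstFunc(inst); // customized path: pays the indirect call
      return;
    }
    inst->erased = true; // default path: inline, no indirect call
  }

  bool hadChange() const { return madeChange; }
};
```

A default-constructed `ModCallbacks` never touches a `std::function` target, so only callers that customize a callback pay for the indirection.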
Andrew Trick	cead6a5122	Add an OptimizedMandatoryCombine pass variant. It's against the principles of pass design to check the driver mode within the pass. A pass always needs to do the same thing regardless of where it runs in the pass pipeline. It also needs to be possible to test passes in isolation.	2021-01-01 19:22:19 -08:00
Meghana Gupta	b99533aced	Merge pull request #34895 from meg-gupta/cseossa Enable CSE on OSSA	2020-12-23 09:58:46 -08:00
Meghana Gupta	42c031985c	Enable CSE on OSSA	2020-12-22 23:20:06 -08:00
Alex Hoppen	fd932a6b42	[SILOpt] Fix build by only accessing seenUse in non-assert builds	2020-12-19 12:23:58 +01:00
Andrew Trick	bab19976a6	Add a PrunedLiveness utility. This bare-bones utility will be the basis for CanonicalizeOSSALifetime. It is maximally flexible and can be adopted by any analysis that needs SSA-based liveness expressed in terms of the live blocks. It's meant to be layered underneath various higher-level analyses. We could consider revamping ValueLifetimeAnalysis and layering it on top of this. If PrunedLiveness is adopted widely enough, we can combine it with a block numbering analysis so we can micro-optimize the internal data structures.	2020-12-18 18:49:59 -08:00
Meghana Gupta	db24e3e94c	Fix crash in EpilogueArcAnalysis EpilogueARCState for unreachable blocks can be non-existent. Fix the EpilogueARCContext::getState method and its users.	2020-12-17 13:39:22 -08:00
Meghana Gupta	81107e4235	Improve handling of copy_value and destroy_value in MemoryBehaviorVisitor (#35011) - Also, compute use points for destroy_value - Cleanup explicit checks for refcount instructions in RLE	2020-12-14 22:44:58 -08:00
Richard Wei	8d8614058b	[AutoDiff] NFC: Replace 'SILAutoDiffIndices' with 'AutoDiffConfig'. (#35079) Resolve rdar://71678394 / SR-13889.	2020-12-14 14:32:40 -08:00
Michael Gottesman	1ca55774b2	Merge pull request #34559 from gottesmm/ossa-inst-simplify [inst-simplify] Update for OSSA	2020-12-09 14:31:15 -08:00
Michael Gottesman	259d2bb182	[ownership] Commit a generic replaceAllUsesAndEraseFixingOwnership api and enable SimplifyInstruction on OSSA. This is a generic API that when ownership is enabled allows one to replace all uses of a value with a value with a differing ownership by transforming/lifetime extending as appropriate. This API supports all pairings of ownership /except/ replacing a value with OwnershipKind::None with a value without OwnershipKind::None. This is a more complex optimization that we do not support today. As a result, we include on our state struct a helper routine that callers can use to know if the two values that they want to process can be handled by the algorithm. My motivation is to use this to update InstSimplify and SILCombiner in a less bug prone way rather than just turn stuff off. Noting that this transformation inserts ownership instructions, I have made sure to test this API in two ways: 1. With Mandatory Combiner alone (to make sure it works period). 2. With Mandatory Combiner + Semantic ARC Opts to make sure that we can eliminate the extra ownership instructions it inserts. As one can see from the tests, the optimizer today is able to handle all of these transforms except one conditional case where I need to eliminate a dead phi arg. I have a separate branch that hits that today but I have exposed unsafe behavior in ClosureLifetimeFixup that I need to fix first before I can land that. I don't want that to stop this PR since I think the current low level ARC optimizer may be able to help me here since this is a simple transform it does all of the time.	2020-12-09 11:53:56 -08:00
Erik Eckstein	9e43f493f3	GenericSpecializer: use an alternative mangling if the function has re-abstracted resilient type parameters. If the specialized function has a re-abstracted (= converted from indirect to direct) resilient argument or return types, use an alternative mangling: "TB" instead of "Tg". Resilient parameters/returns can be converted from indirect to direct if the specialization is created within the type's resilience domain, i.e. in its module (where the type is loadable). In this case we need to generate a different mangled name for the specialized function to distinguish it from specializations in other modules, which cannot re-abstract this resilient type. This fixes a miscompile resulting from ODR-linking specializations from different modules, which in fact have different function signatures. https://bugs.swift.org/browse/SR-13900 rdar://71914016	2020-12-07 17:23:46 +01:00
Michael Gottesman	8914ba60fd	Merge pull request #34959 from gottesmm/pr-98cbb22de5b33dbf86eecf90b5c5adcda4a81c8d [cfgoptutils] Add a new overload of addNewEdgeValueToBranch that takes an InstModCallback.	2020-12-04 14:50:26 -08:00
Michael Gottesman	16b63b15f8	[cfgoptutils] Add a new overload of addNewEdgeValueToBranch that takes an InstModCallback. I reimplemented the original addNewEdgeValueToBranch to just call the new overload with a default InstModCallbacks, so nothing changed and now we can plug in callbacks to this utility!	2020-12-04 01:07:07 -08:00
Erik Eckstein	423169ce5c	SILOptimizer: update alias analysis in TempRValueOpt and TempLValueOpt When instructions are changed within a pass in a way that affects subsequent alias queries in the same pass run, their alias analysis information must be invalidated. Otherwise it can result in miscompiles and/or invalid SIL. rdar://71924430	2020-12-03 13:53:57 +01:00
Michael Gottesman	b13a8e9ba3	Merge pull request #34915 from gottesmm/forwarding-silinstruction [ownership] Centralize all info about SILInstruction forwarding in the SILInstruction class hierarchy itself.	2020-12-01 21:33:27 -08:00
Michael Gottesman	8d479f1ff6	[autodiff] Change getTangentStoredProperty() to use a Projection instead of FieldIndexCacheBase. This is NFCI. This is in preparation for making FieldIndexCacheBase a templated subclass.	2020-11-30 18:16:11 -08:00
Richard Wei	de2dbe57ed	[AutoDiff] Bump-pointer allocate pullback structs in loops. (#34886 ) In derivatives of loops, no longer allocate boxes for indirect case payloads. Instead, use a custom pullback context in the runtime which contains a bump-pointer allocator. When a function contains a differentiated loop, the closure context is a `Builtin.NativeObject`, which contains a `swift::AutoDiffLinearMapContext` and a tail-allocated top-level linear map struct (which represents the linear map struct that was previously directly partial-applied into the pullback). In branching trace enums, the payloads of previously indirect cases will be allocated by `swift::AutoDiffLinearMapContext::allocate` and stored as a `Builtin.RawPointer`.	2020-11-30 15:49:38 -08:00
Michael Gottesman	25ebb5d763	[autodiff] When asserts are enabled, verify all autodiff compiler generated functions. This ensures that any invalid SIL generated by these cloners is caught immediately at the source when asserts are enabled improving productivity.	2020-11-29 23:44:31 -08:00
Michael Gottesman	b78a64985f	Merge pull request #34755 from gottesmm/pr-c948d27bcce9be4feb87ece1fc46b74931415542 [value-lifetime] Cleanup constructors.	2020-11-16 01:06:46 -08:00
Michael Gottesman	7718bd1fed	[value-lifetime] Cleanup constructors.	2020-11-15 16:56:31 -08:00
Michael Gottesman	d2de176264	[sil][value-lifetime] Add ValueLifetimeAnalysis::FrontierImpl = SmallVectorImpl<SILInstruction > Otherwise, one is always forced to use ValueLifetimeAnalysis::Frontier, a SmallVector<SILInstruction , 4>. This may not be a size appropriate for every problem, so it makes sense to provide Frontier as a good rule of thumb, but use FrontierImpl on the actual API boundary to loosen the constraint if the user wishes to do so.	2020-11-15 16:41:47 -08:00
Andrew Trick	c2b13cdd51	Merge pull request #34635 from atrick/verify-critedge Verify non-critical edges in OSSA	2020-11-09 08:59:08 -08:00
eeckstein	56928ba851	Merge pull request #34593 from eeckstein/optimize_hte [concurrency] SILOptimizer: optimize hop_to_executor instructions.	2020-11-09 09:23:46 +01:00
Andrew Trick	34c48f1ee2	Add isNonCriticalEdge fast check for a specific edge.	2020-11-08 21:34:24 -08:00
Andrew Trick	903697675b	Fix EagerSpecializer to avoid critical edges.	2020-11-06 08:31:23 -08:00
Ben Barham	7cee600bcd	[SILGen] Add flag to skip typechecking and SIL gen for function bodies Adds a new flag "-experimental-skip-all-function-bodies" that skips typechecking and SIL generation for all function bodies (where possible). `didSet` functions are still typechecked and have SIL generated as their body is checked for the `oldValue` parameter, but are not serialized. Parsing will generally be skipped as well, but this isn't necessarily the case since other flags (eg. "-verify-syntax-tree") may force delayed parsing off.	2020-11-06 12:08:19 +10:00
Erik Eckstein	a47ebabe54	[concurrency] SILOptimizer: optimize hop_to_executor instructions. * Redundant hop_to_executor elimination: if a hop_to_executor is dominated by another hop_to_executor with the same operand, it is eliminated: hop_to_executor %a ... // no suspension points hop_to_executor %a // can be eliminated * Dead hop_to_executor elimination: if a hop_to_executor is not followed by any code which needs to run on its actor's executor, it is eliminated: hop_to_executor %a ... // no instructions which need to run on %a return rdar://problem/70304809	2020-11-05 18:48:22 +01:00
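The redundant-hop rule in commit a47ebabe54 can be illustrated on a single straight-line block. This is a simplified sketch under stated assumptions: `Kind`, `Op`, and `removeRedundantHops` are hypothetical names, instructions are modeled as plain records, and dominance across multiple blocks (which the real pass has to consider) is not handled.

```cpp
#include <string>
#include <vector>

// Hypothetical instruction model: a hop to an executor, a suspension point,
// or any other instruction.
enum class Kind { Hop, Suspend, Other };

struct Op {
  Kind kind;
  std::string operand; // executor operand for Hop, unused otherwise
};

// Drops a hop when an earlier hop with the same operand reaches it without
// an intervening suspension point (single block only).
std::vector<Op> removeRedundantHops(const std::vector<Op> &block) {
  std::vector<Op> out;
  std::string current;  // operand of the most recent hop still in effect
  bool known = false;   // false until a hop is seen, and after a suspension
  for (const Op &op : block) {
    switch (op.kind) {
    case Kind::Hop:
      if (known && current == op.operand)
        continue; // already on this executor: skip the redundant hop
      current = op.operand;
      known = true;
      break;
    case Kind::Suspend:
      known = false; // after a suspension the executor is no longer known
      break;
    case Kind::Other:
      break;
    }
    out.push_back(op);
  }
  return out;
}
```

For the sequence `hop %a; other; hop %a; suspend; hop %a`, only the second hop is removed, because the third follows a suspension point.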
Meghana Gupta	483321c360	Enable ArrayElementValuePropagation on ownership SIL	2020-11-04 11:54:47 -08:00
Andrew Trick	223ee10939	EdgeThreadingCloner. Remove splitCriticalEdges calls.	2020-11-03 01:40:00 -08:00
Andrew Trick	3128eae3f0	Add NestedSemanticFunctionCheck diagnostic to check for improperly nested '@_semantic' functions. Add a missing @_semantics("array.init") in ArraySlice found by the diagnostic. Distinguish between array.init and array.init.empty. Categorize the types of semantic functions by how they affect the inliner and pass pipeline, and centralize this logic in PerformanceInlinerUtils. The ultimate goal is to prevent inlining of "Fundamental" @_semantics calls and @_effects calls until the late pipeline where we can safely discard semantics. However, that requires significant pipeline changes. In the meantime, this change prevents the situation from getting worse and makes the intention clear. However, it has no significant effect on the pass pipeline and inliner.	2020-10-26 17:02:33 -07:00
Andrew Trick	6f2cda1390	Add AccessUseVisitor and cleanup related APIs. Add AccessedStorage::compute and computeInScope to mirror AccessPath. Allow recovering the begin_access for Nested storage. Adds AccessedStorage.visitRoots().	2020-10-16 15:00:10 -07:00
Andrew Trick	b2d1ac1631	Add AccessPathVerification pass and run it in the pipeline.	2020-10-16 15:00:10 -07:00
Andrew Trick	cc0aa2f8b8	Add an AccessPath abstraction and formalize memory access Things that have come up recently but are somewhat blocked on this: - Moving AccessMarkerElimination down in the pipeline - SemanticARCOpts correctness and improvements - AliasAnalysis improvements - LICM performance regressions - RLE/DSE improvements Begin to formalize the model for valid memory access in SIL. Ignoring ownership, every access is a def-use chain in three parts: object root -> formal access base -> memory operation address AccessPath abstracts over this path and standardizes the identity of a memory access throughout the optimizer. This abstraction is the basis for a new AccessPathVerification. With that verification, we now have all the properties we need for the type of analysis required for exclusivity enforcement, but now generalized for any memory analysis. This is suitable for an extremely lightweight analysis with no side data structures. We currently have a massive amount of ad-hoc memory analysis throughout SIL, which is incredibly unmaintainable, bug-prone, and not performance-robust. We can begin taking advantage of this verifiably complete model to solve that problem. The properties this gives us are: Access analysis must be complete over memory operations: every memory operation needs a recognizable valid access. An access can be unidentified only to the extent that it is rooted in some non-address type and we can prove that it is at least not part of an access to a nominal class or global property. Pointer provenance is also required for future IRGen-level bitfield optimizations. Access analysis must be complete over address users: for an identified object root all memory accesses including subobjects must be discoverable. Access analysis must be symmetric: use-def and def-use analysis must be consistent. AccessPath is merely a wrapper around the existing accessed-storage utilities and IndexTrieNode. 
Existing passes already very successfully use this approach, but in an ad-hoc way. With a general utility we can: - update passes to use this approach to identify memory access, reducing the space and time complexity of those algorithms. - implement an inexpensive on-the-fly, debug mode address lifetime analysis - implement a lightweight debug mode alias analysis - ultimately improve the power, efficiency, and maintainability of full alias analysis - make our type-based alias analysis sensitive to the access path	2020-10-16 15:00:10 -07:00
Andrew Trick	92a181671e	Fix ValueTracking isUniquelyIdentified to use AccessedStorage. To clarify and unify logic, improve precision, and behave consistently with other code that does the same thing.	2020-10-16 15:00:09 -07:00
Andrew Trick	85ff15acd3	Add indexTrieRoot to the SILModule to share across Analyses. ...and avoid reallocation. This is immediately necessary for LICM, in addition to its current uses. I suspect this could be used by many passes that work with addresses. RLE/DSE should absolutely migrate to it.	2020-10-16 15:00:09 -07:00
Arnold Schwaighofer	b994bf3191	Add support for `_specialize(exported: true, ...)` This attribute allows defining a pre-specialized entry point of a generic function in a library. The following definition provides a pre-specialized entry point for `genericFunc(_:)` for the parameter type `Int` that clients of the library can call. ``` @_specialize(exported: true, where T == Int) public func genericFunc<T>(_ t: T) { ... } ``` Pre-specializations of internal `@inlinable` functions are allowed. ``` @usableFromInline internal struct GenericThing<T> { @_specialize(exported: true, where T == Int) @inlinable internal func genericMethod(_ t: T) { } } ``` There is syntax to pre-specialize a method from a different module. ``` import ModuleDefiningGenericFunc @_specialize(exported: true, target: genericFunc(_:), where T == Double) func prespecialize_genericFunc(_ t: T) { fatalError("dont call") } ``` Specially marked extensions allow for pre-specialization of internal methods across module boundaries (respecting `@inlinable` and `@usableFromInline`). ``` import ModuleDefiningGenericThing public struct Something {} @_specializeExtension extension GenericThing { @_specialize(exported: true, target: genericMethod(_:), where T == Something) func prespecialize_genericMethod(_ t: T) { fatalError("dont call") } } ``` rdar://64993425	2020-10-12 09:19:29 -07:00
Erik Eckstein	d4a6bd39b6	SILOptimizer: improve MemBehavior for apply instructions. 1. Do a better alias analysis for "function-local" objects, like alloc_stack and inout parameters 2. Fully support try_apply and begin/end/abort_apply So far we fully relied on escape analysis. But escape analysis has some shortcomings with SIL address-types. Therefore, handle two common cases, alloc_stack and inout parameters, with alias analysis. This gives better results. The biggest change here is to do a quick check if the address escapes via an address_to_pointer instruction.	2020-10-09 20:54:58 +02:00
Erik Eckstein	aced5c74df	SILOptimizer: Remove InspectionMode from MemBehaviorVisitor The InspectionMode was never set to anything other than "IgnoreRetains"	2020-10-09 20:54:58 +02:00
Meghana Gupta	0a21c4d96f	Fix another use-after-free in SILCombine (#34168 ) * Fix another use-after-free in SILCombine swift::endLifetimeAtFrontier also needs to use swift::emitDestroyOperation and delete instructions via callbacks that can correctly remove it from the worklist that SILCombine maintains * Add test for use-after-free in SILCombine	2020-10-06 13:37:05 -07:00
Meghana Gupta	f4bbafb392	Revert "[PassManager] Update PassManager's function worklist for newly added SILFunctions"	2020-10-05 10:23:31 -07:00
Joe Groff	95f1bd3bf8	Merge pull request #34142 from jckarter/async-await-sil-instructions SIL: Add instructions to represent async suspend points.	2020-10-02 13:19:49 -07:00
Meghana Gupta	e2a9bf2009	Fix use-after-free in SILCombine (#34145 ) SILCombine maintains a worklist of instructions and deleting of instructions is valid only via callbacks that remove them from the worklist as well. It calls swift::tryDeleteDeadClosure which in turn calls SILBuilder apis like emitStrongRelease/emitReleaseValue/emitDestroyValue which can delete instructions via SILInstruction::eraseFromParent leaving behind a stale entry in SILCombine's worklist causing a crash. This PR adds swift::emitDestroyOperation which correctly calls the appropriate InstModCallbacks on added/removed instructions. This comes from swift::releasePartialApplyCapturedArg which was handling creation of destroys with callbacks correctly.	2020-10-01 20:57:40 -07:00
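The failure mode described in commit e2a9bf2009 (an instruction freed while a worklist still points at it) and the callback-based fix can be sketched as follows. This is an illustrative model only: `Inst`, `Worklist`, and `destroyInst` are hypothetical names and do not reflect the real SILCombine worklist or the signature of swift::emitDestroyOperation.

```cpp
#include <cstddef>
#include <functional>
#include <unordered_map>
#include <vector>

// Hypothetical stand-in for an instruction.
struct Inst {
  int id;
};

// Worklist that supports removing an entry before the instruction is freed.
class Worklist {
  std::vector<Inst *> items;
  std::unordered_map<Inst *, std::size_t> index; // instruction -> slot

public:
  void add(Inst *inst) {
    if (index.count(inst))
      return; // already queued
    index[inst] = items.size();
    items.push_back(inst);
  }

  // Null out the slot so a later pop() never returns a dangling pointer.
  void erase(Inst *inst) {
    auto it = index.find(inst);
    if (it == index.end())
      return;
    items[it->second] = nullptr;
    index.erase(it);
  }

  // Returns the next live instruction, or nullptr when the list is empty.
  Inst *pop() {
    while (!items.empty()) {
      Inst *inst = items.back();
      items.pop_back();
      if (inst) {
        index.erase(inst);
        return inst;
      }
    }
    return nullptr;
  }
};

// Deletion helper that notifies the owner of the worklist before freeing,
// instead of erasing the instruction directly.
void destroyInst(Inst *inst, const std::function<void(Inst *)> &onDelete) {
  onDelete(inst);
  delete inst;
}
```

Routing every deletion through a helper that takes the callback means utilities called from the pass cannot leave a stale entry behind, which is the property the commit restores.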
Joe Groff	a664a33b52	SIL: Add instructions to represent async suspend points. `get_async_continuation[_addr]` begins a suspend operation by accessing the continuation value that can resume the task, which can then be used in a callback or event handler before executing `await_async_continuation` to suspend the task.	2020-10-01 14:21:52 -07:00
Meghana Gupta	163d47ec90	Revert "Revert #33106 and #33205 " (#34106 )	2020-09-28 23:08:14 -07:00
Meghana Gupta	77a76a8422	Revert "Merge pull request #33205 from meg-gupta/ometofunctionpass" This reverts commit `8dbac48c18`, reversing changes made to `c22ba90700`.	2020-09-25 11:49:52 -07:00
Meghana Gupta	9c9a8ef224	Allow OME to run mandatorily	2020-09-22 18:02:04 -07:00
Michael Gottesman	646fcf6678	Merge pull request #33754 from gottesmm/pr-d1a9d022618a039a807ff68c0d46d898e8fe5578 [opt-remark] When looking for debug_value users, look modulo RC Identity preserving users.	2020-09-04 11:04:26 -07:00
Dan Zheng	1d72ec9bad	[AutoDiff] NFC: improve debug logging. (#33793 ) Print value/instruction with context in non-differentiability error debug log.	2020-09-03 20:18:52 -07:00

1024 Commits