Specifically, before this PR, if a caller did not customize a specific callback
of InstModCallbacks, we would store a static default std::function into
InstModCallbacks. This meant that invoking the callback always involved an
indirect jump, which is unfortunate since this code is often called in loops.
In this PR, I eliminate this problem as follows:
1. I made all of the actual callback std::functions in InstModCallbacks private
and gave them a "Func" suffix (e.g., deleteInst -> deleteInstFunc).
2. I created public methods with the old callback names that actually call the
callbacks. This ensures that, as long as we are not escaping callbacks from
InstModCallbacks, this PR does not require any source changes, since we are
only changing a call of a std::function field into a call of a method.
3. I changed all of the places that were escaping individual callbacks to take
an InstModCallbacks instead. We shouldn't be doing that anyway.
4. I changed the default value of each callback in InstModCallbacks to be a
nullptr and changed the public helper methods to check whether a callback is
null. If the callback is not null, it is called; otherwise the helper falls
back to an inline default implementation of the operation.
Altogether, this means that the cost of a plain InstModCallbacks is reduced,
and one only pays the indirect-call price as one customizes it further, which
scales better.
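A condensed sketch of the resulting pattern (only one callback shown; names
other than deleteInst/deleteInstFunc are simplified, and the real
InstModCallbacks has more callbacks):
```
#include <functional>

class SILInstruction; // compiler-internal type, left opaque in this sketch
void eraseFromParent(SILInstruction *inst); // stands in for inst->eraseFromParent()

class InstModCallbacks {
  // Private std::function with a "Func" suffix, defaulted to nullptr rather
  // than to a static default std::function.
  std::function<void(SILInstruction *)> deleteInstFunc;

public:
  // Extra instrumentation described in the P.S. below.
  bool madeChange = false;

  InstModCallbacks() = default;
  InstModCallbacks(std::function<void(SILInstruction *)> f)
      : deleteInstFunc(std::move(f)) {}

  // Public helper with the old callback name, so existing call sites keep the
  // same spelling: a std::function field call becomes a method call.
  void deleteInst(SILInstruction *inst) {
    madeChange = true;
    if (deleteInstFunc) {
      deleteInstFunc(inst); // only customized callbacks pay the indirect call
      return;
    }
    eraseFromParent(inst); // inline default implementation of the operation
  }
};
```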
P.S. As a little extra thing, I added a madeChange field to InstModCallbacks.
Now that we have the helpers calling the callbacks, I can easily insert
instrumentation like this, allowing users to pass in an InstModCallbacks and
see if anything was RAUWed without needing to specify a callback.
This is a generic API that, when ownership is enabled, allows one to replace
all uses of a value with another value of differing ownership by transforming
or lifetime-extending as appropriate.
This API supports all pairings of ownership /except/ replacing a value that has
OwnershipKind::None with a value that does not have OwnershipKind::None. This
is a more complex optimization that we do not support today. As a result, we
include on our state struct a helper routine that callers can use to know
whether the two values that they want to process can be handled by the
algorithm.
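A minimal usage sketch of what a caller might look like; the type and member
names here (canFixUpOwnershipForRAUW, replaceAllUsesAndFixOwnership) are
illustrative placeholders, not the actual API spelling:
```
// Hypothetical spelling of the state struct's helper routine and entry point.
if (OwnershipFixupContext::canFixUpOwnershipForRAUW(oldValue, newValue)) {
  // Replaces all uses of oldValue with newValue, inserting copies/borrows and
  // lifetime extension as needed to keep ownership correct.
  ctx.replaceAllUsesAndFixOwnership(oldValue, newValue);
} else {
  // Unsupported pairing: oldValue has OwnershipKind::None and newValue does
  // not. Bail out rather than miscompile.
}
```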
My motivation is to use this to update InstSimplify and SILCombiner in a less
bug-prone way, rather than just turning things off.
Noting that this transformation inserts ownership instructions, I have made sure
to test this API in two ways:
1. With Mandatory Combiner alone (to make sure it works period).
2. With Mandatory Combiner + Semantic ARC Opts to make sure that we can
eliminate the extra ownership instructions it inserts.
As one can see from the tests, the optimizer today is able to handle all of
these transforms except one conditional case where I need to eliminate a dead
phi arg. I have a separate branch that handles that today, but it exposed
unsafe behavior in ClosureLifetimeFixup that I need to fix before I can land
it. I don't want that to block this PR, since I think the current low-level
ARC optimizer may be able to help me here, as this is a simple transform it
does all of the time.
When instructions are changed within a pass in a way that affects subsequent alias queries in the same pass run,
their alias analysis information must be invalidated.
Otherwise it can result in miscompiles and/or invalid SIL.
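A sketch of the pattern inside a SILFunctionTransform; the per-instruction
notification shown is a hypothetical hook standing in for whatever mechanism
the pass actually uses, while invalidateAnalysis is the usual end-of-pass call:
```
void run() /* override, inside a SILFunctionTransform */ {
  AliasAnalysis *AA = PM->getAnalysis<AliasAnalysis>();

  // ... the pass rewrites an instruction in a way that changes what it may
  // read or write, and will issue more alias queries afterwards ...
  SILInstruction *changed = rewriteSomething();

  // Hypothetical per-instruction hook: drop cached alias / memory-behavior
  // results that mention the changed instruction before querying AA again.
  AA->invalidateInstruction(changed);

  // And report the change to the pass manager at the end of the pass, so
  // analyses are recomputed for later passes too.
  invalidateAnalysis(SILAnalysis::InvalidationKind::Instructions);
}
```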
rdar://71924430
to check for improperly nested '@_semantics' functions.
Add a missing @_semantics("array.init") in ArraySlice found by the
diagnostic.
Distinguish between array.init and array.init.empty.
Categorize the types of semantic functions by how they affect the
inliner and pass pipeline, and centralize this logic in
PerformanceInlinerUtils. The ultimate goal is to prevent inlining of
"Fundamental" @_semantics calls and @_effects calls until the late
pipeline where we can safely discard semantics. However, that requires
significant pipeline changes.
In the meantime, this change prevents the situation from getting worse
and makes the intention clear. However, it has no significant effect
on the pass pipeline and inliner.
Add AccessedStorage::compute and computeInScope to mirror AccessPath.
Allow recovering the begin_access for Nested storage.
Add AccessedStorage::visitRoots().
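A small usage sketch under the naming in this change; the comments describe my
understanding of the intended semantics rather than documented behavior:
```
SILValue addr = load->getOperand();

// Identify the storage for this address, looking through any enclosing
// begin_access to the underlying object (mirrors AccessPath::compute).
AccessedStorage storage = AccessedStorage::compute(addr);

// Identify storage only up to the enclosing access scope; inside a
// begin_access this yields Nested storage, from which the begin_access marker
// itself can now be recovered.
AccessedStorage inScope = AccessedStorage::computeInScope(addr);
```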
Things that have come up recently but are somewhat blocked on this:
- Moving AccessMarkerElimination down in the pipeline
- SemanticARCOpts correctness and improvements
- AliasAnalysis improvements
- LICM performance regressions
- RLE/DSE improvements
Begin to formalize the model for valid memory access in SIL. Ignoring
ownership, every access is a def-use chain in three parts:
object root -> formal access base -> memory operation address
AccessPath abstracts over this path and standardizes the identity of a
memory access throughout the optimizer. This abstraction is the basis
for a new AccessPathVerification.
With that verification, we now have all the properties we need for the type of
analysis required for exclusivity enforcement, but now generalized for any
memory analysis. This is suitable for an extremely lightweight analysis with
no side data structures. We currently have a massive amount of ad-hoc memory
analysis throughout SIL, which is incredibly unmaintainable, bug-prone, and
not performance-robust. We can begin taking advantage of this verifiably
complete model to solve that problem.
The properties this gives us are:
Access analysis must be complete over memory operations: every memory
operation needs a recognizable valid access. An access can be
unidentified only to the extent that it is rooted in some non-address
type and we can prove that it is at least *not* part of an access to a
nominal class or global property. Pointer provenance is also required
for future IRGen-level bitfield optimizations.
Access analysis must be complete over address users: for an identified
object root all memory accesses including subobjects must be
discoverable.
Access analysis must be symmetric: use-def and def-use analysis must
be consistent.
AccessPath is merely a wrapper around the existing accessed-storage
utilities and IndexTrieNode. Existing passes already very successfully
use this approach, but in an ad-hoc way. With a general utility we
can:
- update passes to use this approach to identify memory access,
reducing the space and time complexity of those algorithms.
- implement an inexpensive on-the-fly, debug mode address lifetime analysis
- implement a lightweight debug mode alias analysis
- ultimately improve the power, efficiency, and maintainability of
full alias analysis
- make our type-based alias analysis sensitive to the access path
...and avoid reallocation.
This is immediately necessary for LICM, in addition to its current
uses. I suspect this could be used by many passes that work with
addresses. RLE/DSE should absolutely migrate to it.
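As a sketch of the intended use (AccessPath::compute comes from this change;
treating path equality as "same memory" is the conceptual model described
above, and the exact comparison API here is an assumption):
```
// Two memory operations on the same formal access base with the same
// subobject path address the same memory.
AccessPath storePath = AccessPath::compute(store->getDest());
AccessPath loadPath = AccessPath::compute(load->getOperand());

if (storePath.isValid() && storePath == loadPath) {
  // e.g. RLE could forward the stored value to the load, and LICM can key its
  // side tables on the path instead of ad-hoc projection walks.
}
```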
1. Do a better alias analysis for "function-local" objects, like alloc_stack and inout parameters
2. Fully support try_apply and begin/end/abort_apply
So far we fully relied on escape analysis. But escape analysis has some shortcomings with SIL address-types.
Therefore, handle two common cases, alloc_stack and inout parameters, with alias analysis.
This gives better results.
The biggest change here is to do a quick check whether the address escapes via an address_to_pointer instruction.
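A reduced sketch of that quick check, assuming a single-level walk (the real
code would look through address projections transitively):
```
// Return true if the address may escape by being converted to a raw pointer.
static bool mayEscapeViaAddressToPointer(SILValue addr) {
  for (Operand *use : addr->getUses()) {
    SILInstruction *user = use->getUser();
    if (isa<AddressToPointerInst>(user))
      return true;
    // Illustrative simplification: projections such as struct_element_addr or
    // tuple_element_addr would need to be followed recursively here.
  }
  return false;
}
```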
A key concept in late ARC optimization is "RC Identity". In short, a result of
an instruction is rc-identical to an operand of the instruction if one can
safely move a retain (release) from before the instruction on the result to one
after on the operand without changing the program semantics. This creates a
simple model where one can work on equivalence classes of rc-identical values
(using a dominating definition generally as the representative) and thus
optimize/pair retain, release.
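A minimal sketch of how that equivalence-class model is typically used,
assuming the RCIdentityFunctionInfo::getRCIdentityRoot entry point (treat the
exact spelling as an assumption):
```
// Map a retain and a release onto representatives of their rc-identity
// equivalence classes; if the representatives match, the pair operates on
// rc-identical values and is a candidate for pairing, subject to the usual
// dataflow checks in between.
SILValue retainedRoot = RCIA->getRCIdentityRoot(retain->getOperand(0));
SILValue releasedRoot = RCIA->getRCIdentityRoot(release->getOperand(0));
if (retainedRoot == releasedRoot) {
  // Candidate retain/release pair on the same equivalence class.
}
```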
When preparing for late ARC optimization, the optimizer will normalize aggregate
ARC operations (retain_value, release_value) into singular strong_retain,
strong_release operations on leaf types of the aggregate that are
non-trivial. As an example, a retain_value on a KlassPair would be canonicalized
into two strong_retain, one for the lhs and one for the rhs. When this is done,
the optimizer generally just creates new struct_extracts at the point where
the retain is. In such a case, the debug_value for the underlying value may
actually be on a reformed aggregate whose underlying parts we are retaining:
```
bb0(%0 : $Builtin.NativeObject):
  strong_retain %0 : $Builtin.NativeObject
  %1 = struct $Array(%0 : $Builtin.NativeObject, ...)
  debug_value %1 : $Array, ...
```
By looking through RC identical uses, we can handle a large subset of these
cases without much effort: ones where there is a single owning pointer, like
Array. To handle more complex cases, we would have to calculate an inverse
access path needed to get back to our value and somehow deal with all of the
complexity therein (I am sure we can do it; I just haven't thought through all
of the details).
The only interesting behavior that this results in is that when we emit
diagnostics, we just use the name of the debug_value found through
rc-identical transitive uses, without a projection path. This is because the
source location associated with that debug_value belongs to a separate value
that is rc-identical to the actual value that we visited during our opt-remark
traversal up the def-use graph. Consider the example below, noting the
comments in the SIL that illustrate what I attempted to explain above.
```
struct KlassPair {
  var lhs: Klass
  var rhs: Klass
}

struct StateWithOwningPointer {
  var state: TrivialState
  var owningPtr: Klass
}
sil @theFunction : $@convention(thin) () -> () {
bb0:
  %0 = apply %getKlassPair() : $@convention(thin) () -> @owned KlassPair
  // This debug_value's name can be combined...
  debug_value %0 : $KlassPair, name "myPair"
  // ... with the access path from the struct_extract here...
  %1 = struct_extract %0 : $KlassPair, #KlassPair.lhs
  // ... to emit a nice diagnostic that 'myPair.lhs' is being retained.
  strong_retain %1 : $Klass
  // In contrast, in the case below we rely on looking through rc-identity uses
  // to find the debug_value. In this case, the source info associated with the
  // debug_value (%5) is no longer associated with the underlying access path
  // we have been tracking upwards (%3 is in our access path list). Instead, we
  // know that the debug_value is rc-identical to whatever value we were
  // originally tracking up (%3), and thus the correct identifier to use is the
  // name of the identifier alone (without an access path), since that source
  // identifier must be some value in the source that by itself is rc-identical
  // to whatever is being manipulated. If we were to emit the access path here
  // for an rc-identical use, we would get "myAdditionalState.owningPtr", which
  // is misleading since ArrayWrapperWithMoreState does not have a field named
  // 'owningPtr'; its subfield array does. Emitting just the name is still
  // correct because rc-identity means that a retain_value on the value
  // carrying the debug_value is equivalent to a retain of the access path
  // value we found by walking up the def-use graph from our strong_retain's
  // operand.
  %2 = apply %getStateWithOwningPointer() : $@convention(thin) () -> @owned StateWithOwningPointer
  %3 = struct_extract %2 : $StateWithOwningPointer, #StateWithOwningPointer.owningPtr
  strong_retain %3 : $Klass
  %4 = struct $Array(%3 : $Klass, ...)
  %5 = struct $ArrayWrapperWithMoreState(%4 : $Array, %moreState : $MoreState)
  debug_value %5 : $ArrayWrapperWithMoreState, name "myAdditionalState"
}
```
* Remove NewInsts from ARCSequenceOpts
* Remove more instances of InsertPts
* Address comments from #33504
* Make bottom-up loop traversal simpler. Use better APIs
* Update LoopRegion printer with more info
For use outside access enforcement passes.
Add isUniquelyIdentifiedAfterEnforcement.
Rename functions for clarity and generality.
Rename isUniquelyIdentifiedOrClass to isFormalAccessBase.
Rename findAccessedStorage to identifyFormalAccess.
Rename findAccessedStorageNonNested to findAccessedStorage.
Part of generalizing the utility for use outside the access
enforcement passes.
`DifferentiableFunctionInst` now stores result indices.
`SILAutoDiffIndices` now stores result indices instead of a source index.
`@differentiable` SIL function types may now have multiple differentiability
result indices and `@noDerivative` results.
`@differentiable` AST function types do not have `@noDerivative` results (yet),
so this functionality is not exposed to users.
Resolves TF-689 and TF-1256.
Infrastructural support for TF-983: supporting differentiation of `apply`
instructions with multiple active semantic results.
Used to "finalize" an array literal. It's not used yet, so this is NFC.
Also handle the "array.finalize_intrinsic" function in various array-specific optimizations.
The PassManager should transform all functions in bottom up order.
This is necessary because when optimizations like inlining look at the
callee function bodies to compute profitability, the callee functions
should have already undergone optimizations to get better profitability
estimates.
The PassManager builds its function worklist in bottom-up order on
initialization. However, newly created SILFunctions, e.g. due to
specialization, are simply appended to the function worklist. This
can cause us to make bad inlining decisions due to inaccurate
profitability estimates. This change updates the function worklist such
that all the callees of a newly added SILFunction are processed
before it by the PassManager.
Fixes rdar://52202680
FSO can handle self-recursive calls.
But this only works if the result of the self-recursive call is actually returned and not used otherwise.
The check for this was missing.
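A reduced sketch of the missing check, assuming the self-recursive call is a
plain ApplyInst whose result must only feed returns:
```
// The result of the self-recursive call may only be returned; any other use
// would make the function-signature-optimized rewrite invalid.
static bool resultIsOnlyReturned(ApplyInst *selfRecursiveCall) {
  for (Operand *use : selfRecursiveCall->getUses()) {
    if (!isa<ReturnInst>(use->getUser()))
      return false;
  }
  return true;
}
```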
https://bugs.swift.org/browse/SR-12677
rdar://problem/62895040
The -sil-verify-all flag verifies analyses before and after a pass to
confirm correct invalidations. But if an analysis was never
constructed or invalidated under the current pass order,
it may never detect insufficient invalidations.
-sil-verify-force-analysis will force-construct an analysis so that we
can better check for insufficient invalidations.
It is also terribly slow compared to -sil-verify-all.
For functions which result in > 10000 nodes, just bail and don't compute the connection graph.
The node merging algorithm is quadratic and can result in significant compile times for very large functions.
rdar://problem/56268570
Differentiable activity analysis is a dataflow analysis which marks values in
a function as varied, useful, or active (both varied and useful).
Only active values need a derivative.
This is in preparation for other bug fixes.
Clarify the SIL utilities that return canonical address values for
formal access given the address used by some memory operation:
- stripAccessMarkers
- getAddressAccess
- getAccessedAddress
These are closely related to the code in MemAccessUtils.
Make sure passes use these utilities consistently so that
optimizations aren't defeated by normal variations in SIL patterns.
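A sketch of how a pass would use these consistently (taking the utility names
above at face value; the per-call comments are my reading of their intent, not
documented behavior):
```
SILValue addr1 = store->getDest();
SILValue addr2 = load->getOperand();

// Same address with access markers stripped.
SILValue withoutMarkers = stripAccessMarkers(addr1);

// Canonical address value for the formal access; comparing these instead of
// the raw operands keeps the optimization from being defeated by normal
// variations in SIL patterns.
if (getAccessedAddress(addr1) == getAccessedAddress(addr2)) {
  // Treat both operations as accessing the same formal storage.
}
```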
Create an isLetAddress() utility alongside these basic utilities to
make sure it is used consistently with the address corresponding to
formal access. When this query is used inconsistently, it defeats
optimization. It can also cause correctness bugs because some
optimizations assume that 'let' initialization is only performed on a
unique address value.
Functional changes to Memory Behavior:
- An instruction with side effects now conservatively still has side
effects even when the queried value is a 'let'. Let values are
certainly sensitive to side effects, such as the parent object being
deallocated.
- Return the correct MemBehavior for begin/end_access markers.
Changes:
* Allow optimizing partial_apply capturing opened existential: we didn't do this originally because it was complicated to insert the required alloc/dealloc_stack instructions at the right places. Now we have the StackNesting utility, which makes this easier.
* Support indirect-in parameters. Not super important, but why not? It's also easy to do with the StackNesting utility.
* Share code between dead closure elimination and the apply(partial_apply) optimization. It's a bit of refactoring and allowed eliminating some code which is not used anymore.
* Fix an ownership problem: We inserted copies of partial_apply arguments _after_ the partial_apply (which consumes the arguments).
* When replacing an apply(partial_apply) -> apply and the partial_apply becomes dead, avoid inserting copies of the arguments twice.
These changes don't have any immediate effect on our current benchmarks, but will allow eliminating curry thunks for existentials.
semantics attribute that is used by the top-level array initializer (in ArrayShared.swift),
which is the entry point used by the compiler to initialize arrays from array literals.
This initializer is early-inlined so that other optimizations can work on its body.
Fix DeadObjectElimination and ArrayCOWOpts optimization passes to work with this
semantics attribute in addition to "array.uninitialized", which they already use.
Refactor mapInitializationStores function from ArrayElementValuePropagation.cpp to
ArraySemantic.cpp so that the array-initialization pattern matching functionality
implemented by the function can be reused by other optimizations.
setPointsToEdge should assert that its target isn't already merged,
but now that we batch up multiple merge requests, it's fine to allow
the target to be scheduled-for-merge.
Many assertions have been recently added and tightened in order to
"discover" unexpected cases. There's nothing incorrect about how these
cases were handled, but they lack unit tests. In this case I still
haven't been able to reduce a test case. I'm continuing to work on
it, but don't want to further delay the fix.