swift-mirror

mirror of https://github.com/apple/swift.git synced 2025-12-21 12:14:44 +01:00

Author	SHA1	Message	Date
Erik Eckstein	959e19d7bc	Add an optimization to eliminate a partial_apply if all applied arguments are dead in the applied function. This consists of 3 parts: 1) Extend CallerAnalysis to also provide information if a function is partially applied 2) A new DeadArgSignatureOpt pass, similar to FunctionSignatureOpts, which just specializes for dead arguments of partially applied functions. 3) Let CapturePropagation eliminate such partial_apply instructions and replace them with a thin_to_thick conversion of the specialized functions. This optimzation improves benchmarks where static struct or class functions are passed as a closure (e.g. -20% for SortStrings). Such functions have a additional metatype parameter. We used to create a partial_apply in this case, which allocates a context, etc. But this is not necessary as the metatype parameter is not used in most cases. rdar://problem/27513085	2016-08-23 07:32:41 -07:00
Xin Tong	34eadd43ee	Small refactoring in RRCM.	2016-08-09 21:44:53 -07:00
practicalswift	dbc6ec55b8	[gardening] Fix recently introduced "a" vs. "an" typos	2016-07-25 10:09:57 +02:00
Dmitri Gribenko	d82682ec6c	Merge pull request #3733 from practicalswift/typo-fixes-20160724 [gardening] Fix recently introduced typos.	2016-07-24 15:43:03 -07:00
practicalswift	7e89679404	[gardening] Fix recently introduced typos.	2016-07-24 22:32:40 +02:00
practicalswift	5c916c0e71	[gardening] "a" vs. "an"-fixes	2016-07-24 21:39:51 +02:00
Xin Tong	eaaf825032	Implement an iterative data flow to find epilogue retains or releases. We have a few places this analysis can be used. e.g. FSO, ASO, etc. I will wire them up one by one later. rdar://problem/26446587	2016-07-11 14:06:06 -07:00
Mark Lacey	f47cd8afd0	Fix leak in BasicCalleeAnalysis. Each time we delete the pass manager, we delete the analyses it holds. In this analysis we were holding onto memory that wasn't getting released when the analysis was deleted. Fixes rdar://problem/26245872.	2016-06-23 07:49:06 -07:00
practicalswift	8df3859ce7	[gardening] Fix recently introduced typos.	2016-06-05 11:11:44 +02:00
Xin Tong	35471ab345	Merge pull request #2310 from trentxintong/SFSO Simplify function signature optimzation and fix a memory leak in function signature.	2016-05-25 15:11:12 -07:00
Xin Tong	fb3eb0b646	Simplify function signature optimzation. Several functionalities have been added to FSO over time and the logic has become muddled. We were always looking at a static image of the SIL and try to reason about what kind of function signature related optimizations we can do. This can easily lead to muddled logic. e.g. we need to consider 2 different function signature optimizations together instead of independently. Split 1 single function to do all sorts of different analyses in FSO into several small transformations, each of which does a specific job. After every analysis, we produce a new function and eventually we collapse all intermediate thunks to in a single thunk. With this change, it will be easier to implement function signature optimization as now we can do them independently now. Small modifications to the test cases.	2016-05-25 11:12:27 -07:00
Xin Tong	b5b905e3cc	Bring back rc identity cache. This takes off 0.7% of the 27.7% of the time we spent in SILoptimizer when building Stdlib.	2016-05-25 09:25:27 -07:00
Mark Lacey	921dededad	Use a bump pointer allocator in the callee set creation. Shaves about 19% of the time from the construction of these sets. The SmallVector size was chosen to minimize the number of dynamic allocations we end up doing while building the stdlib. This should be a reasonable size for most projects, too. It's a bit wasteful in space, but the total amount of allocated space here is pretty small to begin with.	2016-05-11 17:07:27 -07:00
Arnold Schwaighofer	4df87a6554	Refactor unsafeGuaranteed code into utility functions. NFC.	2016-05-08 08:10:43 -07:00
practicalswift	9a078b54ef	[gardening] Fix recently introduced typo: "a executable" → "an executable" [gardening] Fix recently introduced typo: "a offset" → "an offset" [gardening] Fix recently introduced typo: "accessiblity" → "accessibility" [gardening] Fix recently introduced typo: "cant" → "can't" [gardening] Fix recently introduced typo: "inteference" → "interference" [gardening] Fix recently introduced typo: "unsatified" → "unsatisfied" [gardening] Remove accidental space.	2016-04-24 22:11:59 +02:00
Xin Tong	49f1c66d7b	Rename mayUseValue to mayHaveSymmetricInteference	2016-04-19 15:23:45 -07:00
Xin Tong	51b1c0bc68	Implement retain, release code motion. Iterative data flow retain sinking and release hoisting. This allows us to sink retains and hoist releases across harmless loops. which is an improvement on the SILCodeMotion retain sinking and release hoisting. It also separates the duty of moving retain and release with the duty of eliminating them in ASO. This should eventually replace RR code motion in SILcodemotion and insertion point in ARCsequence opts (ASO). This is the performance difference i get with retain sinking and release hoisting. After disabling retain release code motion in ASO and SILCodeMotion. we can start to take those code out once this lands. I see that we go from 24.5% of time spent in SILOptimizations w.r.t. the whole stdlib compilation to 25.1%. Improvement is better (i.e. retain sinking and hoisting releases result in performance gain). <details open> <summary>Regression (7)</summary> TEST \| OLD_MIN \| NEW_MIN \| DELTA (%) \| SPEEDUP --- \| --- \| --- \| --- \| --- SetIsSubsetOf \| 441 \| 510 \| +15.7% \| 0.86x SetIntersect \| 1041 \| 1197 \| +15.0% \| 0.87x BenchLangCallingCFunction \| 184 \| 211 \| +14.7% \| 0.87x Sim2DArray \| 326 \| 372 \| +14.1% \| 0.88x SetIsSubsetOf_OfObjects \| 498 \| 567 \| +13.9% \| 0.88x GeekbenchGEMM \| 945 \| 1022 \| +8.2% \| 0.92x COWTree \| 3839 \| 4181 \| +8.9% \| 0.92x(?) </details> <details > <summary>Improvement (31)</summary> TEST \| OLD_MIN \| NEW_MIN \| DELTA (%) \| SPEEDUP --- \| --- \| --- \| --- \| --- ObjectiveCBridgeFromNSDictionaryAnyObjectToString \| 174526 \| 165392 \| -5.2% \| 1.06x RGBHistogram \| 3128 \| 2957 \| -5.5% \| 1.06x ObjectiveCBridgeToNSDictionary \| 16510 \| 15494 \| -6.2% \| 1.07x LuhnAlgoLazy \| 2294 \| 2120 \| -7.6% \| 1.08x DictionarySwapOfObjects \| 6477 \| 5994 \| -7.5% \| 1.08x StringRemoveDupes \| 1610 \| 1485 \| -7.8% \| 1.08x ObjectiveCBridgeFromNSSetAnyObjectToString \| 159358 \| 147824 \| -7.2% \| 1.08x ObjectiveCBridgeToNSSet \| 16191 \| 14924 \| -7.8% \| 1.08x DictionaryHashableClass \| 1839 \| 1704 \| -7.3% \| 1.08x DictionaryLiteral \| 2906 \| 2678 \| -7.8% \| 1.09x(?) StringUtilsUnderscoreCase \| 10031 \| 9187 \| -8.4% \| 1.09x LuhnAlgoEager \| 2320 \| 2113 \| -8.9% \| 1.10x ObjectiveCBridgeFromNSSetAnyObjectToStringForced \| 99553 \| 90348 \| -9.2% \| 1.10x RIPEMD \| 3327 \| 3009 \| -9.6% \| 1.11x Combos \| 595 \| 538 \| -9.6% \| 1.11x Roman \| 10 \| 9 \| -10.0% \| 1.11x StringUtilsCamelCase \| 10783 \| 9646 \| -10.5% \| 1.12x SetIntersect_OfObjects \| 2511 \| 2182 \| -13.1% \| 1.15x SwiftStructuresTrie \| 28331 \| 24339 \| -14.1% \| 1.16x Dictionary2OfObjects \| 3748 \| 3115 \| -16.9% \| 1.20x DictionaryOfObjects \| 2473 \| 2050 \| -17.1% \| 1.21x Dictionary \| 894 \| 737 \| -17.6% \| 1.21x Dictionary2 \| 2268 \| 1859 \| -18.0% \| 1.22x StringIteration \| 8027 \| 6344 \| -21.0% \| 1.27x Phonebook \| 8207 \| 6436 \| -21.6% \| 1.28x BenchLangArray \| 119 \| 91 \| -23.5% \| 1.31x LinkedList \| 8267 \| 6297 \| -23.8% \| 1.31x StrToInt \| 5585 \| 4180 \| -25.2% \| 1.34x Dictionary3OfObjects \| 1122 \| 831 \| -25.9% \| 1.35x Dictionary3 \| 731 \| 515 \| -29.6% \| 1.42x SuperChars \| 513353 \| 258735 \| -49.6% \| 1.98x	2016-04-18 15:39:17 -07:00
Xin Tong	d84de12943	Revert "Change FSO explosion heuristic" This reverts commit `fa09c6b71d`. Broke Linux build. And also PR "please benchmark" does not seem to catch it.	2016-04-14 11:05:00 -07:00
Xin Tong	fa09c6b71d	Change FSO explosion heuristic If we can not find the epilogue releases for all the fields with reference sematics, but we found for some fields. Explode the argument. I do not see a performance improvement with this change rdar://25451364	2016-04-13 19:40:53 -07:00
Xin Tong	1a4f567685	More conservative about when we can move a release across an instruction We now consider effect of deinit in addition to the released value. rdar://25362826 This is the only 10%+ regression i measured on my machine. no performance improvement. Sim2DArray \| 326 \| 366 \| +12.3% \| 0.89x	2016-04-12 20:39:30 -07:00
Erik Eckstein	6cdfc2e469	EscapeAnalysis: Make the CGNode class public. It's used by clients. NFC.	2016-04-08 10:20:47 -07:00
practicalswift	abfecfde17	[gardening] if ([space]…[space]) → if (…), for(…) → for (…), while(…) → while (…), [[space]x, y[space]] → [x, y]	2016-04-04 16:22:11 +02:00
practicalswift	d00a5ef814	[gardening] Weekly gardening: typos, duplicate includes, header formatting, etc.	2016-03-24 22:41:10 +01:00
Xin Tong	9a3761000c	Move function signature analysis to a Util We really only need this signature analysis in the cloner pass now.	2016-03-24 11:17:47 -07:00
Xin Tong	9a020c8c7a	Minor refactoring in epilogue retain matcher	2016-03-23 22:16:49 -07:00
Xin Tong	b1c7bc5e4b	Reinstate "Minor refactoring in epilogue retain matcher"	2016-03-23 22:16:34 -07:00
Xin Tong	6e07c5ec60	Revert "Minor refactoring in epilogue release matcher. NFC" This reverts commit `a191ae72a7`. Broke Opt+Assert, Stdlib DebInfo+Assert.	2016-03-21 11:08:31 -07:00
Xin Tong	b2b5247ba9	Merge pull request #1756 from trentxintong/FSO Minor refactoring in epilogue release matcher	2016-03-21 07:59:46 -07:00
Xin Tong	a191ae72a7	Minor refactoring in epilogue release matcher. NFC	2016-03-20 23:13:50 -07:00
Xin Tong	e3ec0703fd	Merge pull request #1744 from trentxintong/FSO Implement a function signature cloner and rewriter.	2016-03-20 11:44:54 -07:00
practicalswift	a0d494c143	[gardening] Fix recently introduced typos: "fucntion" → "function", "functio" → "function", "mergable" → "mergeable", "mistmatched" → "mismatched"	2016-03-20 10:34:32 +01:00
Xin Tong	cff61d7fe7	Implement a function signature cloner and rewriter. This split the function signature module pass into 2 functin passes. By doing so, this allows us to rewrite to using the FSO-optimized function prior to attempting inlining, but allow us to do a substantial amount of optimization on the current function before attempting to do FSO on that function. And also helps us to move to a model which module pass is NOT used unless necesary. I do not see regression nor improvement for on the performance test suite. functionsignopts.sil and functionsignopt_sroa.sil are modified because the mangler now takes into account of information in the projection tree.	2016-03-19 23:57:37 -07:00
Xin Tong	fd353df19e	Remove some of unneeded functionality in CallerAnalysis We really only need the analysis to tell whether a function has caller inside the module or not. We do not need to know the callsites. Remove them for now to make the analysis more memory efficient. Add a note to indicate it can be extended.	2016-03-17 21:16:24 -07:00
Xin Tong	f543c336e7	Use SetVector instead of a SmallVector+DenseMap in CallerAnalysis	2016-03-17 17:31:38 -07:00
Xin Tong	991fc1a5fc	Add comments on how CallerAnalysis work	2016-03-17 14:01:21 -07:00
Xin Tong	f5511da774	Get rid of an non-determinism in CallerAnalysis RecomputeFunctionList should really be a SmallVector instead of a DenseSet. A DenseSet gives rise to a nondeterminstic way of iterating over all functions.	2016-03-17 11:28:25 -07:00
Xin Tong	eefacd198b	Make fields in CallerAnalysisFunctionInfo private and use friend class This allows us to vend only a determinstic getCallSites interface to users while internal implementation is protected.	2016-03-17 10:57:59 -07:00
Xin Tong	eab029d795	Add CallerAnalysis Printer. This provides some basic testing on CallerAnalysis before hooking it up to function signature opts.	2016-03-17 10:51:16 -07:00
Xin Tong	1603b0f153	Handle dead functions in CallerAnalysis. Add an invalidateAnalysisForDeadFunction API. This API calls the invalidateAnalysis by default unless overriden by analysis pass themselves. This API passes the extra information that this function is dead and going to be removed from the module. CallerAnalysis overrides this API and only invalidate caller/callee relations but does not push this into the recompute list. We also considered the possibility of keeping a computed list, instead of recompute list but that would introduce a O(n^2) complexity as every time we try to complete the computed list, we need to walk over all the functions that currently exist in the module to make sure the computed list is complete. I feel eventually we can do a handleDeleteNotification for function deletion and we wont need the API added in this change.	2016-03-17 09:55:12 -07:00
Xin Tong	cca9c2521a	Improve CallerAnalysis. Address the comments from `0acc0a8464` I still have not made up my mind how to handle deleted functions. CallerAnalysis is not hooked up to anything yet.	2016-03-16 17:49:34 -07:00
practicalswift	356b843c1b	[gardening] Fix recently introduced typo: "invaldiated" → "invalidated"	2016-03-16 23:15:45 +01:00
practicalswift	1147753a94	[gardening] Fix formatting of recently introduced header files	2016-03-16 22:51:11 +01:00
Xin Tong	0acc0a8464	Implement a Caller Analysis. The analysis can tell all the callsites which calls a function in the module. The analysis is computed and kept up-to-date lazily. At the core of it, it keeps a list of functions that need to be recomputed for the Caller/Callee relation to be precise and on every query, the analysis makes sure to recompute them and clear the list before any query. This is NFC right now. I am going to wire it up to function signature analysis eventually.	2016-03-16 09:33:22 -07:00
Xin Tong	5f7f05da9b	Reinstate "Moves SignatureAnalyzer and ArgumentDescriptor/ResultDescriptor into a separate analysis pass. This pass is run on every function and the optimized signature is return'ed through the getArgDescList and getResultDescList. Next step is to split to cloning and callsite rewriting into their own function passes. rdar://24730896 "	2016-03-16 07:00:57 -07:00
Xin Tong	48ed191ca4	Revert "Moves SignatureAnalyzer and ArgumentDescriptor/ResultDescriptor into a separate" This reverts commit `069612bccc`. Reverts because it Breaks compiling the stdlib (optimized, no stdlib assertions), while i try to reproduce and fix.	2016-03-15 14:17:01 -07:00
Xin Tong	069612bccc	Moves SignatureAnalyzer and ArgumentDescriptor/ResultDescriptor into a separate analysis pass. This pass is run on every function and the optimized signature is return'ed through the getArgDescList and getResultDescList. Next step is to split to cloning and callsite rewriting into their own function passes. rdar://24730896	2016-03-15 12:21:20 -07:00
Michael Gottesman	406a7c9962	[loop-region] Track the backedges of all loop regions. We already computed this information so this is just storing information we were already computing. One thing to note is that in code with canonicalized loops, we will always only have one backedge. But we would like loop region to be correct even in the case of non-canonicalized code so we support having multiple back edges. But since the common case is 1 backedge, we optimize for that case. This commit contains updated tests and also updates to the loop region graph viewer so that it draws backedges as green arrows from the loop to its backedge subregions. The test updates were done by examining each test case by hand.	2016-03-14 22:37:06 -07:00
Michael Gottesman	3fd5e80b39	[loop-region] Change LoopRegion::getParentID() to return the optional ParentID instead of attempting to use the value and asserting.	2016-03-14 22:37:06 -07:00
Arnold Schwaighofer	3676671b7f	Merge pull request #1587 from aschwaighofer/stack_promote_with_unsafe_mutable_buffer_pointer Mark Array.withUnsafeMutableBuffer as not escaping the array storage.	2016-03-08 19:39:28 -08:00
Arnold Schwaighofer	b5f018a4b1	Mark Array.withUnsafeMutableBuffer as not escaping the array storage. This is safe because the closure is not allowed to capture the array according to the documentation of 'withUnsafeMutableBuffer' and the current implementation makes sure that any such capture would observe an empty array by swapping self with an empty array. Users will get "almost guaranteed" stack promotion for small arrays by writing something like: func testStackAllocation(p: Proto) { var a = [p, p, p] a.withUnsafeMutableBufferPointer { let array = $0 work(array) } } It is "almost guaranteed" because we need to statically be able to tell the size required for the array (no unspecialized generics) and the total buffer size must not exceed 1K.	2016-03-08 19:37:47 -08:00

... 3 4 5 6 7

333 Commits