Change the optimizer to only make specializations [fragile] if both the
original callee is [fragile] *and* the caller is [fragile].
Otherwise, the specialized callee might be [fragile] even if it is never
called from a [fragile] function, which inhibits the optimizer from
devirtualizing calls inside the specialization.
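A minimal sketch of the rule, with hypothetical names (the real check lives in the generic specializer):

```swift
// Hypothetical model; `isFragile` stands in for the SIL [fragile] flag.
struct FunctionInfo {
  var isFragile: Bool
}

// The specialization is [fragile] only when *both* sides are; otherwise it
// stays non-fragile so calls inside it can still be devirtualized.
func specializationIsFragile(callee: FunctionInfo, caller: FunctionInfo) -> Bool {
  return callee.isFragile && caller.isFragile
}
```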
This unlocks some optimization opportunities previously missed by the
performance inliner and devirtualization, which currently reject
fragile->non-fragile references:
TEST | OLD_MIN | NEW_MIN | DELTA (%) | SPEEDUP
--- | --- | --- | --- | ---
DictionaryRemoveOfObjects | 38391 | 35859 | -6.6% | **1.07x**
Hanoi | 5853 | 5288 | -9.7% | **1.11x**
Phonebook | 18287 | 14988 | -18.0% | **1.22x**
SetExclusiveOr_OfObjects | 20001 | 15906 | -20.5% | **1.26x**
SetUnion_OfObjects | 16490 | 12370 | -25.0% | **1.33x**
Previously, passes other than performance inlining and devirtualization
of class methods were not checking invariants on [fragile] functions
at all, which was incorrect; as part of the work on building the
standard library with -enable-resilience, I added these checks, which
regressed performance with resilience disabled. This patch makes up for
those regressions.
Furthermore, once SIL type lowering is aware of resilience, this will
allow the stack promotion pass to make further optimizations after
specializing [fragile] callees.
This broke the test suite under optimizations with a SIL verifier error: "stack dealloc does
not match most recent stack alloc".
This reverts commit 7a2ca23bc2, reversing
changes made to 4c55e8d7a7.
Unreachable blocks prevented stack promotion in some cases.
Now we use our own post-dominator tree, which ignores unreachable blocks, instead of the standard post-dominator tree provided by the PostDominanceAnalysis.
Unreachable blocks (more precisely: unreachable sub-graphs) are of no interest because we don't have to insert the dealloc instructions in unreachable blocks anyway.
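A rough sketch of the idea on a toy CFG (the names here are illustrative, not the actual analysis API): first restrict attention to the blocks reachable from the entry, then build post-dominance over that subset only.

```swift
// A toy CFG: blocks are indices; `succs` lists each block's successors.
struct CFG {
  var succs: [[Int]]
  let entry = 0
}

// Collect the blocks reachable from the entry. Unreachable sub-graphs are
// skipped, since no dealloc instructions ever need to be inserted there.
func reachableBlocks(_ cfg: CFG) -> Set<Int> {
  var seen: Set<Int> = [cfg.entry]
  var worklist = [cfg.entry]
  while let block = worklist.popLast() {
    for succ in cfg.succs[block] where seen.insert(succ).inserted {
      worklist.append(succ)
    }
  }
  return seen
}
// The custom post-dominator tree is then built over reachableBlocks(cfg)
// only, instead of over all blocks as the standard analysis does.
```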
It is a hint to the optimizer that the code where this builtin is called is on the fast path.
Specifically, the inliner takes it into account and increases the assumed benefit of the code where the builtin is located.
Compared to the fastPath/slowPath builtins, this builtin can be placed in plain linear code and doesn't need to be used in a condition.
Compared to the @inline(__always) attribute, this builtin also has an effect on the calling function. Assume that
foo() calls bar(), that bar() contains onFastPath,
and that both foo and bar are small functions. Then if bar gets inlined into foo, the builtin also increases the chances that foo gets inlined.
This would not be the case if @inline(__always) were used just for bar.
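As a sketch, this is roughly how the builtin gets used through the standard library's underscored `_onFastPath()` wrapper around Builtin.onFastPath(); the cache and helpers below are made up for illustration:

```swift
// Illustrative only: the cache and the helper are hypothetical.
var cache: [Int: Int] = [:]

func expensiveCompute(_ key: Int) -> Int {
  return key &* key  // stand-in for real work
}

func value(for key: Int) -> Int {
  if let v = cache[key] {
    _onFastPath()    // hint: the cache hit is the hot path; the inliner
                     // raises the assumed benefit of this code, in this
                     // function and, after inlining, in its callers too
    return v
  }
  let v = expensiveCompute(key)
  cache[key] = v
  return v
}
```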
Do not specialize an apply/partial_apply that we've already added to the
set of dead instructions. Doing so can result in creating a new
instruction which we will leave around, and which will have a type
mismatch in its parameter list.
Fixes rdar://problem/25447450.
We ended up adding the same instruction twice to a SmallVector of
instructions to be deleted. To avoid this, we'll track these
to-be-deleted instructions in a SmallSetVector instead.
We were also failing to add an instruction that we can delete to the set
of instructions to be deleted, so I fixed that as well.
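In Swift terms, the fix is roughly the following (a sketch; the real change swaps llvm::SmallVector for llvm::SmallSetVector in C++):

```swift
// An insertion-ordered set ignores duplicate inserts, so the same
// instruction can't be queued for deletion twice.
struct OrderedSet<Element: Hashable> {
  private var seen = Set<Element>()
  private(set) var elements: [Element] = []

  mutating func insert(_ element: Element) {
    if seen.insert(element).inserted {
      elements.append(element)
    }
  }
}
```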
I've added a test case, but it's currently disabled because fixing this
turned up another issue in the same code which I still need to take a
look at.
Fixes rdar://problem/25369617.
It now detects more opportunities for inlining, such as certain patterns involving RC instructions or loads/stores from/to stack locations in the caller.
On the other hand, a new shortest-path analysis limits inlining to those cases where it really gives a benefit.
As the inlining decision now depends on many parameters, the test-threshold option is removed because it does not make much sense anymore.
Instead, the inliner test files are modified to model the "real" instruction costs.
We can remove the retain/release pair preceding the builtins based on the
knowledge that the lifetime of the reference is guaranteed by someone hanging on
to the reference elsewhere.
Eventually, we decided to do the following (sketched after the list):
1. Have the function signature opts use the cloner to create
the optimized function.
2. Mark the thunk as always_inline.
3. Rely on the inliner to inline the thunk to get the benefit of calling the optimized
function directly.
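A rough Swift-level picture of the scheme (the names are made up; the actual transformation happens on SIL):

```swift
final class Box { var value: Int = 0 }

// Original function (conceptually):
//   func bar(_ box: Box) -> Int { return box.value }

// 1. The cloner creates the optimized function with the thinner signature.
func bar_optimized(_ value: Int) -> Int {
  return value
}

// 2. The original becomes a thunk with the old signature, marked for
//    mandatory inlining; it carries exactly the information needed to set
//    up the call to the optimized function.
@inline(__always)
func bar(_ box: Box) -> Int {
  return bar_optimized(box.value)
}

// 3. Once the inliner inlines the thunk, every caller ends up calling
//    bar_optimized directly, with no explicit call-site rewriting.
```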
We decided to use the inliner to rewrite the callers' call sites; marking
the thunk always_inline forces those call sites to be rewritten by the inliner.
Eventually I will turn FunctionSignatureAnalysis into a utility,
as its data should only be used and kept in the cloner pass.
Previously, we had the issue that the thunk could change between the time it was created and
the time it was reread to figure out what we had done to the original function.
This resulted in missed opportunities.
This solution solves the problem gracefully, because the thunk carries the information
on how to set up the call to the optimized function.
Inlining the thunk makes the call site call the optimized function for free, i.e.
without any rewriting.
I did not measure any regression with this change.
This splits the function signature module pass into 2 function passes.
By doing so, this allows us to rewrite callers to use the FSO-optimized
function prior to attempting inlining, while still allowing us to do a
substantial amount of optimization on the current function before
attempting to do FSO on that function.
It also helps us move to a model in which a module pass is NOT used unless
necessary.
I see neither a regression nor an improvement on the performance test suite.
functionsignopts.sil and functionsignopt_sroa.sil are modified because the
mangler now takes into account information in the projection tree.
This occurred if a stack-promoted object with a devirtualized final release was not actually allocated on the stack.
Now the ReleaseDevirtualizer models the procedure of a final release more accurately:
it inserts a set_deallocating instruction and calls the deallocator (instead of just the deinit).
This change also includes two peephole optimizations in IRGen and LLVMStackPromotion which get rid of
unused runtime calls in case the stack-promoted object really is allocated on the stack.
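For context, a made-up example of the pattern involved: a non-escaping class instance that stack promotion and the ReleaseDevirtualizer cooperate on.

```swift
final class Counter {
  var count = 0
}

func sum(_ values: [Int]) -> Int {
  let c = Counter()          // candidate for stack promotion
  for v in values { c.count += v }
  return c.count             // final release of `c`: the ReleaseDevirtualizer
                             // replaces it with set_deallocating plus a
                             // direct call to Counter's deallocator
}
```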
This fixes rdar://problem/25068118
In many places, we're interested in whether a type with archetypes *might be* a superclass of another type with the right bindings, particularly in the optimizer. Provide a separate Type::isBindableToSuperclassOf method that performs this check. Use it in the devirtualizer to fix rdar://problem/24993618. Using it might unblock other places where the optimizer is conservative, but we can fix those separately.
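A small Swift illustration of the distinction (the types here are hypothetical):

```swift
class Base<T> {}
class Derived: Base<Int> {}

// With U an unbound archetype, "is Base<U> a superclass of Derived?" can
// only be answered "maybe": it is, but solely under the binding U == Int.
// That "maybe" is what Type::isBindableToSuperclassOf reports, letting the
// devirtualizer proceed where an exact superclass query would give up.
func takesBase<U>(_ x: Base<U>) {}
```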
We were creating new uses of an argument just prior to erasing it from
the block argument list.
We need to replace references to that value in the side structure we
generate with references to the new value that we're replacing it with.
Fixes SR-884 / rdar://problem/25008398.