swift-mirror

mirror of https://github.com/apple/swift.git synced 2025-12-14 20:36:38 +01:00

Author	SHA1	Message	Date
Erik Eckstein	506a14b9f0	COWArrayOpts: make the optimization work again for two-dimensional arrays. With removing of pinning and with addressors, the pattern matching did not work anymore. The good thing is that the SIL is now much simpler and we can handle the 2D case without pattern matching at all. This removes a lot of code from COWArrayOpts. rdar://problem/43863081	2018-10-05 08:26:14 -07:00
Erik Eckstein	b2a5fd69be	COWArrayOpts: remove the -view-cfg-before-cow-for option for debugging This option is very old. The same effect can now be achieved with pass-manager options, like -sil-print-before	2018-10-05 08:26:14 -07:00
Michael Gottesman	d57a88af0d	[gardening] Rename references to SILPHIArgument => SILPhiArgument.	2018-09-25 22:23:34 -07:00
Erik Eckstein	87cf7eff03	SIL optimizer: fix a compiler non-determinism in LICM. SR-8844 rdar://problem/44762620	2018-09-25 12:58:42 -07:00
Mike Ash	798edb9d0e	[Runtime][Stdlib][Overlays] Rename various Objective-C classes and methods that would conflict when loading old Swift libraries into a process alongside ABI-stable libraries. rdar://problem/35768222	2018-09-13 16:55:10 -04:00
Erik Eckstein	584ed9710f	SILOptimizer: update bounds check and uniqueness check hoisting optimizations for using _modify in Array subscript. The optimizations now handle the ref_tail_addr instructions for detecting element addresses (in addition to the array semantics function _getElementAddress). After _modify for Array subscript lands, we can get rid of _getElementAddress at all.	2018-09-06 16:58:06 -07:00
Joe Shajrawi	a9df5863d0	Merge pull request #19119 from shajrawi/mayrelease [Exclusivity] Handle mayRelease instructions conservatively in AccessnforcementOpts and LICM	2018-09-04 17:41:17 -07:00
Joe Shajrawi	c6a4e2cdd2	[Exclusivity] Handle mayRelease instructions conservatively in AccessEnforcementOpts and LICM	2018-09-04 13:23:22 -07:00
Andrew Trick	1e88e44ce8	Add critical edge verification and fix SIL passes. SIL passes were violating the existing invariant on non-cond-br critical edges in several places. I fixed the places that I could find. Wherever there was a post-pass to "clean up" critical edges, I replaced it with a a call to verification that the critical edges aren't broken in the first place. We still need to eliminate critical edges entirely before enabling ownership SIL.	2018-08-30 13:01:39 -07:00
Andrew Trick	9d2af79975	Simplify SILPHIArgument::getIncomingValue. The client of this interface naturally expects to get back the incoming phi value. Ignoring dominance and SIL ownership, the incoming phi value and the block argument should be substitutable. This method was actually returning the incoming operand for checked_cast and switch_enum terminators, which is deeply misleading and has been the source of bugs. If the client wants to peek though casts, and enums, it should do so explicitly. getSingleTerminatorOperand[s]() will do just that.	2018-08-30 13:01:39 -07:00
Joe Shajrawi	95344f6591	[LICM/Exclusivity] Hoist (some) conflicting begin_accesses out of loops Consider the attached test cases: We have a begin_access [dynamic] to a global inside of a loop There’s a nested conflict on said access due to an apply() instruction between the begin and end accesses. LICM is currently very conservative: If there are any function calls inside of the loop that conflict with begin and end access, we do not hoist out of the loop. However, if all conflicting applies are “sandwiched” between the begin and end access. So there’s no reason we can’t hoist out of the loop. See radar rdar://problem/43660965 - this improves some internal benchmarks by over 3X	2018-08-28 11:51:16 -07:00
Joe Shajrawi	8e935f6046	Fix a warning in LICM in release builds	2018-08-27 10:11:23 -07:00
Joe Shajrawi	5e2f3d8448	[LICM]: support hosting ref_element_addr even if they are not guaranteed to be executed In some instances, some instructions, like ref_element_addr, can be hoisted outside of loops even if they are not guaranteed to be executed. We currently don’t support that / bail. We only try to do so / do further analysis only for begin_access because they are extremely heavy. However, we need to support hosting of ref_element_addr in that case, if it does not have a loop dependent operand, in order to be able to hoist begin_access instructions in some benchmarks. Initial local testing shows that this PR, when we enable exclusivity, improves the performance of a certain internal benchmark by over 40% See rdar://problem/43623829	2018-08-23 14:13:37 -07:00
Joe Shajrawi	7281a76deb	[AccessEnforcementOpts] Add mergeAccesses optimization	2018-08-09 16:15:25 -07:00
Bob Wilson	8e330ee344	NFC: Fix indentation around the newly renamed LLVM_DEBUG macro. Jordan used a sed command to rename DEBUG to LLVM_DEBUG. That caused some lines to wrap and messed up indentiation for multi-line arguments.	2018-07-21 00:56:18 -07:00
Jordan Rose	cefb0b62ba	Replace old DEBUG macro with new LLVM_DEBUG ...using a sed command provided by Vedant: $ find . -name \*.cpp -print -exec sed -i "" -E "s/ DEBUG\(/ LLVM_DEBUG(/g" {} \;	2018-07-20 14:37:26 -07:00
swift-ci	9b2d9606f6	Merge pull request #17614 from shajrawi/licm_asan	2018-06-28 17:02:08 -07:00
Joe Shajrawi	3d411d22e0	[LICM] Fix an ASAN use-after-free bug in rewrite	2018-06-28 15:46:05 -07:00
Joe Shajrawi	437b0d8e13	[LICM] Array Semantics: only hoist kGetCount and kGetCapacity We can’t hoist everything that is hoist-able The canHoist method does not do all the required analysis Some of the work is done at COW Array Opt TODO: Refactor COW Array Opt + canHoist - radar 41601468	2018-06-28 14:42:20 -07:00
swift-ci	b34c087cf4	Merge pull request #17194 from shajrawi/licm2	2018-06-27 22:59:33 -07:00
Joe Shajrawi	c3ddaf92cb	[LICM] Further refactoring: Simplify hosting of begin access instructions - get rid of HoistPairSet and hoistAndSinkInstructionPair	2018-06-27 20:54:24 -07:00
Michael Gottesman	9e70b855e4	[cmake] Use a helper macro to simplify adding sources to the SILOptimizer library. All this does is automate the creation of the ${DIRNAME}_SOURCES variables that we already create and allows for the author to avoid having to prefix with the directory name, i.e.: set(FOOBAR_SOURCES FooBar/Source.cpp PARENT_SCOPE) => silopt_register_sources( Source.cpp) Much easier and cleaner to read. I put the code that implements this in the CMakeLists.txt file just for the SILOptimizer.	2018-06-27 17:48:59 -07:00
Joe Shajrawi	f56b5d8730	[LICM] Add support for Hosting <Instruction, Instruction Set> Pairs Support having the target of each hoist instruction as multiple sinks.	2018-06-27 16:25:35 -07:00
Joe Shajrawi	bc59eaad70	[LICM] Refactoring + Improvements + Exclusivity Support Major refactoring + tuning of LICM. Includes: Support for hosting more array semantic calls Remove restrictions for sinking instructions Add support for hoisting and sinking instruction pairs (begin and end accesses) Testing with Exclusivity enabled on a couple of benchmarks shows: ReversedArray 7x improvement StringWalk 2.6x improvement	2018-06-26 13:26:37 -07:00
Joe Shajrawi	19a6bb5bdb	[LICM] Code Hygiene - rip out sinkCondFail Removing this optimization from SIL: It is not worth the extra code complexity and compilation time. More in-depth explanation for the reasoning behind my decision: 1) What is being done there is obviously not LICM (more below) - even if it is useful it should be its own separate optimization 2) The regression that caused us to add this code is no longer there in most cases - 10% in only one specific corner-case 3) Even if the regression was still there, this is an extremely specific code pattern that we are pattern-matching against. Said pattern would be hard to find in any real code. There is a small code snippet in rdar://17451529 that caused us to add this optimization. Looking at it now we see that the only difference is in loop1 example - The only difference in SIL level is in loop 1: %295 = tuple_extract %294 : $(Builtin.Int64, Builtin.Int1), 0 %296 = tuple_extract %294 : $(Builtin.Int64, Builtin.Int1), 1 cond_fail %296 : $Builtin.Int1 %298 = struct $Int (%295 : $Builtin.Int64) store %298 to %6 : $*Int %300 = builtin "cmp_eq_Int64"(%292 : $Builtin.Int64, %16 : $Builtin.Int64) : $Builtin.Int1 cond_br %300, bb1, bb12 The cond_fail instruction in said loop is moved below the store instruction / above the builtin. Looking at the resulting IR. And how LLVM optimizes it. It is almost the same. If we look at the assembly code being executed then, before removing this optimization, we have: LBB0_11: testq %rcx, %rcx je LBB0_2 decq %rcx incq %rax movq %rax, _$S4main4sum1Sivp(%rip) jno LBB0_11 After removing it we have: LBB0_11: incq %rax testq %rcx, %rcx je LBB0_2 decq %rcx movq %rax, %rdx incq %rdx jno LBB0_11 There is no extra load/movq which was mentioned the radar.	2018-06-14 11:10:11 -07:00
David Zarzycki	8c0c55539f	[SIL] NFC: Rename misleading getSwiftRValueType() to getASTType() Reference storage types are not RValues. Also, use more SILType helper methods to avoid line wrap.	2018-05-04 08:14:38 -04:00
Andrew Trick	cdcb7c7a2c	[NFC] SideEffectAnalysis refactoring and cleanup. Make this a generic analysis so that it can be used to analyze any kind of function effect. FunctionSideEffect becomes a trivial specialization of the analysis. The immediate need for this is to introduce an new AccessedStorageAnalysis, although I foresee it as a generally very useful utility. This way, new kinds of function effects can be computed without adding any complexity or compile time to FunctionSideEffects. We have the flexibility of computing different kinds of function effects at different points in the pipeline. In the case of AccessedStorageAnalysis, it will compute both FunctionSideEffects and FunctionAccessedStorage in the same pass by implementing a simple wrapper on top of FunctionEffects. This cleanup reflects my feeling that nested classes make the code extremely unreadable unless they are very small and either private or only used directly via its parent class. It's easier to see how these classes compose with a flat type system. In addition to enabling new kinds of function effects analyses, I think this makes the implementation of side effect analysis easier to understand by separating concerns.	2018-04-16 17:05:04 -07:00
Davide Italiano	b4d563802b	[SILInstruction] Introduce isDebugInstruction(). This is a property of an instruction and should be a member function of `SILInstruction` and not a free function in `DebugUtils`. Discussed with Adrian.	2018-04-11 10:14:21 -07:00
Slava Pestov	e1f50b2d36	SE-0193: Rename @_inlineable to @inlinable, @_versioned to @usableFromInline	2018-03-30 21:55:30 -07:00
Alex Blewitt	b13c8dede3	[SR-7043] Remove duplicate if statement	2018-02-20 19:25:10 +00:00
Erik Eckstein	db69b8d433	SideEffectAnalysis: don't assume the worst side-effects for a release instruction Instead let the client decide what to do with this. Sometimes the client knows what side effect a release instruction really has.	2018-01-19 11:32:35 -08:00
eeckstein	b126b62256	Revert "Optimization changes to completely fold OptionSet literals"	2018-01-18 22:05:07 -08:00
Erik Eckstein	9907ffc09d	SideEffectAnalysis: don't assume the worst side-effects for a release instruction Instead let the client decide what to do with this. Sometimes the client knows what side effect a release instruction really has.	2018-01-18 18:27:17 -08:00
Chris Lattner	de289752fe	Implement a few silcombine transformations for arrays (#13652 ) * Implement a few silcombine transformations for arrays - Useless existential_ref <-> class conversions. - mark_dependence_inst depending on uninteresting instructions. - release(init_existential_ref(x)) -> release(x) when hasOneUse(x) - Update COWArrayOpt to handle the new forms generated by this. these aren't massive performance wins, but do shrink the size of SIL when dealing with arrays. * Generalize testcase to work on linux and on mac when checking stdlib is enabled.	2017-12-30 22:30:37 -08:00
Erik Eckstein	90c21be191	Unify the implementation of optimization mode in various option classes. This commit is mostly refactoring. ) Introduce a new OptimizationMode enum and use that in SILOptions and IRGenOptions ) Allow the optimization mode also be specified for specific SILFunctions. This is not used in this commit yet and thus still a NFC. Also, fixes a minor bug: we didn’t run mandatory IRGen passes for functions with @_semantics("optimize.sil.never")	2017-11-14 11:25:02 -08:00
Raj Barik	c0b7cf0ff5	Expose UnrollThreshold and Inline_Benefit_reduction in the command line for automatic code size tuning	2017-10-27 10:30:23 -07:00
Slava Pestov	7bf3b90b62	SIL: Split off objc_method / objc_super_method from class_method / super_method This replaces the '[volatile]' flag. Now, class_method and super_method are only used for vtable dispatch. The witness_method instruction is still overloaded for use with both ObjC protocol requirements and Swift protocol requirements; the next step is to make it only mean the latter, also using objc_method for ObjC protocol calls.	2017-10-03 22:13:31 -07:00
Arnold Schwaighofer	71a7252424	Fix dominator tree update in array specialization We need to walk the dominator tree starting at the header and update the dominator of all nodes outside the loop we hit i.e the dominator tree nodes that are immediately dominated 'by the loop' instead of only updating dominated exit blocks. rdar://34523864	2017-09-26 08:33:58 -07:00
John McCall	ab3f77baf2	Make SILInstruction no longer a subclass of ValueBase and introduce a common superclass, SILNode. This is in preparation for allowing instructions to have multiple results. It is also a somewhat more elegant representation for instructions that have zero results. Instructions that are known to have exactly one result inherit from a class, SingleValueInstruction, that subclasses both ValueBase and SILInstruction. Some care must be taken when working with SILNode pointers and testing for equality; please see the comment on SILNode for more information. A number of SIL passes needed to be updated in order to handle this new distinction between SIL values and SIL instructions. Note that the SIL parser is now stricter about not trying to assign a result value from an instruction (like 'return' or 'strong_retain') that does not produce any.	2017-09-25 02:06:26 -04:00
Roman Levenstein	2e5b820063	[sil-loop-unroll] Do not unroll loops if their bodies contain function calls of big functions It is more profitable in most cases to inline the big callee rather than unroll the loop, because unrolling it would create a lot of calls which cannot be further optimized due to increased size of the function containing a loop.	2017-08-22 12:45:22 -07:00
Arnold Schwaighofer	9d1ae9aa88	Don't perform array property specialization at Osize.	2017-08-17 14:57:22 -07:00
practicalswift	492f5cd35a	[gardening] Remove redundant repetition of type names (DRY): RepeatedTypeName foo = dyn_cast<RepeatedTypeName>(bar) Replace `NameOfType foo = dyn_cast<NameOfType>(bar)` with DRY version `auto foo = dyn_cast<NameOfType>(bar)`. The DRY auto version is by far the dominant form already used in the repo, so this PR merely brings the exceptional cases (redundant repetition form) in line with the dominant form (auto form). See the [C++ Core Guidelines](https://github.com/isocpp/CppCoreGuidelines/blob/master/CppCoreGuidelines.md#es11-use-auto-to-avoid-redundant-repetition-of-type-names) for a general discussion on why to use `auto` to avoid redundant repetition of type names.	2017-05-05 09:45:53 +02:00
Erik Eckstein	9ac13ae606	stdlib, optimizer: add Array. reserveCapacityForAppend as a new array semantics operation. This function reserves capacity in an Array for new elements which are about to be appended.	2017-04-27 09:06:55 -07:00
Arnold Schwaighofer	cafe58251e	LoopUnroller: Also handle > exit condition	2017-04-22 20:04:13 -07:00
Arnold Schwaighofer	533044d4c4	LoopUnroller: Teach the loop unroller about >= terminated loops	2017-04-22 17:32:37 -07:00
practicalswift	5e255e07d7	[gardening] Remove redundant logic	2017-04-11 23:04:55 +02:00
Andrew Trick	be1881aa1f	Remove redundant Transform.getName() definitions. At some point, pass definitions were heavily macro-ized. Pass descriptive names were added in two places. This is not only redundant but a source of confusion. You could waste a lot of time grepping for the wrong string. I removed all the getName() overrides which, at around 90 passes, was a fairly significant amount of code bloat. Any pass that we want to be able to invoke by name from a tool (sil-opt) or pipeline plan should have unique type name, enum value, commend-line string, and name string. I removed a comment about the various inliner passes that contradicted that. Side note: We should be consistent with the policy that a pass is identified by its type. We have a couple passes, LICM and CSE, which currently violate that convention.	2017-04-09 15:20:28 -07:00
Erik Eckstein	9ce3df106e	Add two new array semantics functions. array.append_element(newElement: Element) array.append_contentsOf(contentsOf newElements: S) And allow early inlining of them. Those functions will be needed to optimize Array.append(contentsOf)	2017-03-31 15:31:57 -07:00
John McCall	3c5de5fa0a	Preserve type canonicality better in several places and idiomatize some uses of SILType::getSwiftRValueType().	2017-03-14 14:59:43 -04:00
Andrew Trick	1abeddcc5d	[SILType] SILFunctionConventions API. Separate formal lowered types from SIL types. The SIL type of an argument will depend on the SIL module's conventions. The module conventions are determined by the SIL stage and LangOpts. Almost NFC, but specialized manglings are broken incidentally as a result of fixes to the way passes handle book-keeping of aruments. The mangler is fixed in the subsequent commit. Otherwise, NFC is intended, but quite possible do to rewriting the logic in many places.	2017-01-26 15:35:48 -08:00

... 3 4 5 6 7

350 Commits