swift-mirror

mirror of https://github.com/apple/swift.git synced 2025-12-21 12:14:44 +01:00

Author	SHA1	Message	Date
Meghana Gupta	0044e7dcac	[ownership] Move OME past SILMem2Reg	2020-10-21 12:04:10 -07:00
Andrew Trick	b2d1ac1631	Add AccessPathVerification pass and run it in the pipeline.	2020-10-16 15:00:10 -07:00
Arnold Schwaighofer	b994bf3191	Add support for `_specialize(exported: true, ...)` This attribute allows defining a pre-specialized entry point of a generic function in a library. The following definition provides a pre-specialized entry point for `genericFunc(_:)` for the parameter type `Int` that clients of the library can call. ``` @_specialize(exported: true, where T == Int) public func genericFunc<T>(_ t: T) { ... } ``` Pre-specializations of internal `@inlinable` functions are allowed. ``` @usableFromInline internal struct GenericThing<T> { @_specialize(exported: true, where T == Int) @inlinable internal func genericMethod(_ t: T) { } } ``` There is syntax to pre-specialize a method from a different module. ``` import ModuleDefiningGenericFunc @_specialize(exported: true, target: genericFunc(_:), where T == Double) func prespecialize_genericFunc(_ t: T) { fatalError("dont call") } ``` Specially marked extensions allow for pre-specialization of internal methods across module boundaries (respecting `@inlinable` and `@usableFromInline`). ``` import ModuleDefiningGenericThing public struct Something {} @_specializeExtension extension GenericThing { @_specialize(exported: true, target: genericMethod(_:), where T == Something) func prespecialize_genericMethod(_ t: T) { fatalError("dont call") } } ``` rdar://64993425	2020-10-12 09:19:29 -07:00
Erik Eckstein	68f485424c	SILOptimizer: add an additional TempRValueOpt pass later in the pipeline. This can compensate for the performance regression of the more conservative handling of function calls in TempRValueOpt (see previous commit). The pass runs after the inlining passes and can therefore optimize in some cases where it's not possible before inlining.	2020-10-09 20:54:59 +02:00
Meghana Gupta	f4bbafb392	Revert "[PassManager] Update PassManager's function worklist for newly added SILFunctions"	2020-10-05 10:23:31 -07:00
Michael Gottesman	4c8d09feb3	[ownership] Move ownership lowering past SROA. I already updated SROA for this and we already have tests/etc. We have just been waiting on some other passes to be moved afterwards.	2020-09-30 16:08:44 -05:00
Michael Gottesman	c3bc8e8ef9	[ownership] Move ownership elimination on the stdlib past lower aggregate instrs.	2020-09-30 11:43:34 -05:00
Michael Gottesman	d1f43032fc	[ownership] Move ownership past TempLValueOpt for the stdlib and add an ossa test case.	2020-09-29 16:36:12 -05:00
Meghana Gupta	163d47ec90	Revert "Revert #33106 and #33205" (#34106)	2020-09-28 23:08:14 -07:00
Meghana Gupta	77a76a8422	Revert "Merge pull request #33205 from meg-gupta/ometofunctionpass" This reverts commit `8dbac48c18`, reversing changes made to `c22ba90700`.	2020-09-25 11:49:52 -07:00
Meghana Gupta	49d93c58a7	Revert "[ownership] Move OME after CopyForwarding (#33106)" This reverts commit `ef972eb34d`.	2020-09-25 11:49:07 -07:00
Meghana Gupta	ef972eb34d	[ownership] Move OME after CopyForwarding (#33106) * Move OME after CopyForwarding * Minor fix in CopyForwarding test	2020-09-24 20:59:28 -07:00
Meghana Gupta	9c9a8ef224	Allow OME to run mandatorily	2020-09-22 18:02:04 -07:00
Michael Gottesman	4cbc07c6c6	[ownership] Add a frontend option to stop optimizing right before we lower ownership. Specifically the option: -sil-stop-optzns-before-lowering-ownership. This makes it possible to write end-to-end tests on OSSA passes. Previously, one had to pattern match after ownership was lowered, losing the ability to do fine-grained FileCheck pattern matching on OSSA itself.	2020-09-17 18:02:33 -05:00
Erik Eckstein	4d03eb4f0f	SILOptimizer: Move the StringOptimization a bit earlier in the pipeline. Needed to make sure that global initializers are not optimized in mid-level SIL while other functions are still in high-level SIL. Having the StringOptimization not in high-level SIL was just a mistake in my earlier PR.	2020-08-03 12:01:29 +02:00
Erik Eckstein	2a035432e7	SILOptimizer: make a separate SROA pass for high-level SIL, which doesn't split String types. The StringOptimization relies on seeing String values as a whole and not being split.	2020-08-03 12:01:29 +02:00
Hamish Knight	9f47a284be	Merge pull request #33159 from hamishknight/back-to-the-future Introduce LoweredSILRequest	2020-07-28 21:30:42 -07:00
Hamish Knight	a99f8e9d9c	Introduce LoweredSILRequest Add a request that produces lowered SIL for a file or module and use it in IRGenRequest if no SILModule is provided.	2020-07-28 10:37:37 -07:00
Erik Eckstein	7f684b62e2	SIL optimizer: Add a new string optimization. Optimizes String operations with constant operands. Specifically: * Replaces x.append(y) with x = y if x is empty. * Removes x.append("") * Replaces x.append(y) with x = x + y if x and y are constant strings. * Replaces _typeName(T.self) with a constant string if T is statically known. With this optimization it's possible to constant fold string interpolations, like "the \(Int.self) type" -> "the Int type" This new pass runs on high-level SIL, where semantic calls are still in place. rdar://problem/65642843	2020-07-27 21:32:56 +02:00
Joe Groff	b4a0ceac71	Add PruneVTables to the performance optimizer passes.	2020-07-23 20:40:49 -07:00
Michael Gottesman	76c7c3e579	[opt-remark] Add support for emitting opt-remark-generator remarks when compiling with optimization. In order to test this, I implemented a small source loc inference routine for instructions without valid SILLocations. This is an optional knob that the opt-remark writer can enable on a per-remark basis. The current behaviors are just forward/backward scans in the same basic block. If we scan forwards and find a valid SourceLoc, we just use that. If we are scanning backwards, we instead grab the SourceRange and, if it is valid, use the end of the given instruction's source range. This seems to give a good result for retain (forward scan) and release (backward scan). The specific reason I did this is that my test cases are retain/release operations. Often, due to code motion, these operations are moved around and (rightly, to prevent user confusion) given auto-generated SIL locations by optimizations. Since that is the test case I am using, I needed said engine to test this.	2020-07-20 12:01:34 -07:00
Michael Gottesman	0dbed44ddd	[ownership] Move ownership lowering past the eager specializer on the stdlib.	2020-07-10 15:31:59 -07:00
Meghana Gupta	f8d8091c98	[ownership] Move ome after GlobalOpt (#32742)	2020-07-08 14:54:59 -07:00
Meghana Gupta	337c4e88d7	Add a new -sil-disable-only-function flag (#32622) This will enable us to apply -sil-disable-pass only on certain functions	2020-07-01 10:18:36 -07:00
Michael Gottesman	ba2e04be7e	[ownership] Move the stdlib ome point to before global opt. This just moves it past the SIL linker (which, since the stdlib doesn't link anything, will not change anything) and past TempRValueOpt, which is already updated for OSSA.	2020-06-26 14:04:48 -07:00
Michael Gottesman	5b6918fd3f	Merge pull request #32505 from gottesmm/pr-12b4fd6015e37d9a95ea6a81da117e6678369d02 [ownership] Split ownership lowering in the pass pipeline for non-transparent stdlib vs non-stdlib functions.	2020-06-23 15:02:35 -07:00
Michael Gottesman	3530f8e26d	[ownership] Split ownership lowering in the pass pipeline for non-transparent stdlib vs non-stdlib functions. I am going to be moving back ownership lowering first in the stdlib so that we can bring up the optimizer on ownership without needing to deal with serialization issues (the stdlib doesn't deserialize SIL from any other modules). This patch just begins the mechanical process with a nice commit message. Should be NFC.	2020-06-22 18:32:17 -07:00
Erik Eckstein	a7425c16ff	Improvements for cross-module-optimization * Include small non-generic functions for serialization * serialize initializer of global variables: so that global let variables can be constant propagated across modules rdar://problem/60696510	2020-06-22 16:49:26 +02:00
Erik Eckstein	9e92389fa5	SILOptimizer: a new "TempLValueOpt" optimization pass for copy_addr Optimizes copies from a temporary (an "l-value") to a destination. %temp = alloc_stack $Ty instructions_which_store_to %temp copy_addr [take] %temp to %destination dealloc_stack %temp is optimized to destroy_addr %destination instructions_which_store_to %destination The name TempLValueOpt refers to the TempRValueOpt pass, which performs a related transformation, just with the temporary on the "right" side. The TempLValueOpt is similar to CopyForwarding::backwardPropagateCopy. It's more restricted (e.g. the copy-source must be an alloc_stack). That enables other patterns to be optimized, which backwardPropagateCopy cannot handle. This pass also performs a small peephole optimization which simplifies copy_addr - destroy sequences. copy_addr %source to %destination destroy_addr %source is replaced with copy_addr [take] %source to %destination	2020-06-22 13:47:31 +02:00
Michael Gottesman	46432404f3	[ownership] Remove dead option: enable-ownership-stripping-after-serialization. We always lower ownership now after the diagnostic passes (what this option actually controlled). So remove it. NFC.	2020-06-16 10:52:02 -07:00
Michael Gottesman	702c1bc5e8	[arc] Change guaranteed arc opts to be based on SemanticARCOpts and move from Diagnostic pipeline -> Onone pipeline. The pass is already not being run during normal compilation scenarios today since it bails on OSSA except in certain bit-rot situations where a test wasn't updated and so was inadvertently invoking the pass. I discovered these while originally just trying to eliminate the pass from the diagnostic pipeline. The reason why I am doing this in one larger change is that I found there were a bunch of sil tests inadvertently relying on guaranteed arc opts to eliminate copy traffic. So, if I just removed this and did this in two steps, I would basically be unoptimizing then re-optimizing the tests. Some notes: 1. The new guaranteed arc opts is based off of SemanticARCOpts and runs only on ossa. Specifically, in this new pass, we just perform simple canonicalizations that do not involve any significant analysis. Some examples: a copy_value all of whose uses are destroys. This will do what the original pass did and more without more compile time. I did a conservative first approximation, but we can probably tune this a bit. 2. the reason why I am doing this now is that I was trying to eliminate the enable-ownership-stripping-after-serialization flag and discovered that the test opaque_value_mandatory implicitly depends on this since sil-opt by default was the only place left in the compiler with that option set to false by default. So I am eliminating that dependency before I land the larger change.	2020-06-15 17:00:18 -07:00
Meghana Gupta	b5a8b518ea	Merge pull request #32141 from meg-gupta/commaflags [NFC] Make some PassManager options to accept comma separated values	2020-06-11 15:28:32 -07:00
Meghana Gupta	a1e281d926	Merge pull request #30710 from meg-gupta/bottomupfunction [PassManager] Update PassManager's function worklist for newly added SILFunctions	2020-06-03 08:20:39 -07:00
Meghana Gupta	15583ac269	Make some PassManager options to accept comma separated values	2020-06-01 23:29:13 -07:00
Anthony Latsis	9fd1aa5d59	[NFC] Pre- increment and decrement where possible	2020-06-01 15:39:29 +03:00
Erik Eckstein	6569c98332	SIL optimizer: add an additional stack promotion pass to the late pipeline Sometimes stack promotion can catch cases only at a late stage of the pipeline, after FunctionSignatureOpts. https://bugs.swift.org/browse/SR-12773 rdar://problem/63068408	2020-05-28 10:23:40 +02:00
Erik Eckstein	216eec2d21	SIL optimizer: add an additional LICM pass to the pipeline. The COWOpts optimization relies more on LICM. This additional run of the pass ensures that there is no phase ordering issue between LICM and COWOpts	2020-05-26 18:01:17 +02:00
Erik Eckstein	9722578df6	SILOptimizer: a new optimization for copy-on-write Constant folds the uniqueness result of begin_cow_mutation instructions, if it can be proved that the buffer argument is uniquely referenced. For example: %buffer = end_cow_mutation %mutable_buffer // ... // %buffer does not escape here // ... (%is_unique, %mutable_buffer2) = begin_cow_mutation %buffer cond_br %is_unique, ... is replaced with %buffer = end_cow_mutation [keep_unique] %mutable_buffer // ... (%not_used, %mutable_buffer2) = begin_cow_mutation %buffer %true = integer_literal 1 cond_br %true, ... Note that the keep_unique flag is set on the end_cow_mutation because the code now relies on the buffer really being uniquely referenced. The optimization can also handle def-use chains between end_cow_mutation and begin_cow_mutation which involve phi-arguments. An additional peephole optimization is performed: if the begin_cow_mutation is the only use of the end_cow_mutation, the whole pair of instructions is eliminated.	2020-05-26 18:01:17 +02:00
Erik Eckstein	ad99b9d4f8	SILOptimizer: a new phi-argument expansion optimization. If only a single field of a struct phi-argument is used, replace the argument by the field value. br bb(%str) bb(%phi): %f = struct_extract %phi, #Field // the only use of %phi use %f is replaced with %f = struct_extract %str, #Field br bb(%f) bb(%phi): use %phi This also works if the phi-argument is in a def-use cycle. The new PhiExpansionPass is in the same file as the RedundantPhiEliminationPass. Therefore I renamed the source file to PhiArgumentOptimizations.cpp	2020-05-25 09:36:09 +02:00
Saleem Abdulrasool	cebe79d482	SIL: use object libraries instead of globbing This simplifies the handling of the subdirectories in the SIL and SILOptimizer paths. Create individual libraries as object libraries which allows the analysis of the source changes to be limited in scope. Because these are object libraries, this has 0 overhead compared to the previous implementation. However, string operations over the filenames are avoided. The cost for this is that any new sub-library needs to be added into the list rather than added with the special local function.	2020-05-18 18:56:34 +00:00
Meghana Gupta	fd98ce10c7	Update PassManager's function worklist for newly added SILFunctions The PassManager should transform all functions in bottom up order. This is necessary because when optimizations like inlining look at the callee function bodies to compute profitability, the callee functions should have already undergone optimizations to get better profitability estimates. The PassManager builds its function worklist based on bottom up order on initialization. However, newly created SILFunctions due to specialization etc. are simply appended to the function worklist. This can cause us to make bad inlining decisions due to inaccurate profitability estimates. This change now updates the function worklist such that all the callees of the newly added SILFunction are processed before it by the PassManager. Fixes rdar://52202680	2020-05-11 19:43:22 -07:00
Meghana Gupta	47fe49a2a9	Fix the mid-level function-pass pipeline (#31424) * Fix the mid-level pass pipeline. Module passes need to be in a separate pipeline, otherwise the pipeline restart mechanism will be broken. This makes GlobalOpt and serialization run earlier in the pipeline. There's no explicit reason for them to be run later, in the middle of a function pass pipeline. Also, pipeline boundaries, like serialization and module passes, should be explicit at the top-level function that creates the pass pipelines. * SILOptimizer: Add enforcement of function-pass pipelines. Don't allow module passes to be inserted within a function pass pipeline. This silently breaks the function pipeline both interfering with analysis and the normal pipeline restart mechanism. * Add missing pass in addFunctionPasses Co-authored-by: Andrew Trick <atrick@apple.com>	2020-05-03 18:23:40 -07:00
Meghana Gupta	6c3857b6d6	Add forced precomputation and verification of analysis (#31251) -sil-verify-all flag will verify analyses before and after a pass to confirm correct invalidations. But if an analysis was never constructed or invalidated as per current pass order, it may never detect insufficient invalidations. -sil-verify-force-analysis will force construct an analysis so that we can better check for insufficient invalidations. It is also terribly slow compared to -sil-verify-all.	2020-04-24 10:55:45 -07:00
Erik Eckstein	53f6fdadc6	SILOptimizer: reorganize the optimization-prepare pass pipeline Don't create a separate pass manager for those passes, just let them run at the beginning of the performance pipeline. Regarding generated code this is NFC. This change fixes a problem with pass-bisecting (for debugging). Having two instances of the pass manager can cause trouble with bisecting, because -sil-opt-pass-count affects both pass managers at the same time.	2020-04-24 15:48:48 +02:00
ematejska	4cd68edf8c	[Autodiff upstream] Add DifferentiabilityWitnessDevirtualizer SILOptimizer pass (#30984) Add DifferentiabilityWitnessDevirtualizer: an optimization pass that devirtualizes `differentiability_witness_function` instructions into `function_ref` instructions. Co-authored-by: Dan Zheng <danielzheng@google.com>	2020-04-23 02:13:05 -07:00
Dan Zheng	1775e8ae16	[AutoDiff upstream] Add VJPEmitter. `VJPEmitter` is a cloner that emits VJP functions. It implements reverse-mode automatic differentiation, along with `PullbackEmitter`. `VJPEmitter` clones an original function, replacing function applications with VJP function applications. In VJP functions, each basic block takes a pullback struct (containing callee pullbacks) and produces a predecessor enum: these data structures are consumed by pullback functions.	2020-04-05 20:35:35 -07:00
Dan Zheng	aa66cce808	[AutoDiff upstream] Add differentiation transform. The differentiation transform does the following: - Canonicalizes differentiability witnesses by filling in missing derivative function entries. - Canonicalizes `differentiable_function` instructions by filling in missing derivative function operands. - If necessary, performs automatic differentiation: generating derivative functions for original functions. - When encountering non-differentiable code, produces a diagnostic and errors out. Partially resolves TF-1211: add the main canonicalization loop. To incrementally stage changes, derivative functions are currently created with empty bodies that fatal error with a nice message. Derivative emitters will be upstreamed separately.	2020-04-02 15:43:57 -07:00
Robert Widmann	1f904ca86d	Merge pull request #30669 from CodaFi/consteval [NFC] A Handful of Sweeping Evaluator Cleanups	2020-03-27 09:43:28 -07:00
Robert Widmann	987cd55f50	[NFC] Drop llvm::Expected from Evaluation Points A request is intended to be a pure function of its inputs. That function could, in theory, fail. In practice, there were basically no requests taking advantage of this ability - the few that were using it to explicitly detect cycles can just return reasonable defaults instead of forwarding the error on up the stack. This is because cycles are checked by the Evaluator, and are unwound by the Evaluator. Therefore, restore the idea that the evaluate functions are themselves pure, but keep the idea that evaluation of those requests may fail. This model enables the best of both worlds: we not only keep the evaluator flexible enough to handle future use cases like cancellation and diagnostic invalidation, but also request-based dependencies using the values computed at the evaluation points. These aforementioned use cases would use the llvm::Expected interface and the regular evaluation-point interface respectively.	2020-03-26 23:08:02 -07:00
Robert Widmann	f2a1abc5dd	[NFC] Refactor Side-Effecting Requests to be Explicitly so Introduce evaluator::SideEffect, the type of a request that performs some operation solely to execute its side effects. Thankfully, there are precious few requests that need to use this type in practice, but it's good to call them out explicitly so we can get around to making them behave much more functionally in the future.	2020-03-26 22:55:20 -07:00
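
The rewrites listed for the string optimization in commit 7f684b62e2 can be illustrated at the source level. This is only a sketch: the function names below are hypothetical, the actual pass operates on high-level SIL rather than on Swift source, and the assertions merely check that each original form and its replacement produce the same value.

```swift
// Source-level illustration of the rewrites named in commit 7f684b62e2.
// All function names are hypothetical; the real pass works on SIL.

// Rule: x.append(y) with x empty can be replaced by x = y.
func appendToEmpty(_ y: String) -> String {
    var x = ""
    x.append(y)
    return x
}

// Rule: x.append("") can be removed.
func appendEmptyLiteral(_ s: String) -> String {
    var x = s
    x.append("")
    return x
}

// Rule: x.append(y) with constant x and y can be replaced by x = x + y,
// which can then be folded to a single constant.
func appendConstants() -> String {
    var x = "foo"
    x.append("bar")
    return x
}

// Example from the commit message: an interpolation of a statically
// known type can be folded to a constant string.
func describeIntType() -> String {
    return "the \(Int.self) type"
}

assert(appendToEmpty("abc") == "abc")
assert(appendEmptyLiteral("abc") == "abc")
assert(appendConstants() == "foo" + "bar")
assert(describeIntType() == "the Int type")
```

Whether the compiler actually folds any of these calls depends on the optimization level and on the pass seeing constant operands; the assertions hold either way.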


658 Commits