swift-mirror

mirror of https://github.com/apple/swift.git synced 2025-12-21 12:14:44 +01:00

Author	SHA1	Message	Date
Michael Gottesman	606df10725	doxygenify file level comment.	2016-01-03 18:44:31 -06:00
Michael Gottesman	e1ef60987c	Revert "[codegardening] Move MemBehaviorDumper from AADumper.cpp into its own file MemoryBehaviorDumper.cpp." Revert "Make AADumper and MemoryBehaviorDumper function passes. They do not need to be module passes." This reverts commit `a503269e2d`. This reverts commit `375f525c51`. Turns out we /do/ want these two passes to be module passes so that their output is independent of how the pass manager schedules function passes. I tried to just fix the issue in MemBehaviorDumper/AADumper without reverting, but somehow this caused their tests to start failing?! I will try separating them again in a subsequent commit.	2016-01-03 18:41:34 -06:00
practicalswift	31ff35e1dd	Use 80 column headers consistently.	2016-01-04 01:35:02 +01:00
Michael Gottesman	a503269e2d	[codegardening] Move MemBehaviorDumper from AADumper.cpp into its own file MemoryBehaviorDumper.cpp.	2016-01-03 16:32:59 -06:00
Michael Gottesman	375f525c51	Make AADumper and MemoryBehaviorDumper function passes. They do not need to be module passes.	2016-01-03 16:27:02 -06:00
Michael Gottesman	11b2a7d29c	[arc-sequence-opts] Add more log output. This makes it easy to see which individual retains/releases are partial/known safe so one can reason about why the final set is partial/known safe (or not).	2016-01-03 16:02:41 -06:00
Mark Lacey	149e1e4059	Fix 80-column violations.	2016-01-03 13:15:56 -08:00
practicalswift	6c32688275	Fix recently introduced typos.	2016-01-03 21:16:23 +01:00
Xin Tong	4ea79fec2b	There are simply too many locations and too many basic blocks in some functions for dead store elimination to handle. In the worst case, The number of memory behavior or alias queries we need to do is roughly linear to the # BBs x(times) # of locations. Put in some heuristic to trade off accuracy for compilation time. NOTE: we are not disabling DSE for these offending functions. instead we are running a one iteration pessimistic data flow as supposed to the multiple iteration optimistic data flow we've done previously. With this change. I see compilation time on StdlibUnitTest drops significantly. 50%+ drop in time spent in DSE in StdlibUnit with a release compiler. I will update more Instruments data post-commit once i get close to my desktop. I see a slight drop in # of dead stores (DS) elimination in stdlib and stdlibUnit test. stdlib: 203 DS -> 202 DS. (RLE is affected slightly as well. 6313 -> 6295 RL). stdlibunittest : 43 DS -> 42. (RLE is not affected). We are passing all existing dead store tests.	2016-01-03 10:48:02 -08:00
Xin Tong	013d08d439	Add a bailout location # threshold in DSE. In StdlibUnitTest, there is this function that has too many (2450) LSLocations and the data flow in DSE takes too long to converge. StdlibUnittest.TestSuite.(addForwardRangeReplaceableCollectionTests <A, B where A: Swift.RangeReplaceableCollectionType, B: Swift.RangeReplaceableCollectionType, A.SubSequence: Swift.CollectionType, B.Generator.Element: Swift.Equatable, A.SubSequence == A.SubSequence.SubSequence, A.Generator.Element == A.SubSequence.Generator.Element> (Swift.String, makeCollection : ([A.Generator.Element]) -> A, wrapValue : (StdlibUnittest.OpaqueValue<Swift.Int>) -> A.Generator.Element, extractValue : (A.Generator.Element) -> StdlibUnittest.OpaqueValue<Swift.Int>, makeCollectionOfEquatable : ([B.Generator.Element]) -> B, wrapValueIntoEquatable : (StdlibUnittest.MinimalEquatableValue) -> B.Generator.Element, extractValueFromEquatable : (B.Generator.Element) -> StdlibUnittest.MinimalEquatableValue, checksAdded : StdlibUnittest.Box<Swift.Set<Swift.String>>, resiliencyChecks : StdlibUnittest.CollectionMisuseResiliencyChecks, outOfBoundsIndexOffset : Swift.Int) -> ()).(closure #18) This function alone takes ~20% of the total amount of time spent in DSE in StdlibUnitTest. And DSE does not eliminate any dead store in the function either. I added this threshold to abort on functions that have too many LSLocations. I see no difference in # of dead store eliminated in the Stdlib.	2016-01-02 17:37:53 -08:00
Michael Gottesman	9e551dd237	Doxygenify file level comment.	2016-01-02 16:41:36 -06:00
Michael Gottesman	3d8433b7f6	[arc] Add in a new semantics attribute called "arc.programtermination_point" and document it. If this semantic tag is applied to a function, then we know that: - The function does not touch any reference counted objects. - After the function is executed, all reference counted objects are leaked (most likely in preparation for program termination). This allows one, when performing ARC code motion, to ignore blocks that contain an apply to this function as long as the block does not have any other side effect having instructions. I have wanted to do this for a while but was stymied by lacking the ability to apply multiple @_semantics attributes. This is now committed to trunk so I added this attribute instead of pattern matching against fatalError (since there could be other functions with this property). rdar://19592537	2016-01-02 16:22:06 -06:00
Michael Gottesman	389238e801	Add support for multiple @_semantics attributes at the SIL level. This is something that we have wanted for a long time and will enable us to remove some hacks from the compiler (i.e. how we determine in the ARC optimizer that we have "fatalError" like function) and also express new things like "noarc".	2016-01-02 04:17:07 -06:00
Xin Tong	310f48eab0	Improve dead store elimination compilation time. If we know a function is not a one iteration function which means its its BBWriteSetIn and BBWriteSetOut have been computed and converged, and a basic block does not even have StoreInsts, there is no point in processing every instruction in the last iteration of the data flow again as no store will be eliminated. We can simply skip the basic block and rely on the converged BBWriteSetIn to process its predecessors. Compilation time improvement: 1.7% to 1.5% of overall compilation time. on stdlib -O. This represents a 4.0% of all SILOptimzations(37.2%) Existing tests ensure correctness.	2016-01-01 10:16:42 -08:00
Chris Lattner	2af78aede1	forward declare ASTWalker in ASTNode.h instead of including its header, NFC.	2015-12-31 21:05:13 -08:00
Zach Panzarino	e3a4147ac9	Update copyright date	2015-12-31 23:28:40 +00:00
practicalswift	2e995a8ba4	Fix recently introduced typos.	2015-12-31 15:28:54 +01:00
Xin Tong	a35eabd6f7	Instead of enumerating all the LSValues before RLE is ran on the function. we enumerate them lazily. This leads to compilation time improvement, as some of the LSValues previously enumerated do not be created in this approach. i.e. we enumerate LSValues created by loads previously, but the LoadInsts could be target for RLE. In such case, the enumerated LSValues are not used. Compilation time improvement: 1775ms to 1721ms (2.7% to 2.6% of the entire compilation time for stdlib -O). Existing tests ensure correctness. Note: we still enumerate locations, as we need to know how many locations there are in the function to resize the bitvector appropriately before the data flow runs.	2015-12-30 14:35:59 -08:00
Michael Gottesman	a06cacbfcb	[projection] Introduce two new types NewProjection and NewProjectionPath. NewProjection is a re-architecting of Projection that supports all of the same functionality as Projection but in 1/3 of the original size (in the common case). It is able to accomplish this by removing the base type out of NewProjection itself and into users such as NewProjectionPath. Thus NewProjection is now strictly an index from some parent type rather than being a parent type and an index. NewProjectionPath also has all of the same functionality as ProjectionPath, but due to NewProjection being smaller than Projection is smaller than ProjectionPath. Used together NewProjection/NewProjectionPath yields the same output as Projection/ProjectionPath when evaluating the LSLocation dumping tests. Additionally, NewProjection is more flexible than Projection and will for free give us the ability to perform AA on index_addr/index_raw_addr as well as be able to integrate casts into the projection paradigm. rdar://22484381	2015-12-29 22:31:09 -06:00
Xin Tong	0c162cd3d1	Refactor in redundant load elimination. NFC. 1. Update some comments. 2. Rename a few functions, e.g. runIterativeDF -> runIterativeRLE, getLSValueBit -> getValueBit. 3. Remove unused headers. 4. Remove no-longer used function, mergePredecessorStates and mergePredecessorState. 5. A few other small NFCs.	2015-12-29 15:35:49 -08:00
practicalswift	6dac32e416	Fix typos in code merged today	2015-12-29 23:19:48 +01:00
Xin Tong	be4bf816bd	Update some comments and rename runIterativeDF -> runIterativeDSE, mergeSuccessorStates -> mergeSuccessorLiveIns in DSE. Also fix some includes.	2015-12-29 10:13:02 -08:00
Chris Lattner	151556707a	Merge pull request #811 from practicalswift/remove-duplicate-includes Remove duplicate #include:s	2015-12-29 08:32:55 -08:00
practicalswift	fb314c37aa	Fix typos introduced yesterday :-)	2015-12-29 13:16:02 +01:00
practicalswift	f34df59dc6	Remove duplicate #include:s	2015-12-29 12:12:26 +01:00
Aaron Raimist	b196e79b88	SimplifyCFG: Fix typo, Fist to First	2015-12-28 23:04:24 -06:00
Xin Tong	cc1dda478e	Move to a genset and killset for the dataflow in redundant load elimination. Previously we process every instruction every time the data flow re-iterates. This is very inefficient. In addition to moving to genset and killset, we also group function into OneIterationFunction which we know that the data flow would converge in 1 iteration and functions that requre the iterative data flow, mostly due to backedges in loops. we process them differently. I observed that there are ~93% of the functions that require just a single iteration to perform the RLE. But the other 7% accounts for 2321 (out of 6318) of the redundant loads we eliminated. This change reduces RLE compilation from 4.1% to 2.7% of the entire compilation time (frontend+OPT+LLVM) on stdlib with -O. This represents 6.9% of the time spent in SILOptimizations (38.8%). ~2 weeks ago, RLE was taking 1.9% of the entire compilation time. It rose to 4.1% mostly due to that we are now eliminating many more redundant loads (mostly thank to Erik's integragtion of escape analysis in alias analysis). i.e. 3945 redundant loads elimnated before Erik's change to 6318 redundant loads eliminated now.	2015-12-28 15:04:58 -08:00
Xin Tong	7c43b47bd0	revert some WIP code	2015-12-28 14:41:16 -08:00
Xin Tong	0c05064429	Speculative disable escape analysis in local variable DSE. There is test case failure on the real device, but not on simulators. The failure could be a result of this change.	2015-12-28 14:39:54 -08:00
practicalswift	149b50d901	Fix typos in code (non-comment/documentation typos).	2015-12-28 11:42:15 +01:00
Chris Lattner	487b0d1365	Merge pull request #796 from practicalswift/fix-comment-typos Fix typos in code comments	2015-12-27 20:46:46 -08:00
Xin Tong	7e49985b00	Turn a function into early exit sytle. NFC	2015-12-27 17:35:24 -08:00
practicalswift	fd70b26033	Fix typos in comments.	2015-12-28 02:15:34 +01:00
practicalswift	d89b4d45e1	Fix typos in code (non-comment typos).	2015-12-27 13:05:01 +01:00
Xin Tong	656894567f	Some of the functions do not really need the iterative data flow in DSE. i.e. for function of which a single post-order would be enough for DSE. In this case, we do not really need to compute the genset and killset (which is a costly operation). On stdlib, i see 93% of the functions are "OneIterationFunction". With this change, i see the compilation time of DSE drops from 2.0% to 1.7% of the entire compilation. This represents 4.3% of all the time spent in SILOptimizations (39.5%).	2015-12-26 20:33:35 -08:00
Chris Lattner	bee0d955ff	Merge pull request #784 from practicalswift/a-vs-an-again Fix typos: "a" vs. "an"	2015-12-26 17:45:36 -08:00
Xin Tong	173fc871ff	Use a SmallBitVector instead of BitVector in DSE. I observed that most functions do not have over 64 locations which makes SmallBitVector a more suitable choice than BitVector. I see a ~10% drop in compilation time in DSE. i.e. 1430ms to 1270ms (2.2% to 2.0% of overall compilation time).	2015-12-26 12:30:49 -08:00
Xin Tong	8be2c51875	Merge 2 steps in data flow in DSE into 1. The more we iterate over the instructions in the function, the more compilation time DSE take. I see a ~10% drop in DSE compilation time, from 2.4% to 2.2%. (Last time (~1 week ago) i checked DSE was taking 1.4% of the compilation time. Now its taking 2.2%. I will look into where the increase come from later).	2015-12-26 11:06:52 -08:00
practicalswift	fa0b339a21	Fix typos.	2015-12-26 17:51:59 +01:00
practicalswift	85e2e6eb9a	Fix a vs. an	2015-12-26 14:40:16 +01:00
practicalswift	db13bcb22e	Fix typos.	2015-12-26 14:11:42 +01:00
Nadav Rotem	07d4558c1c	[Mangler] Change the Swift mangler into a symbol builder. This commit changes the Swift mangler from a utility that writes tokens into a stream into a name-builder that has two phases: "building a name", and "ready". This clear separation is needed for the implementation of the compression layer. Users of the mangler can continue to build the name using the mangleXXX methods, but to access the results the users of the mangler need to call the finalize() method. This method can write the result into a stream, like before, or return an std::string.	2015-12-25 21:40:25 -08:00
practicalswift	22e10737e2	Fix typos	2015-12-26 01:19:40 +01:00
Xin Tong	ed7ef800ab	More refactoring in DSE. Split 1 more function and update some comments.	2015-12-24 16:11:19 -08:00
Xin Tong	30ed2f15aa	Move local variable store checks into helper function. NFC	2015-12-24 15:26:37 -08:00
Xin Tong	1dd13b5d92	Optimize how basic blocks are processed in DSE. I do not see a real compilation time improvement	2015-12-24 15:10:39 -08:00
Xin Tong	6a942d2d20	Split bigger functions into multiple smaller functions in DSE. NFC	2015-12-24 14:58:44 -08:00
Xin Tong	32dc903339	Enable local variable dead store elimination If a variable can not escape the function, we mark the store to it as dead before the function exits. It was disabled due to some TBAA and side-effect analysis changes. We were removing 8 dead stores on the stdlib. With this change, we are now removing 203 dead stores. I only see noise-level performance difference on PerfTestSuite. I do not see real increase on compilation time either.	2015-12-24 14:18:36 -08:00
Dmitri Gribenko	6160b4ef4d	Merge pull request #766 from practicalswift/a-few-more-typos Fix typos	2015-12-24 13:18:33 -08:00
Xin Tong	f9450ea6ab	Avoid extraneous generation of AACache and MBCache keys	2015-12-24 09:24:56 -08:00

... 218 219 220 221 222 ...

11193 Commits