swift-mirror

mirror of https://github.com/apple/swift.git synced 2025-12-14 20:36:38 +01:00

Author	SHA1	Message	Date
Erik Eckstein	16bd756f7b	tests: make some tests more robust against optimizer changes. This is in preparation for COW support. More optimizer tests require an optimized non-assert stdlib build.	2020-05-26 18:01:17 +02:00
Andrew Trick	a17dbc7c74	Enable run-time exclusivity checking in release mode.

This change could impact Swift programs that previously appeared well-behaved, but weren't fully tested in debug mode. Now, when running in release mode, they may trap with the message "error: overlapping accesses...".

Recent optimizations have brought performance where I think it needs to be for adoption. More optimizations are planned, and some benchmarks should be further improved, but at this point we're ready to begin receiving bug reports. That will help prioritize the remaining work for Swift 5.

Of the 656 public microbenchmarks in the Swift repository, there are still several regressions larger than 10%:

TEST                   OLD    NEW    DELTA    RATIO
ClassArrayGetter2      139    1307   +840.3%  0.11x
HashTest               631    1233   +95.4%   0.51x
NopDeinit              21269  32389  +52.3%   0.66x
Hanoi                  1478   2166   +46.5%   0.68x
Calculator             127    158    +24.4%   0.80x
Dictionary3OfObjects   391    455    +16.4%   0.86x
CSVParsingAltIndices2  526    604    +14.8%   0.87x
Prims                  549    626    +14.0%   0.88x
CSVParsingAlt2         1252   1411   +12.7%   0.89x
Dictionary4OfObjects   206    232    +12.6%   0.89x
ArrayInClass           46     51     +10.9%   0.90x

The common pattern in these benchmarks is to define an array of data as a class property and to repeatedly access that array through the class reference. Each of those class property accesses now incurs a runtime call. Naturally, introducing a runtime call in a loop that otherwise does almost no work incurs substantial overhead. This is similar to the issue caused by automatic reference counting. In some cases, more sophisticated optimization will be able to determine that the same object is repeatedly accessed. Furthermore, the overhead of the runtime call itself can be improved. But regardless of how well we optimize, there will always be a class of microbenchmarks in which the runtime check has a noticeable impact.

As a general guideline, avoid performing class property access within the most performance-critical loops, particularly on different objects in each loop iteration. If that isn't possible, it may help if the visibility of those class properties is private or internal.	2018-11-02 16:54:31 -07:00
Erik Eckstein	39bb14b094	change mangling prefix from $S to $s. This is the final ABI mangling prefix. rdar://problem/38471478	2018-09-19 13:55:11 -07:00
Erik Eckstein	1ebe33e03c	GenericSpecializer: Allow simple function specialization cycles. So far we immediately bailed once we detected a cycle in specializations. But it turned out that this prevented efficient code generation for some stdlib functions like compactMap. With this change we allow specialization of cycles up to a depth of 1 (still very limited, to prevent code size explosion in some corner cases). The effect of this optimization is tested with the existing benchmark FatCompactMap. SR-7952, rdar://problem/41005326	2018-07-11 12:45:21 -07:00
Erik Eckstein	cd3d50a5d9	ABI: Change the mangling prefix from _T0 to $S	2018-01-06 13:55:59 -08:00
Pavel Yaskevich	f90d943a29	[Mangling/ABI] NFC: Fix SILOptimizer tests to reflect label mangling changes	2017-12-18 15:45:50 -08:00
Roman Levenstein	ffa927a2dc	Support even more complex cases of generic substitution loops. Allow for cases where the old substitution type `T1` is partially contained in the new substitution type `T2`. Partially contained means that if you drop the common structural "prefix" of `T1` and `T2` to get `T1'` and `T2'`, then `T1'` is strictly contained in `T2'`. E.g. `Outer<Start>` is partially contained in `Outer<Step<Start>>`: if you drop the common prefix `Outer`, then `Start` is contained in `Step<Start>`.	2017-08-07 08:23:46 -07:00
Roman Levenstein	8503daee0d	Implement a more robust way to avoid infinite generic specialization loops. The existing simple mechanism for avoiding infinite generic specialization loops is based on checking the structural depth and width of types passed as generic type parameters. If the depth or the width of a type is above a certain threshold, the type is considered too complex for generic specialization and no specialization is produced. While this approach prevents the possibility of producing an infinite number of generic specializations for ever-growing generic type parameters, it catches the issue too late in some cases, leading to excessive CPU and memory usage. Therefore, the new method tries to solve the problem at its root. An infinite generic specialization loop can be triggered by specializing a given generic call-site if and only if: (1) doing so would result in a loop inside the specialization graph represented by the `GenericSpecializationInformations`, i.e. it would produce direct or indirect recursion involving a generic call; and (2) the substitutions used by the current generic call-site are structurally more complex than the substitutions used by the same call-site in the previous iteration inside the specialization graph. More complex in this context means that the new generic type parameter structurally contains the generic type parameter from a previous iteration inside the specialization graph and has greater structural depth, e.g. `Array<Int>` is more complex than `Int`. The generic specializer now records all the required information about the specializations it produces and uses it later to detect and prevent any generic specializations which would result in an infinite specialization loop. It detects them as early as possible and thus reduces compile times and memory consumption, and potentially also reduces the code size by not generating useless specializations.	2017-08-06 12:51:49 -07:00

8 Commits
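
The "more complex" and "partially contained" criteria described in commits 8503daee0d and ffa927a2dc can be modelled in a few lines. The following is only a toy sketch in Python, not the compiler's implementation: the `Ty` class, the function names, and the representation of the specialization chain as a list of (call-site, substitution) pairs are all assumptions made for illustration.

```python
from dataclasses import dataclass
from typing import List, Tuple


@dataclass(frozen=True)
class Ty:
    """Hypothetical model of a generic type: a name plus type arguments."""
    name: str
    args: Tuple["Ty", ...] = ()


def depth(t: Ty) -> int:
    """Structural depth: 1 for a leaf type, plus the deepest argument."""
    return 1 + max((depth(a) for a in t.args), default=0)


def contains(outer: Ty, inner: Ty) -> bool:
    """True if `inner` occurs anywhere inside `outer` (or equals it)."""
    return outer == inner or any(contains(a, inner) for a in outer.args)


def is_more_complex(new: Ty, old: Ty) -> bool:
    """`new` structurally contains `old` and has greater structural depth."""
    return contains(new, old) and depth(new) > depth(old)


def partially_contained(old: Ty, new: Ty) -> bool:
    """`old` is strictly contained in `new` after dropping a common prefix.

    Assumption of this sketch: a common prefix is a chain of identical outer
    type constructors with the same number of arguments.
    """
    if old != new and contains(new, old):
        return True  # strictly contained with an empty prefix
    if old.name == new.name and len(old.args) == len(new.args):
        # Drop the shared constructor and compare the arguments that differ.
        return any(partially_contained(o, n)
                   for o, n in zip(old.args, new.args) if o != n)
    return False


def would_loop(call_site: str, new_sub: Ty,
               chain: List[Tuple[str, Ty]]) -> bool:
    """True if the same call-site reappears with a growing substitution."""
    return any(site == call_site and
               (is_more_complex(new_sub, old) or
                partially_contained(old, new_sub))
               for site, old in chain)


# Examples taken from the commit messages.
Int = Ty("Int")
Start = Ty("Start")
array_of_int = Ty("Array", (Int,))
outer_start = Ty("Outer", (Start,))
outer_step_start = Ty("Outer", (Ty("Step", (Start,)),))
```

With these definitions, `is_more_complex(array_of_int, Int)` and `partially_contained(outer_start, outer_step_start)` both hold, matching the `Array<Int>` and `Outer<Step<Start>>` examples in the messages. The call-site label passed to `would_loop` is a stand-in for whatever identity the specializer actually records.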