The current system is based on MetadataCompletionQueueEntry
objects which are allocated and then enqueued on dependencies.
Blocking is achieved using a condition variable associated
with the lock on the appropriate metadata cache. Condition
variables are inherently susceptible to priority inversions
because the waiting threads have no dynamic knowledge of
which thread will notify the condition. In the current system,
threads that unblock dependencies synchronously advance their
dependent metadata completions, which means the identity of the
signaling thread is unpredictable even if condition variables could
express it. As a result, the current system is wholly unsuited
for eliminating these priority inversions.
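As a concrete illustration, the old blocking pattern is roughly the
following sketch; the type and function names are illustrative
stand-ins, not the actual runtime entries.

    #include <pthread.h>

    // Illustrative stand-in for the old scheme: a cache-wide mutex and
    // condition variable shared by every waiter on that cache.
    struct SketchMetadataCacheLock {
      pthread_mutex_t Mutex = PTHREAD_MUTEX_INITIALIZER;
      pthread_cond_t Cond = PTHREAD_COND_INITIALIZER;
    };

    // Blocks until the entry reports enough progress. The waiter has no
    // idea which thread will eventually broadcast, so the OS cannot
    // donate this thread's priority to the thread doing the work.
    static void waitForDependency(SketchMetadataCacheLock &cache,
                                  bool (*hasProgressed)(void *entry),
                                  void *entry) {
      pthread_mutex_lock(&cache.Mutex);
      while (!hasProgressed(entry))
        pthread_cond_wait(&cache.Cond, &cache.Mutex);
      pthread_mutex_unlock(&cache.Mutex);
    }
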
An AtomicWaitQueue is an object containing a lock. The queue
is eagerly allocated, and the lock is held, whenever a thread
is doing work that other threads might wish to block on. In
the metadata completion system, this means whenever we construct
a metadata cache entry and the metadata isn't already allocated
and transitively complete after said construction. Blocking
is done by safely acquiring a shared reference to the queue
object (which, in the current implementation, requires briefly
taking a lock that's global to the surrounding metadata cache)
and then acquiring the contained lock. For typical lock
implementations, this avoids priority inversions by temporarily
propagating the priority of waiting threads to the locking
threads.
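Roughly, the blocking side looks like this sketch; std::mutex and
std::shared_ptr stand in for the runtime's priority-propagating lock
and its own reference counting, and the names are illustrative.

    #include <memory>
    #include <mutex>

    struct SketchWaitQueue {
      // Held by the working thread for the duration of a round of work.
      std::mutex WaitersLock;
    };

    struct SketchCacheEntry {
      // Non-null whenever work is in flight that others might block on.
      std::shared_ptr<SketchWaitQueue> Queue;
    };

    // Stands in for the lock that's global to the surrounding metadata
    // cache, guarding publication of the queue reference.
    static std::mutex SketchCacheLock;

    static void blockOnEntry(SketchCacheEntry &entry) {
      // Safely acquire a shared reference to the queue under the cache
      // lock, held only briefly.
      std::shared_ptr<SketchWaitQueue> queue;
      {
        std::lock_guard<std::mutex> guard(SketchCacheLock);
        queue = entry.Queue;
      }
      if (!queue)
        return;  // nothing in flight; the entry is already usable

      // Block by acquiring the lock the working thread holds. With a
      // priority-propagating lock, this donates the waiter's priority.
      std::lock_guard<std::mutex> wait(queue->WaitersLock);
      // Acquisition means the round finished; the caller re-checks the
      // entry and advances completion itself if needed.
    }
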
Dependencies are unblocked by simply releasing the lock held
in the queue. The unblocking thread doesn't know exactly what
metadata are blocked on it and doesn't make any effort to
directly advance their completion; instead, the blocking
thread will wake up and then attempt to advance the dependent
metadata completion itself, eliminating a source of priority
overhang that affected the old system. Successive rounds of
unblocking (e.g. when a metadata makes partial progress but
isn't yet complete) can be achieved by creating a new queue
and unlocking the old one. We can still record dependencies
and use them to dynamically diagnose metadata cycles.
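The unblocking side, continuing the illustrative types from the sketch
above: the worker publishes a fresh queue for any further round before
unlocking the old one, so newly arriving waiters block on the next
round rather than missing the wakeup.

    static void finishRound(SketchCacheEntry &entry,
                            std::shared_ptr<SketchWaitQueue> old,
                            bool moreWorkRemaining) {
      // Precondition: this thread created `old`, locked
      // old->WaitersLock, and published it as entry.Queue before
      // starting the round's work.
      std::shared_ptr<SketchWaitQueue> next;
      if (moreWorkRemaining) {
        next = std::make_shared<SketchWaitQueue>();
        next->WaitersLock.lock();  // hold the next round's lock first
      }
      {
        std::lock_guard<std::mutex> guard(SketchCacheLock);
        entry.Queue = next;        // nullptr once fully complete
      }
      old->WaitersLock.unlock();   // wakes everyone blocked on this round
    }
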
The new system allocates more eagerly than the old one.
Formerly, metadata completions which were never blocked never
needed to allocate a MetadataCompletionQueueEntry; we were
then unable to actually deallocate those entries once they
were allocated. The new system will allocate a queue for
most metadata completions, although, on the positive side,
we can reliably deallocate these queues. Cache entries are
also now slightly smaller because some of the excess storage
for status has been folded into the queue.
The fast path of an actual read of the metadata remains a
simple load-acquire. Slow paths may require a bit more
locking. On Darwin, the metadata cache lock can now use
os_unfair_lock instead of pthread_mutex_t (which is a massive
improvement) because it does not need to support associated
condition variables.
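In sketch form, the fast path is just this (the per-entry layout is
illustrative):

    #include <atomic>

    struct Metadata;

    static std::atomic<Metadata *> CachedMetadata{nullptr};

    static Metadata *slowPath() {
      // Takes the cache lock, possibly blocks on a wait queue, then
      // publishes the result with a store-release. Elided here.
      return nullptr;
    }

    static Metadata *getMetadataFastPath() {
      // A single load-acquire; only a miss falls into the slow path.
      if (Metadata *value =
              CachedMetadata.load(std::memory_order_acquire))
        return value;
      return slowPath();
    }
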
The excess locking could be eliminated with some sort of generational
scheme. Sadly, such schemes are not portable, and I didn't want to
take that on up front.
rdar://76127798
os_unfair_lock is much smaller than pthread_mutex_t (4 bytes versus 64) and a bit faster.
However, it doesn't support condition variables. Most of our uses of Mutex don't use condition variables, but a few do. Introduce ConditionMutex and StaticConditionMutex, which allow condition variables and continue to use pthread_mutex_t.
On all other platforms, we continue to use the same backing mutex type for both Mutex and ConditionMutex.
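The split looks roughly like the following sketch on Darwin; the real
Mutex/ConditionMutex interfaces in the runtime differ in detail.

    #include <os/lock.h>
    #include <pthread.h>

    // Plain mutex: a 4-byte os_unfair_lock, no condition support.
    class SketchMutex {
      os_unfair_lock Lock = OS_UNFAIR_LOCK_INIT;
    public:
      void lock() { os_unfair_lock_lock(&Lock); }
      void unlock() { os_unfair_lock_unlock(&Lock); }
    };

    // ConditionMutex-style type: keeps pthread_mutex_t so condition
    // variables can still be associated with it.
    class SketchConditionMutex {
      pthread_mutex_t Mutex = PTHREAD_MUTEX_INITIALIZER;
    public:
      void lock() { pthread_mutex_lock(&Mutex); }
      void unlock() { pthread_mutex_unlock(&Mutex); }
      void wait(pthread_cond_t &cond) {
        pthread_cond_wait(&cond, &Mutex);
      }
      void notifyAll(pthread_cond_t &cond) {
        pthread_cond_broadcast(&cond);
      }
    };
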
rdar://problem/45412121
* [Runtime] Switch MetadataCache to ConcurrentReadableHashMap.
Use StableAddressConcurrentReadableHashMap. MetadataCacheEntry's methods for awaiting a particular state assume a stable address: they repeatedly examine `this` in a loop while waiting on a condition variable, so we give entries stable addresses to accommodate that. Some of these caches could tolerate unstable addresses if this code were changed to perform the necessary table lookup each time through the loop instead. Some of them store metadata inline, and we assume metadata never moves, so those have to stay this way.
* Have StableAddressConcurrentReadableHashMap remember the last found entry and check that before doing a more expensive lookup.
* Make a SmallMutex type that stores the mutex data out of line, and use it to get LockingConcurrentMapStorage to fit into the available space on 32-bit.
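For the 32-bit point in particular, the idea behind SmallMutex is
sketched below; this is a hypothetical illustration, not the runtime's
actual implementation.

    #include <mutex>

    // Only a single pointer lives inline in the map entry; the real
    // mutex state is allocated out of line.
    class SketchSmallMutex {
      std::mutex *Impl;
    public:
      SketchSmallMutex() : Impl(new std::mutex()) {}
      ~SketchSmallMutex() { delete Impl; }
      SketchSmallMutex(const SketchSmallMutex &) = delete;
      SketchSmallMutex &operator=(const SketchSmallMutex &) = delete;
      void lock() { Impl->lock(); }
      void unlock() { Impl->unlock(); }
    };
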
rdar://problem/70220660
This gives us faster lookups and a small advantage in memory usage. Most of these maps need stable addresses for their entries, so we add a level of indirection to ConcurrentReadableHashMap for these cases to accommodate that. This costs some extra memory, but it's still a net win.
A new StableAddressConcurrentReadableHashMap type handles this indirection and adds a convenience getOrInsert to take advantage of it.
ConcurrentReadableHashMap is tweaked to avoid any global constructors or destructors when using it as a global variable.
ForeignWitnessTables does not need stable addresses and it now uses ConcurrentReadableHashMap directly.
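The indirection can be pictured with this hypothetical sketch;
std::unordered_map stands in for ConcurrentReadableHashMap, and the
real StableAddressConcurrentReadableHashMap is concurrent and lock-free
for readers in ways this is not.

    #include <memory>
    #include <string>
    #include <unordered_map>

    template <class Value>
    class SketchStableAddressMap {
      // The table stores pointers to separately allocated entries, so
      // an entry's address stays valid even when the table resizes and
      // moves.
      std::unordered_map<std::string, std::unique_ptr<Value>> Table;

    public:
      // getOrInsert-style convenience: the returned pointer remains
      // valid for the lifetime of the map, regardless of later inserts.
      Value *getOrInsert(const std::string &key) {
        auto &slot = Table[key];
        if (!slot)
          slot = std::make_unique<Value>();
        return slot.get();
      }
    };
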
rdar://problem/70056398
The new function swift_compareProtocolConformanceDescriptors calls
through to the preexisting code in MetadataCacheKey which has been
extracted out from MetadataCacheKey::compareWitnessTables into a new
public static function
MetadataCacheKey::compareProtocolConformanceDescriptors.
The new function's availability is "future" for now.
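The shape of the delegation is roughly as follows; this is a simplified
sketch, and the actual declarations in the runtime headers carry ABI
annotations and may differ in signature details.

    struct ProtocolConformanceDescriptor;

    struct MetadataCacheKey {
      // Extracted from the former compareWitnessTables logic.
      static int compareProtocolConformanceDescriptors(
          const ProtocolConformanceDescriptor *lhs,
          const ProtocolConformanceDescriptor *rhs);
    };

    // New public entry point; it simply forwards to the shared code.
    int swift_compareProtocolConformanceDescriptors(
        const ProtocolConformanceDescriptor *lhs,
        const ProtocolConformanceDescriptor *rhs) {
      return MetadataCacheKey::compareProtocolConformanceDescriptors(
          lhs, rhs);
    }
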
This removes the last reference to the `llvm::` namespace in the
standard library. All uses of the LLVMSupport library now are
namespaced into the `__swift::__runtime` namespace. This allows us to
incrementally vend the LLVMSupport library and make the separation
explicit.
This adds a new copy of LLVMSupport into the runtime. This is the final
step before changing the inline namespace for the runtime support. This
will allow us to avoid the ODR violations from the header definitions of
LLVMSupport.
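The wrapping pattern, in simplified form (illustrative; the actual
header arrangement in the runtime differs):

    // Forked LLVMSupport definitions are nested under the runtime's
    // namespace so their symbols cannot collide with an application's
    // own copy of LLVM.
    namespace __swift {
    inline namespace __runtime {
    namespace llvm {

    // ... forked LLVMSupport declarations and definitions ...

    } // namespace llvm
    } // inline namespace __runtime
    } // namespace __swift

    // Runtime code then names these as __swift::__runtime::llvm::...,
    // which mangles differently from upstream llvm:: symbols.
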
LLVMSupport forked at: 22492eead218ec91d349c8c50439880fbeacf2b7
Changes made to LLVMSupport from that revision:
- process.inc forward declares `_beginthreadex` to work around
  compilation issues caused by custom flag handling.
- API changes required that we alter the `Deallocate` routine to
  account for the alignment.
This is a temporary state, meant to simplify the process. We do not use
the entire LLVMSupport library and there is no value in keeping the
entire library. Subsequent commits will prune the library to the needs
of the runtime.
This was identified by UBSAN: signed-integer-overflow. Explicitly mark
the literal as unsigned so that the arithmetic is performed in an
unsigned type and cannot overflow. There is a second raw literal, but
because it is used with `*=`, it is converted to the unsigned type of
the left-hand operand, so that computation already cannot overflow.
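A hypothetical before/after showing the kind of fix involved (the
actual constants and expressions in the runtime differ):

    #include <cstddef>

    size_t combineBad(int seed, int value) {
      // Signed int arithmetic: the multiplication can overflow, which
      // UBSAN reports as signed-integer-overflow.
      return seed * 1000003 + value;
    }

    size_t combineGood(unsigned seed, int value) {
      // Marking the literal unsigned makes the arithmetic unsigned,
      // which wraps with defined behavior.
      return seed * 1000003u + static_cast<unsigned>(value);
    }

    void accumulate(unsigned &state, int value) {
      // With `*=`, the literal converts to the unsigned type of the
      // left-hand side before multiplying, so no suffix is needed.
      state *= 1000003;
      state += static_cast<unsigned>(value);
    }
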
Rather than scanning through the generic parameters and generic requirements
each time we form a key for the generic metadata cache, compute these
values once, when the cache itself is first initialized.
Rather than scanning the type descriptor each time we perform a comparison
or hash of a metadata cache entry, do so only once to establish the number
of key parameters and the number of witness tables. Use those values to
more efficiently compare keys.
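Illustratively, the precomputed values live alongside the cache and the
key comparison walks a known number of words; the field and function
names here are made up for the sketch.

    #include <cstdint>
    #include <cstring>

    struct SketchGenericCacheLayout {
      uint16_t NumKeyParameters;  // computed once from the generic params
      uint16_t NumWitnessTables;  // computed once from the requirements
    };

    static bool sameKeyParameters(const SketchGenericCacheLayout &layout,
                                  const void *const *lhs,
                                  const void *const *rhs) {
      // Key metadata arguments compare by pointer identity; witness
      // tables need the deeper, descriptor-based comparison discussed
      // next.
      return std::memcmp(lhs, rhs,
                         layout.NumKeyParameters * sizeof(void *)) == 0;
    }
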
Metadata uniquing might encounter witness tables that were distinctly
generated but come from identical descriptors. Handle this case in metadata
uniquing by looking into the protocol conformance descriptors themselves.
- `swift_getForeignTypeMetadata` is now a request/response function.
- The initialization function is now a completion function, and the
pointer to it has moved into the type descriptor.
- The cache variable is no longer part of the ABI; it's an
implementation detail of the access function.
- The two points above mean that there is no special header on foreign
type metadata and therefore that they can be marked constant when
there isn't something about them that needs to be initialized.
The only foreign-metadata initialization we actually do right now is
of the superclass field of a foreign class, and since that relationship
is a proper DAG, it's not actually possible to have recursive
initialization problems. But this is the right long-term thing to do,
and it removes one of the last two clients of once-based initialization.
I was going to put this off for a while, but it turns out that a lot of
my testcases are enums with multi-payload cases, which we currently
compile as tuples, so they were all still hanging until this patch.
I de-templated MetadataState and MetadataRequest because we weren't
relying on the template and because using the template was causing
conversion problems due to the inability to directly template an enum
in C++.
This includes generic and non-generic global access
functions, protocol associated type access functions,
swift_getGenericMetadata, and generic type completion functions.
The main part of this change is that the functions now need to take
a MetadataRequest and return a MetadataResponse, which is capable
of expressing that the request can fail. The state of the returned
metadata is reported as a second, independent return value; this
allows the caller to easily check the possibility of failure without
having to mask it out from the returned metadata pointer, as well
as allowing it to be easily ignored.
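The returned pair is roughly of this shape; the sketch below is
simplified, and the real definitions carry ABI attributes and a richer
set of states.

    #include <cstddef>

    struct Metadata;

    enum class SketchMetadataState : size_t {
      Complete,
      NonTransitivelyComplete,
      LayoutComplete,
      Abstract,
    };

    struct SketchMetadataResponse {
      const Metadata *Value;      // the metadata, possibly incomplete
      SketchMetadataState State;  // how complete it actually is
    };

    // The caller checks the state directly instead of masking bits out
    // of the metadata pointer, and can ignore it entirely if it only
    // needs abstract metadata.
    static const Metadata *requireComplete(SketchMetadataResponse r) {
      return r.State == SketchMetadataState::Complete ? r.Value : nullptr;
    }
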
Also, change metadata access functions to use swiftcc to ensure that
this return value is indeed returned in two separate registers.
Also, change protocol associated conformance access functions to use
swiftcc. This isn't really related, but for some reason it snuck in.
Since it's clearly the right thing to do, and since I really didn't
want to retroactively tease that back out from all the rest of the
test changes, I've left it in.
Also, change generic metadata access functions to either pass all
the generic arguments directly or pass them all indirectly. I don't
know how we ended up with the hybrid approach. I needed to change all
the code-generation and calls here anyway in order to pass the request
parameter, and I figured I might as well change the ABI to something
sensible.
I was trying to make the entry-delegation thing do *way* too much.
Just give the entry access to the lock/queue and introduce subclasses
which simplify most of the work.
Also, fix some bad reasoning around the attempts to avoid acquiring
locks in the absence of waiters. It really is always necessary to
acquire the lock when notifying; waiters cannot atomically set the
has-waiters flag and wait, so we have to protect against the
possibility that we notify before they can wait.
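The race in question, in sketch form; the structure is illustrative,
but it shows why an unlocked notification can be lost when setting the
has-waiters flag and waiting are two separate steps.

    #include <condition_variable>
    #include <mutex>

    struct SketchEntry {
      std::mutex Lock;
      std::condition_variable Cond;
      bool HasWaiters = false;
      bool Done = false;
    };

    static void waiter(SketchEntry &entry) {
      std::unique_lock<std::mutex> guard(entry.Lock);
      entry.HasWaiters = true;                             // step 1
      entry.Cond.wait(guard, [&] { return entry.Done; });  // step 2
    }

    static void notifierBroken(SketchEntry &entry) {
      // WRONG: this unlocked check can observe the flag still clear
      // while the waiter is between its last check of Done and actually
      // sleeping; Done gets set, the notification is skipped, and the
      // waiter sleeps forever.
      if (!entry.HasWaiters) {
        entry.Done = true;
        return;
      }
      std::lock_guard<std::mutex> guard(entry.Lock);
      entry.Done = true;
      entry.Cond.notify_all();
    }

    static void notifierCorrect(SketchEntry &entry) {
      // Always acquire the lock before notifying. The waiter holds it
      // from setting the flag until the wait atomically releases it,
      // so no wakeup can slip through the gap.
      std::lock_guard<std::mutex> guard(entry.Lock);
      entry.Done = true;
      entry.Cond.notify_all();
    }
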
Change generic witness table instantiation to use a more lightweight
entry scheme that allocates the witness table as part of the entry.
In contrast, change generic metadata instantiation to use a more
straightforward allocation scheme where the metadata is a totally
independent allocation.
This is preparation for proper cyclic-dependency handling.
This seems to more than fix a performance regression that we
detected on a metadata-allocation microbenchmark.
A few months ago, I improved the metadata cache representation
and changed the metadata allocation scheme to primarily use malloc.
Previously, we'd been using malloc in the concurrent tree data
structure but a per-cache slab allocator for the metadata itself.
At the time, I was concerned about the overhead of per-cache
allocators, since many metadata patterns see only a small number
of instantiations. That's still an important factor, so in the
new scheme we're using a global allocator; but instead of using
malloc for individual allocations, we're using a slab allocator,
which should have better peak, single-thread performance, at the
cost of not easily supporting deallocation. Deallocation is
only used for metadata when there's contention on the cache, and
specifically only when there's contention for the same key, so
leaking a little isn't the worst thing in the world.
The initial slab is a 64K globally-allocated buffer.
Successive slabs are 16K and allocated with malloc.
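In sketch form, the strategy is something like this; the sizes come
from the description above, but the bookkeeping is illustrative and
omits the real allocator's alignment handling, oversized requests, and
thread safety.

    #include <cstddef>
    #include <cstdlib>

    // A 64K statically-allocated initial slab; later slabs are 16K and
    // come from malloc. Allocation is a bump of a pointer; individual
    // deallocation is not supported.
    static char InitialSlab[64 * 1024];

    static char *SlabNext = InitialSlab;
    static size_t SlabRemaining = sizeof(InitialSlab);

    static void *slabAllocate(size_t size) {
      if (size > SlabRemaining) {
        const size_t slabSize = 16 * 1024;
        SlabNext = static_cast<char *>(std::malloc(slabSize));
        SlabRemaining = slabSize;
      }
      void *result = SlabNext;
      SlabNext += size;
      SlabRemaining -= size;
      return result;
    }
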
rdar://28189496
IIRC we never had any evidence that the performance impact of a
separate allocator here was actually measurable, and it does come
at a significant fragmentation cost because every single cache
allocates at least a page of memory. Sharing that with the system
allocator makes more sense, even if these allocations are typically
permanent.
This also means that standard memory-debugging tools will actually
find problems with out-of-bounds accesses to metadata.
MetadataCache's allocator into it.
The major functional change here is that MetadataCache will now use
the slab allocator for tree nodes, but I also switched the Hashable
conformances cache to use ConcurrentMap directly instead of a
Lazy<ConcurrentMap<>>.
- added read / write lock support
- added non-fatal error support to allow use of mutex in fatal error reporting pathway
- isolated the pthread implementation into its own header/cpp file pair
- expanded unit tests to cover new code as well as better test existing mutex
- removed a layer of complexity that added no real value
initialization in-place on demand. Initialize parent metadata
references correctly on struct and enum metadata.
Also includes several minor improvements related to relative
pointers that I was using before deciding to simply switch the
parent reference to an absolute reference to get better access
patterns.
Includes a fix since the earlier commit to make enum metadata
writable if they have an unfilled payload size. This didn't show
up on Darwin because "constant" is currently unenforced there in
global data containing relocations.
This patch requires an associated LLDB change which is being
submitted in parallel.