swift-mirror

mirror of https://github.com/apple/swift.git synced 2025-12-14 20:36:38 +01:00

Author	SHA1	Message	Date
Michael Gottesman	828bf45e4e	[rbi] Ensure that we error when two 'inout sending' parameters are in the same region at end of function. The caller is allowed to assume that the 'inout sending' parameters are not in the same region on return so can be sent to different isolation domains safely. To enforce that we have to ensure on return that the two are /actually/ not in the same region. rdar://138519484	2025-10-17 12:07:09 -07:00
Michael Gottesman	391ad21583	[rbi] Make all PartitionOpErrors noncopyable types. I am going to be adding support to passes for emitting IsolationHistory behind a flag. As part of this, we need to store the state of the partition that created the error when the error is emitted. A partition stores heap memory so it makes sense to make these types noncopyable types so we just move the heap memory rather than copying it all over the place.	2025-08-29 15:18:55 -07:00
Michael Gottesman	8745ab00de	[rbi] Teach RegionIsolation how to properly error when 'inout sending' params are returned. We want 'inout sending' parameters to have the semantics that not only are they disconnected on return from the function but additionally they are guaranteed to be in their own disconnected region on return. This implies that we must emit errors when an 'inout sending' parameter or any element that is in the same region as the current value within an 'inout sending' parameter is returned. This commit contains a new diagnostic for RegionIsolation that adds specific logic for detecting and emitting errors in these situations. To implement this, we introduce 3 new diagnostics with each individual diagnostic being slightly different to reflect the various ways that this error can come up in source: * Returning 'inout sending' directly: ```swift func returnInOutSendingDirectly(_ x: inout sending NonSendableKlass) -> NonSendableKlass { return x // expected-warning {{cannot return 'inout sending' parameter 'x' from global function 'returnInOutSendingDirectly'}} // expected-note @-1 {{returning 'x' risks concurrent access since caller assumes that 'x' and the result of global function 'returnInOutSendingDirectly' can be safely sent to different isolation domains}} } ``` * Returning a value in the same region as an 'inout sending' parameter. E.x.: ```swift func returnInOutSendingRegionVar(_ x: inout sending NonSendableKlass) -> NonSendableKlass { var y = x y = x return y // expected-warning {{cannot return 'y' from global function 'returnInOutSendingRegionVar'}} // expected-note @-1 {{returning 'y' risks concurrent access to 'inout sending' parameter 'x' since the caller assumes that 'x' and the result of global function 'returnInOutSendingRegionVar' can be safely sent to different isolation domains}} } ``` * Returning the result of a function or computed property that is in the same region as the 'inout parameter'. ```swift func returnInOutSendingViaHelper(_ x: inout sending NonSendableKlass) -> NonSendableKlass { let y = x return useNonSendableKlassAndReturn(y) // expected-warning {{cannot return result of global function 'useNonSendableKlassAndReturn' from global function 'returnInOutSendingViaHelper'}} // expected-note @-1 {{returning result of global function 'useNonSendableKlassAndReturn' risks concurrent access to 'inout sending' parameter 'x' since the caller assumes that 'x' and the result of global function 'returnInOutSendingViaHelper' can be safely sent to different isolation domains}} } ``` Additionally, I had to introduce a specific variant for each of these diagnostics for cases where due to us being in a method, we are actually in our caller causing the 'inout sending' parameter to be in the same region as an actor isolated value: * Returning 'inout sending' directly: ```swift extension MyActor { func returnInOutSendingDirectly(_ x: inout sending NonSendableKlass) -> NonSendableKlass { return x // expected-warning {{cannot return 'inout sending' parameter 'x' from instance method 'returnInOutSendingDirectly'}} // expected-note @-1 {{returning 'x' risks concurrent access since caller assumes that 'x' is not actor-isolated and the result of instance method 'returnInOutSendingDirectly' is 'self'-isolated}} } } ``` * Returning a value in the same region as an 'inout sending' parameter. E.x.: ```swift extension MyActor { func returnInOutSendingRegionLet(_ x: inout sending NonSendableKlass) -> NonSendableKlass { let y = x return y // expected-warning {{cannot return 'y' from instance method 'returnInOutSendingRegionLet'}} // expected-note @-1 {{returning 'y' risks concurrent access to 'inout sending' parameter 'x' since the caller assumes that 'x' is not actor-isolated and the result of instance method 'returnInOutSendingRegionLet' is 'self'-isolated}} } } ``` * Returning the result of a function or computed property that is in the same region as the 'inout parameter'. ```swift extension MyActor { func returnInOutSendingViaHelper(_ x: inout sending NonSendableKlass) -> NonSendableKlass { let y = x return useNonSendableKlassAndReturn(y) // expected-warning {{cannot return result of global function 'useNonSendableKlassAndReturn' from instance method 'returnInOutSendingViaHelper'; this is an error in the Swift 6 language mode}} // expected-note @-1 {{returning result of global function 'useNonSendableKlassAndReturn' risks concurrent access to 'inout sending' parameter 'x' since the caller assumes that 'x' is not actor-isolated and the result of instance method 'returnInOutSendingViaHelper' is 'self'-isolated}} } } ``` To implement this, I used two different approaches depending on whether or not the returned value was generic or not. * Concrete In the case where we had a concrete value, I was able to in simple cases emit diagnostics based off of the values returned by the return inst. In cases where we phied together results due to multiple results in the same function, we determine which of the incoming phied values caused the error by grabbing the exit partition information of each of the incoming value predecessors and seeing if an InOutSendingAtFunctionExit would emit an error. * Generic In the case of generic code, it is a little more interesting since the result is a value stored in an our parameter instead of being a value directly returned by a return inst. To work around this, I use PrunedLiveness to determine the last values stored into the out parameter in the function to avoid having to do a full dataflow. Then I take the exit blocks where we assign each of those values and run the same check as we do in the direct phi case to emit the appropriate error. rdar://152454571	2025-08-25 14:57:44 -07:00
Michael Gottesman	72acb3d464	Update unit test	2024-12-02 16:54:13 -05:00
Michael Gottesman	32b4de60a9	Rename transfer -> send. Accomplished using clangd's rename functionality.	2024-11-04 15:17:51 -08:00
Michael Gottesman	2b150bfe37	Fix unittest	2024-10-25 16:57:55 -07:00
Michael Gottesman	f23ad55acb	[region-isolation] Eliminate CRTP for errors and just pass through an error struct instead. This is going to let me just pass through the error struct to the diagnostic rather than having the CRTP and then constructing an info object per CRTP. Currently, to make it easier to refactor, I changed the code in TransferNonSendable to just take in the new error and call the current CRTP routines. In the next commit, I am going to refactor TransferNonSendable.cpp itself. This just makes it easier to test that I did not break anything.	2024-10-24 15:00:43 -07:00
Michael Gottesman	eda603d2c9	[region-isolation] Add mocking for getSourceInst so that unittests still work.	2024-09-04 12:55:09 -07:00
Michael Gottesman	10862642ca	[region-isolation] Stub out PartitionOpEvaluator::doesFunctionHaveSendingResult() so that the unittests can override it. The unittests for PartitionUtils pass in mocked operands and instructions that cannot be dereferenced. Adding this static CRTP helper allows for the unittest PartitionOpEvaluator subclass to just return false for it instead of dereferencing operands or instructions. The rest of the evaluators just get to use the default "normal" implementation that actually accesses program state.	2024-07-18 22:35:52 -07:00
Michael Gottesman	3f26d08ee4	[region-isolation] Add the ability in SILIsolationInfo to represent a disconnected value that is nonisolated(unsafe).	2024-05-27 21:25:44 -07:00
Michael Gottesman	1113a61f8d	[region-isolation] Stub out getIsolationInfo so the mocking unit tests succeed. Specifically, the partition unit tests pass in bogus instructions/operands so we cannot call /any/ methods on them. So I created stubed out helpers on the evaluator that in the case of mocking just return a default initialized SILIsolationInfo().	2024-04-19 15:28:48 -07:00
Michael Gottesman	ca8179aa7c	[region-isolation] Track operand info in a separate map rather than inline in a TransferringOperand data structure. This is backing out an approach that I thought would be superior, but ended up causing problems. Originally, we mapped a region number to an immutable pointer set containing Operand * where the region was tranferred. This worked great for a time... until I began to need to propagate other information from the transferring code in the analysis to the actual diagnostic emitter. To be able to do that, my thought was to make a wrapper type around Operand called TransferringOperand that contained the operand and the other information I needed. This seemed to provide me what I wanted but I later found that since the immutable pointer set was tracking TransferringOperands which were always newly wrapped with an Operand , we actually always created new pointer sets. This is of course wasteful from a memory perspective, but also prevents me from tracking transferring operand sets during the dataflow since we would never converge. In this commit, I fix that issue by again tracking just an Operand in the TransferringOperandSet and instead map each operand to a state structure which we merge dataflow state into whenever we visit it. This provides us with everything we need to in the next commit to including a region -> transferring operand set equality check in our dataflow equations and always converge.	2024-04-10 10:30:10 -07:00
Michael Gottesman	cc1a873b9e	Fix another mocking issue with PartitionUtils unittest.	2024-04-06 01:39:00 -07:00
Michael Gottesman	d6b4f16382	[region-isolation] Add a SILLocation to SequenceBoundary in IsolationHistory. We package all isolation history nodes from a single instruction by placing a sequence boundary at the bottom. When ever we pop, we actually pop a PartitionOp at a time meaning that we pop until we see a SequenceBoundary. Thus the sequence boundary will always be the last element visited when popping meaning that it is a convenient place to stick the SILLocation associated with the entire PartitionOp. As a benefit, there was some unused space in IsolationHistory::Node for that case since we were not using the std::variant field at all.	2024-04-06 00:58:10 -07:00
Michael Gottesman	aee5a37d9d	[region-isolation] Add isolation history support but do not wire it up to the checker. This means that I added an IsolationHistory field to Partition. Just upstreaming the beginning part of this work. I added some unittests to exercise the code as well. NOTE: This means that I did need to begin tracking an IsolationHistoryFactory and propagating IsolationHistory in the pass itself... but we do not use it for anything. A quick overview of the design. IsolationHistory is the head of an immutable directed acyclic graph. It is actually represented as an immutable linked list with a special node that ties in extra children nodes. The users of the information are expected to get a SmallVectorImpl and process those sibling nodes afterwards. The reason why we use an immutable approach is that it fits well with the problem and saves space since different partitions could be pointing at the same linked list node. Operations occur on an isolation history by pushing/popping nodes. It is assumed that the user will push nodes in batches with a sequence boundary at the bottom of the addition which signals to stop processing nodes. Tieing this together, each Partition within it contains an IsolationHistory. As the PartitionOpEvaluator applies PartitionOps to Partition in PartitionOpEvaluator::apply, the evaluator also updates the isolation history in the partition by first pushing a SequenceBoundary node and then pushing nodes that will undo the operation that it is performing. This information is used by the method Partition::popHistory. This pops linked list nodes from its history, performing the operation in reverse until it hits a SequenceBoundary node. This allows for one to rewind Partition history. And if one stashes an isolation history as a target, one can even unwind a partition to what its state was at a specific transfer point or earlier. Once we are at that point, we can begin going one node back at a time and see when two values that we are searching for no longer are apart of the same region. That is a place where we want to emit a diagnostic. We then process until we find for both of our values history points where they were the immediate reason why the two regions merge. rdar://123479934	2024-04-06 00:58:05 -07:00
Michael Gottesman	6f66849610	[region-isolation] Do not squelch errors in the unittests. To squelch errors, we need access to functionality not available in the unittests. The unittests do not require this functionality anyways, so just disable squelching during the unittests.	2024-03-26 10:06:21 -07:00
Michael Gottesman	357a53ab48	[region-isolation] Clean up use after transfer error to use the dynamic isolation information of the transfered operand value in its diagnostic message. As an example of the change: - // expected-note @-1 {{'x' is transferred from nonisolated caller to main actor-isolated callee. Later uses in caller could race with potential uses in callee}} + // expected-note @-1 {{transferring disconnected 'x' to main actor-isolated callee could cause races in between callee main actor-isolated and local nonisolated uses}} Part of the reason I am doing this is that I am going to be ensuring that we handle a bunch more cases and I wanted to fix this diagnostic before I added more incaranations of it to the tests.	2024-03-22 13:12:51 -07:00
Michael Gottesman	226c4ac187	Fix unittest	2024-03-21 14:16:20 -07:00
Michael Gottesman	465bb230c4	[region-isolation] Rename callback handleFailure -> handleLocalUseAfterTransfer. Now that we have other forms of error callbacks, having such a general name for any specific failure is misleading and hinds intent.	2024-03-18 12:13:30 -07:00
Ben Barham	f292ec9784	Use the new template deduction guides rather than `makeArrayRef` LLVM has removed `make*ArrayRef`, migrate all references to their constructor equivalent.	2024-02-23 20:04:51 -08:00
Michael Gottesman	434261b851	[region-isolation] When changing an elements region, if that element was the last element in a transferred regionl, remove that region from the transferredOpMap. I also added some validation that we properly do this. It only runs when NDEBUG is not set. rdar://122280930	2024-02-04 14:16:24 -08:00
Michael Gottesman	398fa8b10f	[region-isolation] Make PartitionOpEvaluator use CRTP instead of std::function callbacks. Just a fixup requested by reviewers of incoming code that I wanted to do in a follow on commit.	2023-12-04 13:03:15 -06:00
Michael Gottesman	12573d6b52	[region-isolation] Instead of just tracking a single transferring instruction, track all of them. Previously I avoided doing this since the only problem would be that in a case where we had two transfer instructions that were in an if-else block, we would just emit an error for one: ```swift if boolValue { transfer(x) } else { transfer(x) // Only emit error for this transfer! } useValue(x) ``` Now that we are tracking at the transfer point if any element in the transfer was captured in a closure, this becomes an actual semantic issue since if we track the transfer instruction that isn't reachable from the closure capture, we will not become more pessimistic: ```swift if boolValue { closure = { useInOut(&x) } transfer(x) } else { transfer(x) } // Since we grab from the else block, sendableField is allowed to be accessed // since we do not track that x was captured by reference in a closure. x.sendableField useValue(x) ``` To be truly safe, we need to emit both errors. rdar://119048779	2023-12-01 13:53:57 -08:00
Michael Gottesman	a2604dcafa	[region-isolation] Ensure that we error if we access a Sendable field of a non-Sendable typed var and the var is captured in a closure If the var is captured in a closure before it is transferred, it is not safe to access the Sendable field since we may race on accessing the field with an assignment to the field in another concurrency domain. rdar://115124361	2023-12-01 13:53:56 -08:00
Michael Gottesman	18f91c0acd	[region-isolation] Add support for async let. Specifically: 1. If the value is transferred such that it becomes part of an actor region, the value is permanently part of the actor region as one would normally have. 2. If the value is just used in an async let or is used by a nonisolated async function within the async let then while the async let is alive it cannot be used. But once the async let has been awaited upon, we allow for it to be used again. rdar://117506395	2023-11-28 09:39:04 -08:00
Michael Gottesman	957a79f82a	[region-isolation] Track operands instead of SILInstructions for Transfer instructions. This is another NFC refactor in preparation for changing how we emit errors. Specifically, we need access to not only the instruction, but also the specific operand that the transfer occurs at. This ensures that we can look up the specific type information later when we emit an error rather than tracking this information throughout the entire pass.	2023-11-15 18:58:06 -08:00
Michael Gottesman	c336f3a47e	[region-isolation] Track transferring separately from region information. What this does is really split the one dataflow we are performing into two dataflows we perform at the same time. The first dataflow is the region dataflow that we already have with transferring never occurring. The second dataflow is a simple gen/kill dataflow where we gen on a transfer instruction and kill on AssignFresh. What it tracks are regions where a specific element is transferred and propagates the region until the element is given a new value. This of course means that once the dataflow has converged, we have to emit an error not if the value was transferred, but if any value in its region was transferred.	2023-11-10 12:48:45 -08:00
Michael Gottesman	dbc3deb382	[region-isolation] Improve logging. Specifically: 1. I changed Partition::apply so that it has an emitLog flag. The reason why I did this is we run apply in a few different situations sometimes when we want to emit logging other times when we really don't. For instance, we want to emit logging when walking instructions and updating the entry partition. On the other hand, we do not want to emit logging if we apply a value to a partition while attempting to determine why an error needed to be emitted. 2. When we create an assign partition op and we see that our destination and source are the same representative, we do not create the actual assign. Before we did not log this so it looked like there was a logic error that was stopping us from emitting a partition op when visiting said instructions. Now, we emit a small logging message so it isn't possible to be confused. 3. Since I am adding another parameter to Partition::apply, I decided to refactor Partition::apply to be in a separate PartitionOpEvaluator data structure that contains the options that we used to pass into Partition::apply. This prevents any mistakes around configuring Partition::apply since the fields provide nice names/common sense default values.	2023-11-01 21:35:32 -07:00
Michael Gottesman	c9e750ba18	[region-isolation] Clean up/standardize comments.	2023-10-26 12:25:21 -07:00
Michael Gottesman	4c7e13d3cb	[region-isolation] Fix join to be a true union operation including the symmetric difference. We were performing a union on the intersection of the lhs/rhs but were dropping the parts of lhs/rhs that were in the symmetric difference of the two sets. Without this, we would not diagnose cases like this where we had elements on the lhs/rhs that were not in the intersection. ``` var closure: () -> () = {} await transferToMain(closure) if await booleanFlag { closure = { print(self.klass) } } else { closure = {} } // At this point we would lose closure since they were different elements await transferToMain(closure) // We wouldn't error on this! ``` rdar://117437059	2023-10-25 17:05:11 -07:00
Michael Gottesman	c474f6c5ab	[region-isolation] Move unittests from SIL to SILOptimizer. This is the correct thing to do since the header is in SILOptimizer. That being said the reason why I am doing this is that I want to add a command line flag to PartitionUtils.h to allow for more verbose debug output and the flag's definition will be in the SILOptimizer library. So this is just a little cleanup that follows from that.	2023-10-23 12:24:17 -07:00

31 Commits