Details
Diff Detail
- Repository
- rL LLVM
Event Timeline
I have left a few comments, but wanted to start with a higher-level design consideration.
I believe we should probably expose the cancellations in the ClangdServer API directly.
The reasons for that are:
- We have an internal client that would also benefit from using the cancellation API and we should try to share as much code as possible between that and the rest of clangd. Creating the cancellation cookies, stashing them in the context are all details that could be handled by ClangdServer so two clients don't have to do the same things.
- ClangdServer seems like a natural layer for adding this, since it's a user-facing async API. It spawns async tasks, so it is a perfect place to provide a way to cancel the spawned tasks as well.
The concrete API that I propose is:
- Relevant methods of ClangdServer (we could start we code completion) that spawn async reqeuests returns a cancellation object that allows to cancel the spawned job:
class TaskHandle { void cancel(); }; class ClangdServer { TaskHandle codeComplete(File, Position, Callback); };
- Users of the API are free to ignore the TaskHandle if they don't need to cancel the corresponding requests.
- If the users are capable of cancelling the request, they are responsible for keeping the TaskHandle around before the corresponding callback is called.
- If the request was cancelled, the Callback might receive the TaskCancelledError in the callback and should handle it accordingly.
The rest of the design LG, i.e. the code that can actually be cancelled is responsible for checking CancellationHandler::HasCancelled and should return a corresponding error in the callback if it was cancelled.
clangd/Cancellation.h | ||
---|---|---|
1 ↗ | (On Diff #159888) | This file could use some comments about the API and samples of how it can be used. |
22 ↗ | (On Diff #159888) | I think we might want to expose the context key for the cancellation primitive here. Exposing a key in the context would allow to avoid doing the lookups in the context every time we need to check if request was cancelled. Note that exposing shared_ptr<atomic<bool>> is probably not a great idea, since it'll allow to write into that object too. I suggest we go with a class that wraps it and only allows to read the value of the variable, i.e. only has isCancelled method. |
23 ↗ | (On Diff #159888) | The LLVM_NODISCARD is misplaced. The current code applies the attribute to the type (i.e. Context), and we want to apply to the function.
The first one is definitely less awkward and more widely used, so I suggest we stick to it |
25 ↗ | (On Diff #159888) | NIT: use lowerCamelCase for function names (per LLVM style guide) |
36 ↗ | (On Diff #159888) | It seems the contract for ErrorInfo actually requires this function to be implemented, so using llvm_unreachable doesn't seem right here. |
clangd/ClangdLSPServer.cpp | ||
76 ↗ | (On Diff #159888) |
Alternatively, could we simply pipe the json::Value to a raw_string_stream and get the pretty-printed string? Would avoid dealing with input validation issues. |
clangd/ClangdLSPServer.h | ||
170 ↗ | (On Diff #159888) | Maybe clarify in the comment that the key of the map is a serialized request-id? |
Sorry for missing this one. The biggest concern that I have is about the thread-safety of the API, it's too easy to misuse from multiple threads at this point.
We should define thread-safety guarantees for TaskHandle and CancellationToken and make sure they're not easily broken. The suggested way is to make both move-only types.
For CancellationToken we might need a non-thread-safe copy operation, though. But maybe make it a separate method (clone?) with a clear statement that it's not thread-safe?
Other than that just NITs.
clangd/Cancellation.cpp | ||
---|---|---|
17 ↗ | (On Diff #160531) | Having a shared_ptr key in the Context can cause data races (e.g. if we copy it concurrently from multiple threads). |
34 ↗ | (On Diff #160531) | I'm not sure if this function buys us much in terms of clarity, maybe remove it and inline everywhere? |
clangd/Cancellation.h | ||
9 ↗ | (On Diff #160531) | The comments seems to mention all the important bits, but maybe we could separate different concerns more clearly based on how the code should be used? Cancellation mechanism for async tasks. Roughly all the clients of this code can be classified into three categories: 1. The code that creates and schedules async tasks, e.g. TUScheduler. 2. The callers of the async method that can cancel some of the running tasks, e.g. `ClangdLSPServer` 3. The code running inside the async task itself, i.e. code completion or find definition implementation that run clang, etc. For (1), the guideline is to accept a callback for the result of async operation and return a `TaskHandle` to allow cancelling the request. TaskHandle someAsyncMethod(function<void(llvm::Expected<ResultType>)> Callback); The callers of async methods (2) can issue cancellations and should be prepared to handle `TaskCancelledError` result: TaskHandle H = someAsyncMethod([](llvm::Expected<ResultType> R) { if (/* code to check for TaskCancelledError */) llvm::errs() << "Task got cancelled"; else llvm::errs() << "The result is: " << *R; } }); sleep(5); H.cancel(); The worker code itself (3) should check for cancellations using `CancellationToken` that can be retrieved via `CancellationToken::isCancelled()`. void codeComplete() { const CancellationToken& CT = CancellationToken::isCancelled(); if (CT.isCancelled()) return llvm::makeError<TaskCancelledError>(); runSema(); if (CT.isCancelled()) return llvm::makeError<TaskCancelledError>(); queryIndex(); return results; } If the operation was cancelled before it could run to completion, it should propagate the TaskCancelledError as a result. |
60 ↗ | (On Diff #160531) | We're missing docs on this and the other classes. I suggest a small doc describing what they're used for, e.g. used for checking if the operation was cancelled and used to signal the operation should be cancelled, etc. Also noting the threading-safety guarantees is a probably a good idea. |
61 ↗ | (On Diff #160531) | NIT: move members to the end of the class to be consistent with the rest of clangd. |
67 ↗ | (On Diff #160531) | Can we make this ctor private? |
71 ↗ | (On Diff #160531) | I wonder if we should make TaskHandle move-only? The reasoning is that is can be easily used from multiple threads (since it's used by code dealing with async tasks anyway) and copying it concurrently is a data race. |
74 ↗ | (On Diff #160531) | NIT: maybe use a shorter name, e.g. TaskHandle::create? |
82 ↗ | (On Diff #160531) | Maybe remove CancellationHandler and put all its functions into the clangd namespace? It's somewhat easily confused with CancellationToken. |
89 ↗ | (On Diff #160531) | NIT: maybe use a shorter name: CancelledError? |
clangd/ClangdLSPServer.cpp | ||
355 ↗ | (On Diff #160531) | I expect the pattern if cancelled -> reply with ErrorCode::RequestCancelled else reply with ErrorCode::InvalidParams to be pretty common when we add cancellation to more operations. Maybe add a small helper that does that? E.g. /// Replies to a request with an error, passing proper error codes to the LSP client. /// Properly handles cancellations. void replyError(Error E); |
627 ↗ | (On Diff #160531) | NIT: use Lock or L for consistencty with the rest of clangd. it's somewhat easy to misread std::lock_guard<std::mutex> _(Mut) as std::lock_guard<std::mutex> (Mut) |
clangd/Cancellation.cpp | ||
---|---|---|
17 ↗ | (On Diff #160531) | As talked offline, copying std::shared_ptr is thread safe. |
- Resolve discussions.
- Get rid of CancellationHandler class.
- Change error class name.
- Improve documentation.
clangd/Cancellation.h | ||
---|---|---|
71 ↗ | (On Diff #160531) | As similar to above mentioned case copying doesn't introduce any data races. |
- Made TaskHandle move-only. Since it is costly and most likely unnecessary to copy it other than to move it into Context.
- Provided an explicit clone method for copying.
Mostly LSP-specific comments and comments about making TaskHandle thread-safe.
I'm also starting to wonder if the scope of this patch is too big, we could potentially split it into three separate bits:
- Generic cancellation API, i.e. Cancellation.h and unit-tests of the API,
- ClangdServer using the cancellation API.
- LSP-specific bits, i.e. cancelRequest etc.
Not sure if it's worth splitting the last two, but splitting out the Cancellation API in a separate change could make review simpler.
But also happy to review the whole thing and land after it's done, feel free to pick whatever suits you best.
clangd/Cancellation.cpp | ||
---|---|---|
17 ↗ | (On Diff #160531) | My mistake, thanks for explaining this one to me. |
clangd/Cancellation.h | ||
101 ↗ | (On Diff #160652) | s/asyn/async |
clangd/Cancellation.h | ||
---|---|---|
86 ↗ | (On Diff #160652) | NIT: Maybe replace 'Totally thread-safe' with just Thread-safe? The latter conveys the same information. |
90 ↗ | (On Diff #160652) | NIT: maybe use Token->load() instead of static_cast<bool>(Token)? Arguably, looks more concise. |
92 ↗ | (On Diff #160652) | It's a bit confusing that this name clashes with a member function. if (isCancelled()) // <-- get token from ctx, then use implicit conversion to check if it was cancelled return llvm::makeError<CancelledError>(); Which is probably a nice pattern. But we could have two functions that do the same thing with less magic (i.e. implicit conversions) involved at the call sites: CancellationToken getCurrentCancellation(); void setCurrentCancellation(CancellationToken CT); inline bool isCancelled() { return getCurrentCancellation().isCancelled(); } Would allow to get rid of implicit conversion and friend function with the same signature as member. Would also make cases where we actually want to stash the token somewhere cleaner, e.g. instead of CancellationToken CT = isCancelled(), we'll have CancellationToken CT = getCurrentCancellation(). |
104 ↗ | (On Diff #160652) | As discussed offline, maybe we should make TaskHandle move-only? |
108 ↗ | (On Diff #160652) | I wonder if we should have a createCancellationToken() method and a method to stash it in the context instead? |
clangd/ClangdLSPServer.cpp | ||
74 ↗ | (On Diff #160652) | NIT: return std::string, const does not buys us anything here. |
354 ↗ | (On Diff #160652) | NIT: remove braces around short, single-statement branches |
631 ↗ | (On Diff #160652) | Use a value type here, i.e. std::string instead of const std::string&. |
642 ↗ | (On Diff #160652) | Use a value type here, i.e. std::string instead of const std::string&. |
643 ↗ | (On Diff #160652) | NIT: remove these braces, they don't buy us anything here? |
clangd/ClangdLSPServer.h | ||
178 ↗ | (On Diff #160652) | I'm not sure what associated with only their thread means. |
179 ↗ | (On Diff #160652) | These two methods look like a nice suit for better wrapped in a RAII-object. It would add the value to the associated map on construction and remove on destruction. Maybe add a comment that calls to this function should always be paired and a FIXME that we should have an API that enforces this? |
clangd/ClangdServer.cpp | ||
167 ↗ | (On Diff #160652) | NIT: remove braces around single-statement branches |
clangd/JSONRPCDispatcher.cpp | ||
135 ↗ | (On Diff #160652) | Isn't there a form of handleErrors that can handle the second case in the last argument? |
clangd/JSONRPCDispatcher.h | ||
67 ↗ | (On Diff #160652) | Maybe adds docs explaining what this function does? |
115 ↗ | (On Diff #160652) | naming-nit: use lowerCamelCase for function names |
115 ↗ | (On Diff #160652) | I suggest we document this function and put to next to reply and replyError, since they are closely related (i.e. the latter put the current request-id into the replies). |
unittests/clangd/CancellationTests.cpp | ||
64 ↗ | (On Diff #160652) | Another interesting case to test: |
The last set of comments is for the previous version, might be missing some updates, sorry about that. Will make sure to review the one too!
clangd/Cancellation.h | ||
---|---|---|
92 ↗ | (On Diff #160652) | Totally agreed, actually, didn't even notice they had the same signature :D, thanks! |
clangd/ClangdLSPServer.h | ||
179 ↗ | (On Diff #160652) | Yeah that would be really nice to have them in a RAII object, but the thing is cleanup can occur only after the task dies. Which we don't directly know since threads are being detached. So the only way to trigger destruction is at the callback, which is where I call cleanup currently. Maybe we can store the object within context to trigger destructor at the end of the thread, but still don't think it would look any better or ease the job of the caller. |
Mostly NITs, but please take a look at the CancelParams handling problem. I believe there might be a potential bug hiding there :-)
clangd/Cancellation.h | ||
---|---|---|
87 ↗ | (On Diff #161018) | s/Can be created with clangd::isCancelled()/Tokens for the currently running task can be obtained via clangd::getCurrentCancellationToken/? Another idea: maybe add a reference no why someone would want to use CancellationToken instead of isCancelled() helper, e.g. /// If cancellation checks are rare, one could use the isCancelled() helper to simplify the code. /// However, if cancellation checks are frequent, the guideline is first obtain the CancellationToken for the currently running task with getCurrentCancellationToken() and do cancel checks using it to avoid extra lookups in the Context |
137 ↗ | (On Diff #161018) | s/got/was? |
clangd/ClangdLSPServer.cpp | ||
621 ↗ | (On Diff #161018) | Wouldn't it work incorrectly for string IDs? |
643 ↗ | (On Diff #161018) | Could we elog if insert indicated that a value already exists in a map? This would probably mean malformed user input, but could also be immensely helpful for debugging. |
clangd/ClangdLSPServer.h | ||
179 ↗ | (On Diff #160652) | Having these paired method calls is probably fine for the first iteration, but we should abstract away this pattern when supporting cancellations for more operations. |
clangd/ClangdServer.cpp | ||
152 ↗ | (On Diff #161018) | Maybe use a shorter name? E.g. Task or TH :-) |
153 ↗ | (On Diff #161018) | Instead of moving cancellation token around explicitly, we could stash in a context right away and it would nicely propagate into the thread. This looks like less boilerplate. auto CancellableTaskHandle = TaskHandle::create(); WithContext ContextWithCancellation(setCurrentCancellationToken(TaskHandle.createCancellationToken())); auto Task = [](File, Callback<CodeCompleteResult> CB, llvm::Expected<InputsAndPreamble> IP) { if (getCancellationToken().isCancelled()) return CB(make_error<CancelledError>()); }; /// rest of the code is the same |
178 ↗ | (On Diff #161018) | Thinking out loud: the TaskHandle and CancellationToken handling will probably eventually move into TUScheduler::runWithPreamble and other thread-spawning code to avoid this kind of boilerplate here. No need to change anything in this patch, though. |
clangd/Protocol.h | ||
871 ↗ | (On Diff #161018) | Maybe add a comment from the LSP spec to be consistent with the rest of the file? |
clangd/ClangdLSPServer.cpp | ||
---|---|---|
621 ↗ | (On Diff #161018) | You are right, that's exactly the case, thanks! |
clangd/ClangdLSPServer.cpp | ||
---|---|---|
80 ↗ | (On Diff #161261) | This still misses string escaping issues. E.g. " will get quoted as \", etc. |
clangd/ClangdServer.cpp | ||
166 ↗ | (On Diff #161261) | NIT: check for it even before looking at the preamble? |
- Check cancellation earlier && Fix normalization difference between string and integer request ids.
clangd/Cancellation.h | ||
---|---|---|
96 ↗ | (On Diff #161428) | As chatted offline, I have questions about the motivation of the CancellationToken class. Intuitively, it seems to me that this can be merged into TaskHandle, and we can simply stash the TaskHandle instead of a token into the context. There would be fewer states to maintain, and the design would be a bit more straightforward. I might be missing context/concerns here, so I'd like to hear what you and Ilya think. @ilya-biryukov |
clangd/ClangdLSPServer.cpp | ||
75 ↗ | (On Diff #161428) | Use ID.getAsString()? |
clangd/ClangdLSPServer.cpp | ||
---|---|---|
75 ↗ | (On Diff #161428) | Unfortunately id can be both a string or a number according to LSP specs therefore we normalize both cases into a string. |
clangd/Cancellation.h | ||
---|---|---|
96 ↗ | (On Diff #161428) | I am for splitting cancellation checks from cancellation triggers. |
clangd/Cancellation.h | ||
---|---|---|
96 ↗ | (On Diff #161428) | Couldn't we prevent this by passing only const handle references to users who are not expect to call cancel()? |
clangd/Cancellation.h | ||
---|---|---|
96 ↗ | (On Diff #161428) | As long as TaskHandle does not have a copy constructor, that LG.
This should give a simpler implementation.
In general, I think having our own wrappers there opens up more room for experiments with the API later (the API is very constrained, so it's easy to refactor the implementation). OTOH, if we believe the design is right or changing the clients is not too much work (which is probably true), either way is fine with me. But either solution has its pros and cons, I'm happy with trying out any of those on code completion, see how it goes and decide whether we want to change anything before finalizing our design and adding cancellation to all operations in clangd. |
LG, just a few last nits
clangd/Cancellation.cpp | ||
---|---|---|
30 ↗ | (On Diff #161716) | NIT: maybe consider inlining it? Since Task has a public default constructor, I don't think having this method buys us anything |
clangd/Cancellation.h | ||
22 ↗ | (On Diff #161716) | s/Taskr/Task |
23 ↗ | (On Diff #161716) | remove mentions of CancellationToken |
49 ↗ | (On Diff #161716) | s/CancellationToken/getCurrentTask().isCancelled() |
clangd/ClangdLSPServer.cpp | ||
76 ↗ | (On Diff #161716) | Maybe extract the relevant code into a helper to avoid creating the unrelated class (CancelParams)? |
621 ↗ | (On Diff #161716) | Invert condition to reduce nesting? |
Mostly NITs, but please take a look at the potential data race
clangd/Cancellation.cpp | ||
---|---|---|
23 ↗ | (On Diff #161908) | Maybe add 'assert' that TaskKey is present? |
clangd/Cancellation.h | ||
27 ↗ | (On Diff #161908) | s/propagete/propagate |
114 ↗ | (On Diff #161908) | Maybe add a comment that the current Context must be have the task stashed in it? |
clangd/ClangdLSPServer.cpp | ||
351 ↗ | (On Diff #161908) | CleanupTaskHandle() can run in a separate thread, so can potentially run before the StoreTaskHandle call. |
621 ↗ | (On Diff #161908) | s/it/It |
clangd/Protocol.cpp | ||
629 ↗ | (On Diff #161908) | Maybe switch on different kinds of json::Value instead of using the ObjectMapper? |
clangd/Protocol.h | ||
883 ↗ | (On Diff #161908) | Maybe simplify the signature to: Optional<string> parseNumberOrString(const llvm::json::Value&); Also, maybe call it normalizeRequestId? |
clangd/ClangdLSPServer.cpp | ||
---|---|---|
351 ↗ | (On Diff #161908) | Thanks! |
- Change helper for normalizing Number | String.
clangd/Protocol.h | ||
---|---|---|
883 ↗ | (On Diff #161908) | LSP has a few more places that can have paremeters of type number | string so I believe this one could be used with them as well. Therefore I think it is better to keep the name that way. The normalizeRequestID is a sepcial case and handled within ClangdLSPServer. |