Details
Diff Detail
Event Timeline
lib/Target/AMDGPU/SIMemoryLegalizer.cpp | ||
---|---|---|
131 | Do vector buffer/flat/global/scratch and scalar buffer/flat all have the slc bit? And not DS? | |
349–350 | According to http://llvm.org/docs/LangRef.html#load-instruction : !nontemporal does not have any defined semantics for atomic loads. My reading of this is that it is allowed to have nontemporal on atomic loads, but the meaning of it is not defined by LLVM. If so an assert is not correct here. My suggestion is that AMDGPU target treats it the same as non-atomic. | |
389–390 | Same comment as for loads. | |
test/CodeGen/AMDGPU/memory-legalizer-nontemporal-load.ll | ||
2–13 | Add tests for private address space so scratch buffer instructions can be tested. Add tests with uniform values so scalar instructions are generated. For gfx9 could global/scratch instructions be generated? Does the LLVM IR validator allow NonTemporal on load atomic, if so add that? | |
test/CodeGen/AMDGPU/memory-legalizer-nontemporal-store.ll | ||
2–12 | Same comment as load. |
lib/Target/AMDGPU/SIMemoryLegalizer.cpp | ||
---|---|---|
349–350 | I'd prefer ignoring the non-temporal in the case of atomics. Note that one would have to resort to coding in IR to even get it applied; no languages have a means to request non-temporal atomic operations. |
lib/Target/AMDGPU/SIMemoryLegalizer.cpp | ||
---|---|---|
68 | Is this needed? Isn't it duplicating the information from AtomicOrdering != NotAtomic? | |
69 | Is having a default value a good thing? The only use seems to be for fences so why not simply pass in false? | |
80 | assert(Ordering != NotAtomic || IsNonTemporal)? | |
262–264 | What are examples of instructions that have multiple memory operands? Is is valid to treat them as system scope sequentially consistent atomic and !nontemporarl? Would it be clearer to call the constructor with the arguments explicit so can see what this is actually returning? Same comment for other places doing this. | |
348 | Should this be: if (MOI.IsNonTemporal) { assert(!MOI.IsAtomic); Since above comments are that nontemporal is not allowed on atomic. | |
349–350 | So another patch should update the LLVM IR documentation. The syntax for load/store atomic already does not list nontemprarl as an allowed attribute, but the text mentions it. And also add a validator test to enforce that non-temporal is not allowed on load/store/rmw atomic. | |
355 | Is there really a need for IsAtomic()? Isn't AtomicOrdering != NotAtomic giving the same information? | |
388 | Same comment as above. |
Address review feedback.
lib/Target/AMDGPU/SIMemoryLegalizer.cpp | ||
---|---|---|
262–264 | I don't know of any. |
lib/Target/AMDGPU/SIMemoryLegalizer.cpp | ||
---|---|---|
115 | It would be better to check if != 0. If you define an operand as an i1imm for example it is the sign extended constant value -1 |
lib/Target/AMDGPU/SIMemoryLegalizer.cpp | ||
---|---|---|
349–350 | Is a validator needed, or can we just update the documentation to state that any nontemporal on atomic operations is ignored? |
lib/Target/AMDGPU/SIMemoryLegalizer.cpp | ||
---|---|---|
349–350 | Currently the documentation gives the syntax for atomic instructions as not allowing the nontemporal attribute. So would seem that the validator should enforce that syntax, or the syntax should be changed. |
Address review feedback and split out cleanups discussed offline into separate changes.
lib/Target/AMDGPU/SIMemoryLegalizer.cpp | ||
---|---|---|
206 | If there are are no MMOs then the default ought to be false. | |
365–366 | Suggest removing "else" from these "if"s as each one handles a specific case and returns. So no need for "else if" since plain "if" will be sufficient. At this point want to give the unhandled error since the preceding "if"s should have handled all the cases, so putting it outside the else makes that clearer. Same comment for other expand functions. | |
371 | Suggest adding a comment along the lines of: // Atomic instructions do not have the nontemporal attribute. Same comment for other expand functions. |
Is this needed? Isn't it duplicating the information from AtomicOrdering != NotAtomic?