Fixes mlopen 5x5 case.
This is a preview, not for submitting.
Failed tests:
LLVM :: CodeGen/AMDGPU/indirect-private-64.ll LLVM :: CodeGen/AMDGPU/insert_vector_elt.ll LLVM :: CodeGen/AMDGPU/private-element-size.ll
Paths
| Differential D19371
[AMDGPU] prohibit >4 bytes private memory access.
AbandonedPublic Authored by vpykhtin on Apr 21 2016, 10:03 AM.
Details
Summary Fixes mlopen 5x5 case. This is a preview, not for submitting. Failed tests: LLVM :: CodeGen/AMDGPU/indirect-private-64.ll LLVM :: CodeGen/AMDGPU/insert_vector_elt.ll LLVM :: CodeGen/AMDGPU/private-element-size.ll
Diff Detail Event Timelinevpykhtin updated this object. Comment Actions If this fixes correctness issue, commit please. Optimization can come after, if not along. Comment Actions
I think what Matt suggests is more reasonable here. I would wait for his patch.
Revision Contents
Diff 54521 include/llvm/Target/TargetLowering.h
lib/Target/AMDGPU/SIISelLowering.h
lib/Target/AMDGPU/SIISelLowering.cpp
|