This is an archive of the discontinued LLVM Phabricator instance.

Differential D116238

[mips] Add -mfix4300 flag to enable vr4300 mulmul bugfix pass
Closed · Public

Authored by Random06457 on Dec 23 2021, 1:48 PM.

Details

Reviewers

atanasyan

Commits

rG2edcde00cb39: [MIPS] Add -mfix4300 flag to enable vr4300 mulmul bugfix pass

Summary

Early revisions of the VR4300 have a hardware bug where two consecutive multiplications can produce an incorrect result in the second multiply.
This revision adds the -mfix4300 flag to LLVM (and Clang) which, when passed, enables a software workaround for this issue.

More precise description of the "mulmul" bug:

1: mul.[s,d] fd,fs,ft
2: mul.[s,d] fd,fs,ft  or  [D]MULT[U] rs,rt

When the above sequence is executed by the CPU, if at least one of the source operands of the first mul instruction happens to be sNaN, 0 or Infinity, then the second mul instruction may produce an incorrect result.
This can happen both if the two mul instructions are next to each other and if the first one is in a delay slot and the second is the first instruction of the branch target.

Description of the fix:
This fix adds a backend pass to LLVM that scans for mul instructions in each basic block and inserts a nop whenever both of the following conditions are met:

- The current instruction is a single or double-precision floating-point mul instruction.
- The next instruction is either a mul instruction (any kind) or a branch instruction.
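
The two conditions above can be modeled outside of LLVM as a small standalone C++ sketch. The `Op` enum, `fixMulMul`, and the vector-of-opcodes representation are assumptions made for illustration only; the real pass works on `MachineBasicBlock` instructions and `Mips::` opcodes.

```cpp
#include <cassert>
#include <cstddef>
#include <vector>

// Toy opcode set (hypothetical; these are not LLVM's Mips:: opcodes).
enum class Op { FMulS, FMulD, Mult, DMult, Branch, Add, Nop };

// True for the floating-point multiplies that can trigger the bug.
bool isFirstMul(Op op) { return op == Op::FMulS || op == Op::FMulD; }

// True for any multiply (floating-point or integer) or a branch.
bool isSecondMulOrBranch(Op op) {
  switch (op) {
  case Op::FMulS:
  case Op::FMulD:
  case Op::Mult:
  case Op::DMult:
  case Op::Branch:
    return true;
  default:
    return false;
  }
}

// Returns a copy of `block` with a Nop inserted after every floating-point
// multiply that is immediately followed by a multiply or a branch.
std::vector<Op> fixMulMul(const std::vector<Op> &block) {
  std::vector<Op> out;
  for (std::size_t i = 0; i < block.size(); ++i) {
    out.push_back(block[i]);
    if (i + 1 < block.size() && isFirstMul(block[i]) &&
        isSecondMulOrBranch(block[i + 1]))
      out.push_back(Op::Nop);
  }
  return out;
}
```

In this model an integer multiply in the first position never triggers an insertion, while an integer multiply in the second position does.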

Note:
I chose -mfix4300 as a name for the flag to follow the GCC nomenclature but I don't know if this is a good name.

Diff Detail

Repository: rG LLVM Github Monorepo

Event Timeline

Random06457 created this revision. Dec 23 2021, 1:48 PM

Herald added subscribers: dang, jrtc27, hiraditya and 3 others. Dec 23 2021, 1:48 PM

Random06457 requested review of this revision. Dec 23 2021, 1:48 PM

Herald added projects: Restricted Project, Restricted Project. Dec 23 2021, 1:48 PM

Herald added subscribers: llvm-commits, cfe-commits.

Harbormaster completed remote builds in B140550: Diff 396079. Dec 23 2021, 2:34 PM

Random06457 edited the summary of this revision. Dec 23 2021, 2:59 PM

atanasyan: Thanks for the patch. Some notes are below.

llvm/lib/Target/Mips/MipsMulMulBugPass.cpp
17	We can move this option to `MipsTargetMachine.cpp` and simply not add the pass when it is not needed.
20	Put this class into an anonymous namespace to reduce work for the linker.
32	Let's rename the function to `fixMulMulBB` and move it to the `private` section of the class.
37	I do not think it's a good idea to save `MipsInstrInfo` in a static field. AFAIK passes currently cannot run in parallel, but if that changes in the future the static field becomes a problem. I would get a reference to `MipsInstrInfo` in `runOnMachineFunction` and pass it to `FixMulMulBB` as a parameter.
38	Do you really need to keep a pointer to the `Subtarget` in the object?
49–50	These lines can be merged into a single one: `MipsII = MF.getSubtarget<MipsSubtarget>().getInstrInfo();`
53–56	This code can be made a bit more compact: `for (auto &MBB : MF) Modified |= FixMulMulBB(MBB);`
61	This function does not work with a null pointer, so change the argument's type to a reference.
74	Ditto.
96–104	`std::next` and iterator increment are cheap, so we can write the loop in a more idiomatic form: `for (MachineBasicBlock::instr_iterator MII = MBB.instr_begin(), E = MBB.instr_end(); MII != E; ++MII) { MachineBasicBlock::instr_iterator NextMII = std::next(MII); ...`
111	You do not need a new `MBB` variable. Use the `MBB` passed as an argument to `FixMulMulBB`.
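
The loop shape suggested in the review (advance with `++`, peek with `std::next`, insert before the peeked element) can be tried in isolation. The following is a minimal sketch in which `std::list<std::string>` stands in for the instruction list of a basic block; `insertNops` and the `needsNop` predicate are hypothetical names, not part of the patch. The predicate is passed in as a parameter instead of being read from a static, in the spirit of the comment on line 37.

```cpp
#include <cassert>
#include <iterator>
#include <list>
#include <string>

// Inserts "nop" between every adjacent pair (a, b) for which
// needsNop(a, b) returns true. Returns true if the list was changed.
template <typename Pred>
bool insertNops(std::list<std::string> &block, Pred needsNop) {
  bool modified = false;
  for (auto it = block.begin(), end = block.end(); it != end; ++it) {
    auto next = std::next(it);
    if (next != end && needsNop(*it, *next)) {
      // std::list::insert does not invalidate existing iterators, so `it`
      // and `end` stay valid; the following ++it steps onto the new "nop".
      block.insert(next, "nop");
      modified = true;
    }
  }
  return modified;
}
```

Because the iterator lands on the freshly inserted element, the predicate is evaluated once more with `"nop"` as the first argument, so it should return false for that case.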

This revision now requires changes to proceed. Dec 24 2021, 10:13 AM

Addressed the comments.
I also updated isFirstMul to exclude integer multiplications which do not produce the bug. That change made me fix the tests too.

LGTM. Thanks for the patch.

This revision is now accepted and ready to land. Dec 31 2021, 5:00 AM

This revision was landed with ongoing or failed builds. Dec 31 2021, 5:01 AM

Closed by commit rG2edcde00cb39: [MIPS] Add -mfix4300 flag to enable vr4300 mulmul bugfix pass (authored by Random06457, committed by atanasyan).

This revision was automatically updated to reflect the committed changes.

atanasyan added a commit: rG2edcde00cb39: [MIPS] Add -mfix4300 flag to enable vr4300 mulmul bugfix pass.

Random06457 mentioned this in D117615: [mips] fix -mfix4300. Jan 18 2022, 2:56 PM

atanasyan mentioned this in rGee198df2e14c: [mips] Improve vr4300 mulmul bugfix pass. Jan 20 2022, 9:10 AM

Revision Contents

Path                                          Size
clang/include/clang/Driver/Options.td         1 line
clang/lib/Driver/ToolChains/Clang.cpp         5 lines
llvm/lib/Target/Mips/CMakeLists.txt           1 line
llvm/lib/Target/Mips/Mips.h                   2 lines
llvm/lib/Target/Mips/MipsMulMulBugPass.cpp    134 lines
llvm/lib/Target/Mips/MipsTargetMachine.cpp    10 lines
llvm/test/CodeGen/Mips/vr4300-mulbranch.ll    27 lines
llvm/test/CodeGen/Mips/vr4300-mulmul.ll       24 lines

Diff 396774

clang/include/clang/Driver/Options.td

[3,611 lines not shown]
 def mxgot : Flag<["-"], "mxgot">, Group<m_mips_Features_Group>;
 def mno_xgot : Flag<["-"], "mno-xgot">, Group<m_mips_Features_Group>;
 def mldc1_sdc1 : Flag<["-"], "mldc1-sdc1">, Group<m_mips_Features_Group>;
 def mno_ldc1_sdc1 : Flag<["-"], "mno-ldc1-sdc1">, Group<m_mips_Features_Group>;
 def mcheck_zero_division : Flag<["-"], "mcheck-zero-division">,
   Group<m_mips_Features_Group>;
 def mno_check_zero_division : Flag<["-"], "mno-check-zero-division">,
   Group<m_mips_Features_Group>;
+def mfix4300 : Flag<["-"], "mfix4300">, Group<m_mips_Features_Group>;
 def mcompact_branches_EQ : Joined<["-"], "mcompact-branches=">,
   Group<m_mips_Features_Group>;
 def mbranch_likely : Flag<["-"], "mbranch-likely">, Group<m_Group>,
   IgnoredGCCCompat;
 def mno_branch_likely : Flag<["-"], "mno-branch-likely">, Group<m_Group>,
   IgnoredGCCCompat;
 def mindirect_jump_EQ : Joined<["-"], "mindirect-jump=">,
   Group<m_mips_Features_Group>,
[2,874 lines not shown]

clang/lib/Driver/ToolChains/Clang.cpp

[1,923 lines not shown] void Clang::AddMIPSTargetArgs(const ArgList &Args,
   if (Arg *A = Args.getLastArg(options::OPT_mcheck_zero_division,
                                options::OPT_mno_check_zero_division)) {
     if (A->getOption().matches(options::OPT_mno_check_zero_division)) {
       CmdArgs.push_back("-mllvm");
       CmdArgs.push_back("-mno-check-zero-division");
     }
   }

+  if (Arg *A = Args.getLastArg(options::OPT_mfix4300)) {
+    CmdArgs.push_back("-mllvm");
+    CmdArgs.push_back("-mfix4300");
+  }
+
   if (Arg *A = Args.getLastArg(options::OPT_G)) {
     StringRef v = A->getValue();
     CmdArgs.push_back("-mllvm");
     CmdArgs.push_back(Args.MakeArgString("-mips-ssection-threshold=" + v));
     A->claim();
   }

   Arg *GPOpt = Args.getLastArg(options::OPT_mgpopt, options::OPT_mno_gpopt);
[6,153 lines not shown]

llvm/lib/Target/Mips/CMakeLists.txt

[53 lines not shown] add_llvm_target(MipsCodeGen
   MipsSEInstrInfo.cpp
   MipsSEISelDAGToDAG.cpp
   MipsSEISelLowering.cpp
   MipsSERegisterInfo.cpp
   MipsSubtarget.cpp
   MipsTargetMachine.cpp
   MipsTargetObjectFile.cpp
   MicroMipsSizeReduction.cpp
+  MipsMulMulBugPass.cpp

   LINK_COMPONENTS
   Analysis
   AsmPrinter
   CodeGen
   Core
   MC
   MipsDesc
[14 lines not shown]

llvm/lib/Target/Mips/Mips.h

[32 lines not shown] namespace llvm {
   FunctionPass *createMipsModuleISelDagPass();
   FunctionPass *createMipsOptimizePICCallPass();
   FunctionPass *createMipsDelaySlotFillerPass();
   FunctionPass *createMipsBranchExpansion();
   FunctionPass *createMipsConstantIslandPass();
   FunctionPass *createMicroMipsSizeReducePass();
   FunctionPass *createMipsExpandPseudoPass();
   FunctionPass *createMipsPreLegalizeCombiner();
+  FunctionPass *createMipsMulMulBugPass();

   InstructionSelector *createMipsInstructionSelector(const MipsTargetMachine &,
                                                      MipsSubtarget &,
                                                      MipsRegisterBankInfo &);

   void initializeMipsDelaySlotFillerPass(PassRegistry &);
   void initializeMipsBranchExpansionPass(PassRegistry &);
   void initializeMicroMipsSizeReducePass(PassRegistry &);
   void initializeMipsPreLegalizerCombinerPass(PassRegistry&);
+  void initializeMipsMulMulBugFixPass(PassRegistry&);
 } // end namespace llvm;

 #endif

llvm/lib/Target/Mips/MipsMulMulBugPass.cpp

This file was added.

//===- MipsMulMulBugPass.cpp - Mips VR4300 mulmul bugfix pass -------------===//
//
// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.
// See https://llvm.org/LICENSE.txt for license information.
// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception
//
//===----------------------------------------------------------------------===//
//
// Early revisions of the VR4300 have a hardware bug where two consecutive
// multiplications can produce an incorrect result in the second multiply.
//
// This pass scans for mul instructions in each basic block and inserts
// a nop whenever the following conditions are met:
//
// - The current instruction is a single or double-precision floating-point
//   mul instruction.
// - The next instruction is either a mul instruction (any kind)
//   or a branch instruction.
//===----------------------------------------------------------------------===//

#include "Mips.h"
#include "MipsInstrInfo.h"
#include "MipsSubtarget.h"
#include "llvm/CodeGen/MachineBasicBlock.h"
#include "llvm/CodeGen/MachineFunction.h"
#include "llvm/CodeGen/MachineFunctionPass.h"
#include "llvm/Support/CommandLine.h"
#include "llvm/Support/Debug.h"
#include "llvm/Target/TargetMachine.h"

#define DEBUG_TYPE "mips-vr4300-mulmul-fix"

using namespace llvm;

namespace {

class MipsMulMulBugFix : public MachineFunctionPass {
public:
  MipsMulMulBugFix() : MachineFunctionPass(ID) {
    initializeMipsMulMulBugFixPass(*PassRegistry::getPassRegistry());
  }

  StringRef getPassName() const override { return "Mips VR4300 mulmul bugfix"; }

  MachineFunctionProperties getRequiredProperties() const override {
    return MachineFunctionProperties().set(
        MachineFunctionProperties::Property::NoVRegs);
  }

  bool runOnMachineFunction(MachineFunction &MF) override;

  static char ID;

private:
  bool fixMulMulBB(MachineBasicBlock &MBB, const MipsInstrInfo &MipsII);
};

} // namespace

INITIALIZE_PASS(MipsMulMulBugFix, "mips-vr4300-mulmul-fix",
                "Mips VR4300 mulmul bugfix", false, false)

char MipsMulMulBugFix::ID = 0;

bool MipsMulMulBugFix::runOnMachineFunction(MachineFunction &MF) {
  const MipsInstrInfo &MipsII =
      *static_cast<const MipsInstrInfo *>(MF.getSubtarget().getInstrInfo());

  bool Modified = false;

  for (auto &MBB : MF)
    Modified |= fixMulMulBB(MBB, MipsII);

  return Modified;
}

static bool isFirstMul(const MachineInstr &MI) {
  switch (MI.getOpcode()) {
  case Mips::FMUL_S:
  case Mips::FMUL_D:
  case Mips::FMUL_D32:
  case Mips::FMUL_D64:
    return true;
  default:
    return false;
  }
}

static bool isSecondMulOrBranch(const MachineInstr &MI) {
  if (MI.isBranch() || MI.isIndirectBranch() || MI.isCall())
    return true;

  switch (MI.getOpcode()) {
  case Mips::MUL:
  case Mips::FMUL_S:
  case Mips::FMUL_D:
  case Mips::FMUL_D32:
  case Mips::FMUL_D64:
  case Mips::MULT:
  case Mips::MULTu:
  case Mips::DMULT:
  case Mips::DMULTu:
    return true;
  default:
    return false;
  }
}

bool MipsMulMulBugFix::fixMulMulBB(MachineBasicBlock &MBB,
                                   const MipsInstrInfo &MipsII) {
  bool Modified = false;

  // Iterate through the instructions in the basic block
  for (MachineBasicBlock::instr_iterator MII = MBB.instr_begin(),
                                         E = MBB.instr_end();
       MII != E; ++MII) {

    MachineBasicBlock::instr_iterator NextMII = std::next(MII);

    // Trigger when the current instruction is a mul and the next instruction
    // is either a mul or a branch, in case the branch target starts with a mul
    if (NextMII != E && isFirstMul(*MII) && isSecondMulOrBranch(*NextMII)) {
      LLVM_DEBUG(dbgs() << "Found mulmul!");

      const MCInstrDesc &NewMCID = MipsII.get(Mips::NOP);
      BuildMI(MBB, NextMII, DebugLoc(), NewMCID);
      Modified = true;
    }
  }

  return Modified;
}

FunctionPass *llvm::createMipsMulMulBugPass() { return new MipsMulMulBugFix(); }

llvm/lib/Target/Mips/MipsTargetMachine.cpp

[39 lines not shown]
 #include "llvm/Support/raw_ostream.h"
 #include "llvm/Target/TargetOptions.h"
 #include <string>

 using namespace llvm;

 #define DEBUG_TYPE "mips"

+static cl::opt<bool>
+    EnableMulMulFix("mfix4300", cl::init(false),
+                    cl::desc("Enable the VR4300 mulmul bug fix."), cl::Hidden);
+
 extern "C" LLVM_EXTERNAL_VISIBILITY void LLVMInitializeMipsTarget() {
   // Register the target.
   RegisterTargetMachine<MipsebTargetMachine> X(getTheMipsTarget());
   RegisterTargetMachine<MipselTargetMachine> Y(getTheMipselTarget());
   RegisterTargetMachine<MipsebTargetMachine> A(getTheMips64Target());
   RegisterTargetMachine<MipselTargetMachine> B(getTheMips64elTarget());

   PassRegistry *PR = PassRegistry::getPassRegistry();
   initializeGlobalISel(*PR);
   initializeMipsDelaySlotFillerPass(*PR);
   initializeMipsBranchExpansionPass(*PR);
   initializeMicroMipsSizeReducePass(*PR);
   initializeMipsPreLegalizerCombinerPass(*PR);
+  initializeMipsMulMulBugFixPass(*PR);
 }

 static std::string computeDataLayout(const Triple &TT, StringRef CPU,
                                      const TargetOptions &Options,
                                      bool isLittle) {
   std::string Ret;
   MipsABIInfo ABI = MipsABIInfo::computeTargetABI(TT, CPU, Options.MCOptions);

[218 lines not shown]
 void MipsPassConfig::addPreEmitPass() {
   // Expand pseudo instructions that are sensitive to register allocation.
   addPass(createMipsExpandPseudoPass());

   // The microMIPS size reduction pass performs instruction reselection for
   // instructions which can be remapped to a 16 bit instruction.
   addPass(createMicroMipsSizeReducePass());

+  // This pass inserts a nop instruction between two back-to-back multiplication
+  // instructions when the "mfix4300" flag is passed.
+  if (EnableMulMulFix)
+    addPass(createMipsMulMulBugPass());
+
   // The delay slot filler pass can potientially create forbidden slot hazards
   // for MIPSR6 and therefore it should go before MipsBranchExpansion pass.
   addPass(createMipsDelaySlotFillerPass());

   // This pass expands branches and takes care about the forbidden slot hazards.
   // Expanding branches may potentially create forbidden slot hazards for
   // MIPSR6, and fixing such hazard may potentially break a branch by extending
   // its offset out of range. That's why this pass combine these two tasks, and
[32 lines not shown]

llvm/test/CodeGen/Mips/vr4300-mulbranch.ll

This file was added.

; RUN: llc -march=mips -mfix4300 -verify-machineinstrs < %s | FileCheck %s

; Function Attrs: nounwind
define dso_local void @fun_s(float %a) local_unnamed_addr #0 {
entry:
; CHECK-LABEL: fun_s
; CHECK: mul.s
; CHECK-NEXT: nop
  %mul = fmul float %a, %a
  tail call void @foo_s(float %mul) #2
  ret void
}

declare dso_local void @foo_s(float) local_unnamed_addr #1

; Function Attrs: nounwind
define dso_local void @fun_d(double %a) local_unnamed_addr #0 {
entry:
; CHECK-LABEL: fun_d
; CHECK: mul.d
; CHECK-NEXT: nop
  %mul = fmul double %a, %a
  tail call void @foo_d(double %mul) #2
  ret void
}

declare dso_local void @foo_d(double) local_unnamed_addr #1

llvm/test/CodeGen/Mips/vr4300-mulmul.ll

This file was added.

; RUN: llc -march=mips -mfix4300 -verify-machineinstrs < %s | FileCheck %s

; Function Attrs: mustprogress nofree norecurse nosync nounwind readnone willreturn
define dso_local float @fun_s(float %x) local_unnamed_addr #0 {
entry:
; CHECK-LABEL: fun_s
; CHECK: mul.s
; CHECK-NEXT: nop
; CHECK: mul.s
  %mul = fmul float %x, %x
  %mul1 = fmul float %mul, %x
  ret float %mul1
}

define dso_local double @fun_d(double %x) local_unnamed_addr #0 {
entry:
; CHECK-LABEL: fun_d
; CHECK: mul.d
; CHECK-NEXT: nop
; CHECK: mul.d
  %mul = fmul double %x, %x
  %mul1 = fmul double %mul, %x
  ret double %mul1
}