Those are quite large tests. How much time does it add to 'ninja check'? If it's reasonably short (I don't really have a number in mind for this) then LGTM. Otherwise we may need to think about trimming it down a bit
Also, could you double check the memory usage for this. Some of the bots have a fairly limited amount of memory so the 'ninja check' time will be amplified on those if it consumes a lot.