Consider program P, which runs on a 1GHz machine M in 10 seconds. An optimization is made to P, replacing all instances of multiplying a value by 4 (mult X, X, 4) with two instructions that set x = x + x twice. Call this new optimized program, P'. The CPI of a multiply instruction is 4, and the CPI of an 'add' is 1. After recompiling, the program now runs in 9 seconds on machine M. How many multiplies were replaced by the new compiler?