Some SSE instructions take one scalar input and produce one scalar output, for example sqrtss, rsqrtss, and rcpss. These instructions write only the low element and leave the upper bits of the output register unchanged, so I believe they have a dependency on the previous value of the output register.
Is it worth inserting an extra xorps on the output register before such an instruction to break that dependency when the output register is different from the input register?
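To make the merge behaviour concrete, here is a minimal sketch in C with SSE intrinsics. It only models what ends up in the output register; it does not measure anything, and a compiler is free to emit different instructions than the ones named in the comments. The helper names `sqrtss_merge`, `sqrtss_zeroed`, and `lane` are hypothetical, made up for this illustration.

```c
#include <xmmintrin.h>  /* SSE intrinsics: __m128, _mm_sqrt_ss, _mm_move_ss, ... */

/* Models the result of `sqrtss dst, src` when dst != src:
 * lane 0 becomes sqrt(src[0]); lanes 1..3 keep the OLD contents of dst.
 * The old dst value is therefore an input to the result. */
static __m128 sqrtss_merge(__m128 old_dst, __m128 src)
{
    __m128 root = _mm_sqrt_ss(src);     /* {sqrt(src[0]), src[1], src[2], src[3]} */
    return _mm_move_ss(old_dst, root);  /* {root[0], old_dst[1], old_dst[2], old_dst[3]} */
}

/* Models `xorps dst, dst` followed by `sqrtss dst, src`:
 * the upper lanes are zero, so nothing from the earlier dst survives. */
static __m128 sqrtss_zeroed(__m128 src)
{
    return sqrtss_merge(_mm_setzero_ps(), src);
}

/* Reads lane i (0..3) of a vector; helper for inspecting results. */
static float lane(__m128 v, int i)
{
    float out[4];
    _mm_storeu_ps(out, v);
    return out[i];
}
```

With an old destination holding lanes {1, 2, 3, 4} and a source whose low lane is 9, `sqrtss_merge` gives {3, 2, 3, 4}, while `sqrtss_zeroed` gives {3, 0, 0, 0}. The first result cannot be formed until the old destination is available, which is the dependency I am asking about.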