I understand that the x86 instruction for integer multiplication of two numbers (e.g. 64-bit operands) is MUL.
My question is: how is this operation generally implemented at the hardware level (for instance, on a modern Intel processor)? Also, does it execute in a single CPU cycle?
 
    