I want to OCR some scanned forms (filled by hand). It is the first time I do something serious using computer vision. So far I'm able to locate the squares containing digits for a date field:
Looking at the example handwritten digit dataset that comes with OpenCV, I see digits are centralized and resized to (20, 20):
Since this may be a fairly common problem, I'm wondering if the algorithm is already implemented in OpenCV (or numpy, scipy, etc) so I don't have to reinvent the wheel.
The question is: is there a built-in pipeline in Python in order to normalize the samples?


 
    