The following lines were added (+) and removed (-):
Sometimes tesseract works better if the image is bigger, sharper, or has higher contract. Using a combination of imagemagick and tesseract we can get a more accurate OCR text file. convert -colorspace gray -fill white -resize 480% -sharpen 0x1 documentpage01.png documentpage01.jpg tesseract documentpage01.jpg documentpage01.txt