After OCR, text often contains stray line breaks and odd characters. Run it through GoodText to normalize paragraphs and spacing.
Use GoodText now →