Historical documents with faded ink, stains, and irregular layouts caused standard OCR line detection to fail, resulting in poor text extraction accuracy.
Developed a hybrid approach combining traditional computer vision with deep learning to robustly identify text lines even in challenging conditions, with special handling for curved baselines and interlinear annotations.