Traditional rule-based column detection failed on historical newspapers with irregular layouts, warped scans, and complex multi-column formats, leading to poor OCR segmentation.
Applied 2D Fourier transforms to analyze the spatial frequency content of text regions. Regularly spaced columns show up as dominant peaks along the horizontal frequency axis, so columnar structure can be identified even in noisy or distorted scans. Combined this with adaptive thresholding to handle varying column widths.
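
The approach can be illustrated with a minimal Python sketch. It is not the original implementation: the function names, the synthetic test page, and the parameter values (`ratio`, the number of low vertical frequencies summed) are hypothetical. The adaptive threshold is assumed here to be a local-mean threshold on the vertical projection profile, since the specific variant is not stated above.

```python
import numpy as np


def make_synthetic_page(height=400, width=600, period=200, gutter=50, seed=0):
    """Hypothetical test page: dense random 'ink' in columns, sparse specks in gutters."""
    rng = np.random.default_rng(seed)
    x = np.arange(width)
    in_gutter = (x % period) >= (period - gutter)
    ink_prob = np.where(in_gutter, 0.02, 0.5)  # per-column ink probability
    return (rng.random((height, width)) < ink_prob).astype(float)


def estimate_column_period(page):
    """Estimate the column pitch in pixels from the 2D magnitude spectrum."""
    _, width = page.shape
    spectrum = np.abs(np.fft.fft2(page - page.mean()))
    # Vertical columns are nearly constant along y, so their energy sits at
    # vertical frequencies close to zero. Summing ky = 0, +/-1, +/-2 is an
    # assumed way to tolerate mild skew.
    low_ky = spectrum[[0, 1, 2, -2, -1]].sum(axis=0)
    positive_kx = low_ky[1 : width // 2]  # skip DC, keep positive frequencies
    k = int(np.argmax(positive_kx)) + 1
    return width / k


def find_gutters(page, period, ratio=0.5):
    """Mark x-ranges whose ink count falls well below the local average."""
    profile = page.sum(axis=0)  # vertical projection profile
    window = max(3, int(round(period)))
    # Moving average over roughly one column period; mode="same" zero-pads,
    # so the local mean is underestimated near the page edges.
    local_mean = np.convolve(profile, np.ones(window) / window, mode="same")
    is_gutter = profile < ratio * local_mean
    # Convert the boolean mask into (start, end_exclusive) runs.
    padded = np.concatenate(([False], is_gutter, [False]))
    edges = np.flatnonzero(padded[1:] != padded[:-1])
    return [(int(a), int(b)) for a, b in zip(edges[0::2], edges[1::2])]


page = make_synthetic_page()
period = estimate_column_period(page)
gutters = find_gutters(page, period)
print(period, gutters)
```

Because the threshold is relative to a local mean rather than a fixed global value, columns of different widths or ink densities on the same page are each compared against their own neighborhood. One caveat of this sketch: taking the single strongest peak assumes the fundamental dominates its harmonics, which may not hold when gutters are very narrow relative to the column width.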