ocrxword
ocrxword is an open-source software project that aims to automate the digitization of crossword puzzles through optical character recognition. It provides tools to process images or scans of crosswords, extracting the grid structure and the associated clues into a structured, machine-readable format.
The core functionality includes grid detection and cell segmentation, OCR for letters and digits, and layout
The project employs a modular pipeline built with open-source components for image processing and text recognition.
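As an illustration of how one stage of such a pipeline might look, the following is a minimal sketch of cell segmentation in pure Python. It is not taken from the ocrxword code base: the function names (segment_cells, is_block, grid_layout), the threshold of 128, and the assumption that the grid has already been detected, deskewed, and cropped to a known number of rows and columns are all hypothetical.

```python
from typing import List

# Hypothetical representation: a grayscale image as rows of pixel values,
# where 0 is black and 255 is white.
Image = List[List[int]]


def segment_cells(image: Image, rows: int, cols: int) -> List[List[Image]]:
    """Split a cropped grid image into rows x cols equally sized cell images."""
    height, width = len(image), len(image[0])
    cell_h, cell_w = height // rows, width // cols
    return [
        [
            [line[c * cell_w:(c + 1) * cell_w]
             for line in image[r * cell_h:(r + 1) * cell_h]]
            for c in range(cols)
        ]
        for r in range(rows)
    ]


def is_block(cell: Image, threshold: int = 128) -> bool:
    """Treat a cell as a black block if its mean intensity is below threshold."""
    pixels = [p for line in cell for p in line]
    return sum(pixels) / len(pixels) < threshold


def grid_layout(image: Image, rows: int, cols: int) -> List[str]:
    """Return the grid as strings: '#' for a block, '.' for an open cell."""
    cells = segment_cells(image, rows, cols)
    return [
        "".join("#" if is_block(cell) else "." for cell in row)
        for row in cells
    ]


# A toy 4x4-pixel image representing a 2x2 grid with blocks on the diagonal.
toy_image = [
    [0, 0, 255, 255],
    [0, 0, 255, 255],
    [255, 255, 0, 0],
    [255, 255, 0, 0],
]
layout = grid_layout(toy_image, rows=2, cols=2)
```

In a real pipeline, the open cells found this way would be passed on to a separate recognition stage for letters and clue numbers; keeping each stage a plain function with simple inputs and outputs is one way to achieve the modularity described above.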
ocrxword is maintained by an international community of volunteers and hosted in a public repository, welcoming