Intelligent Character Recognition for Mathematical Documents

Detects the mathematical equations in a document and converts them into LaTeX code, and converts the rest of the document into text.

Project pipeline

Mathematical documents contain equations that are not recognised by optical character recognition (OCR). Unlike plain text, an equation must be converted into equivalent LaTeX code to be interpreted properly. In this project, the regions containing mathematical equations are detected using a single shot detector. Each detected equation is cropped and converted into LaTeX code using an OCR model. The equation regions are then removed from the page, and the remaining text is passed to a separate OCR model to generate the text output.
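
The flow described above can be sketched in a few lines of Python. This is only an illustration of the crop-and-mask logic, not the project's actual code: the names `crop`, `mask_out`, and `run_pipeline` are hypothetical, the image is assumed to be a grayscale page stored as a list of pixel rows, boxes are assumed to be `(x_min, y_min, x_max, y_max)` with exclusive maxima, and the detector and the two OCR models are passed in as plain callables because their real interfaces are not specified here.

```python
from typing import Callable, Dict, List, Tuple

# Assumed representations (not taken from the project):
Image = List[List[int]]          # grayscale page as rows of pixel values
Box = Tuple[int, int, int, int]  # (x_min, y_min, x_max, y_max), max exclusive


def crop(image: Image, box: Box) -> Image:
    """Return the sub-image covered by one detected equation box."""
    x0, y0, x1, y1 = box
    return [row[x0:x1] for row in image[y0:y1]]


def mask_out(image: Image, boxes: List[Box], fill: int = 255) -> Image:
    """Return a copy of the page with every equation box painted over."""
    masked = [row[:] for row in image]  # copy so the input page is untouched
    height = len(masked)
    for x0, y0, x1, y1 in boxes:
        for y in range(max(0, y0), min(height, y1)):
            width = len(masked[y])
            for x in range(max(0, x0), min(width, x1)):
                masked[y][x] = fill
    return masked


def run_pipeline(
    image: Image,
    detect_equations: Callable[[Image], List[Box]],  # stands in for the detector
    equation_ocr: Callable[[Image], str],            # stands in for the LaTeX OCR model
    text_ocr: Callable[[Image], str],                # stands in for the text OCR model
) -> Dict[str, object]:
    """Detect equations, convert each crop to LaTeX, then OCR the remaining text."""
    boxes = detect_equations(image)
    latex = [equation_ocr(crop(image, box)) for box in boxes]
    text = text_ocr(mask_out(image, boxes))
    return {"equations": latex, "text": text}
```

Passing the models in as callables keeps the sketch independent of any particular detection or OCR library; in practice each callable would wrap the corresponding trained model.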