Advanced pdf file recognition
We have a data set after .pdf parsing. The task is for the program to correctly recognize the parsing elements (text, tables, code, table of contents, headings, etc.). The main goal is to get a correctly structured document.