Страница публикации

Heuristic Algorithm for Recovering a Physical Structure of Spreadsheet Header

Авторы: Paramonov V., Shigarov A., Vetrova V., Mikhailov A.

Журнал: Advances in Intelligent Systems and Computing

Том: 1050

Номер:

Год: 2020

Отчётный год: 2020

Издательство:

Местоположение издательства:

URL:

Аннотация: Tables in electronic documents (spreadsheets) contain large volumes of useful information about different domains. Efficient extraction of data from document tables plays a crucial role in its further usage including analysis and integration. The visual or logical structure of table elements might differ from its physical structure. Such differences cause difficulties for automated table processing and understanding. Automated correction from physical form to visual allows to simplify tables processing operations. In this paper, we propose a heuristic approach for transformation of tables’ header cells. The main goal of the proposed approach is to provide an algorithm and software tool for recovering a physical structure of a spreadsheet header. The proposed approach is illustrated by application to the Statistical Abstract of the United States (SAUS) dataset.

Индексируется WOS: 0

Индексируется Scopus: 1

Индексируется РИНЦ: 0

Публикация в печати: 0

Добавил в систему: